3260 papers • 126 benchmarks • 313 datasets
The cloze task refers to infilling individual words.
(Image credit: Papersgraph)
These leaderboards are used to track progress in cloze-test-7
Use these libraries to find cloze-test-7 models and implementations
No subtasks available.
The BIDAF network is introduced, a multi-stage hierarchical process that represents the context at different levels of granularity and uses bi-directional attention flow mechanism to obtain a query-aware context representation without early summarization.
Experimental results show that ERNIE outperforms other baseline methods, achieving new state-of-the-art results on five Chinese natural language processing tasks including natural language inference, semantic similarity, named entity recognition, sentiment analysis and question answering.
CPM, with 2.6 billion parameters and 100GB Chinese training data, is the largest Chinese pre-trained language model, which could facilitate several downstream Chinese NLP tasks, such as conversation, essay generation, cloze test, and language understanding.
This paper introduces CodeXGLUE, a benchmark dataset to foster machine learning research for program understanding and generation that includes a collection of 10 tasks across 14 datasets and a platform for model evaluation and comparison.
This work presents a system based on a deep learning architecture combined with a rich set of manually-crafted linguistic features that outperforms all known baselines for the Story Cloze test, suggesting that the chosen approach is promising.
A novel method, tracking various semantic aspects with external neural memory chains while encouraging each to focus on a particular semantic aspect of the narrative, demonstrates superior performance to a collection of competitive baselines, setting a new state of the art.
A neural recommendation model for Chengyu, which is a special type of Chinese idiom, is presented, which achieves 89.5% accuracy on cloze test and outperforms human subjects who attended competitive universities in China.
This work introduces the new task of inferring what is the advice-seeking goal behind a personal narrative, and uses human annotation to determine the degree to which the task relies on common sense and social intuition in addition to a semantic understanding of the narrative.
A large-scale Chinese cloze test dataset ChID is proposed, which studies the comprehension of idiom, a unique language phenomenon in Chinese, in which the idioms in a passage are replaced by blank symbols and the correct answer needs to be chosen from well-designed candidate idioms.
Adding a benchmark result helps the community track progress.