
WNLI

3260 papers • 126 benchmarks • 313 datasets





Most implemented papers

A Hybrid Neural Network Model for Commonsense Reasoning

Jianfeng Gao, Weizhu Chen, Xiaodong Liu, Pengcheng He • Fri Jul 26 2019

An ablation study shows that language models and semantic similarity models are complementary approaches to commonsense reasoning, and that the proposed hybrid neural network (HNN) effectively combines the strengths of both.


WikiCREM: A Large Unsupervised Corpus for Coreference Resolution

Phil Blunsom, Oana-Maria Camburu, Thomas Lukasiewicz, Vid Kocijan, Ana-Maria Cretu, Yordan Yordanov • Tue Aug 20 2019

This work introduces WikiCREM (Wikipedia CoREferences Masked), a large-scale yet accurate dataset of pronoun disambiguation instances, and uses a language-model-based approach to pronoun resolution in combination with this dataset, beating previous state-of-the-art approaches on 6 out of 7 datasets.


Time Travel in LLMs: Tracing Data Contamination in Large Language Models

M. Surdeanu, Shahriar Golchin • Tue Aug 15 2023

The best method achieves an accuracy between 92% and 100% in detecting whether an LLM is contaminated with any of seven datasets, each containing train and test/validation partitions, when compared against manual evaluation by human experts.

