3260 papers • 126 benchmarks • 313 datasets
The goal of Question Generation is to generate a valid and fluent question given a passage and a target answer. Question Generation can be used in many scenarios, such as automatic tutoring systems, improving the performance of Question Answering models, and enabling chatbots to lead a conversation. Source: Generating Highly Relevant Questions
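The answer-aware setup described above is usually framed as sequence-to-sequence learning: the target answer and the passage are packed into a single input string that a trained model maps to a question. A minimal sketch, assuming a common `answer: … context: …` input convention; the template "generator" below is only a stand-in so the example runs without a trained model.

```python
def format_qg_input(answer: str, passage: str) -> str:
    """Pack answer and passage into one seq2seq input string
    (an assumed convention, not a fixed standard)."""
    return f"answer: {answer} context: {passage}"

def toy_generate_question(answer: str, passage: str) -> str:
    """Stand-in for a trained seq2seq QG model (illustration only).
    A real model would condition on format_qg_input(answer, passage)."""
    return f"What does the passage say about {answer}?"

passage = "Paris is the capital of France."
answer = "Paris"
print(format_qg_input(answer, passage))
print(toy_generate_question(answer, passage))
```

In practice the stand-in would be replaced by a fine-tuned encoder-decoder model; the input-formatting step stays the same.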
(Image credit: Papersgraph)
These leaderboards are used to track progress in Question Generation.
Use these libraries to find Question Generation models and implementations.
This systematic study compares pre-training objectives, architectures, unlabeled datasets, transfer approaches, and other factors on dozens of language understanding tasks and achieves state-of-the-art results on many benchmarks covering summarization, question answering, text classification, and more.
Diverse Beam Search (DBS) is proposed as an alternative to beam search (BS): it decodes a list of diverse outputs by optimizing a diversity-augmented objective, and it consistently outperforms BS and previously proposed techniques for diverse decoding from neural sequence models.
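The core idea can be sketched in a few lines: beams are split into groups that decode in order, and each token's score is penalized by how often earlier groups already chose it at the same time step (Hamming diversity). A minimal sketch with a toy, deterministic "language model"; the vocabulary, scores, and function names are assumptions for illustration, not the paper's implementation.

```python
# Toy next-token log-probs given a prefix (illustration only).
def toy_logprobs(prefix):
    base = {"a": -0.5, "b": -0.9, "c": -1.4, "d": -2.0}
    if prefix and prefix[-1] == "a":
        # Mildly prefix-dependent scores so beams can diverge.
        base = {"a": -2.0, "b": -0.6, "c": -0.8, "d": -1.5}
    return base

def diverse_beam_search(num_groups=2, beams_per_group=2, steps=3, lam=1.0):
    """Hamming-diversity DBS sketch: groups decode in order, and each
    candidate token is penalized by lam times the number of earlier
    groups that already picked it at this time step."""
    groups = [[([], 0.0)] for _ in range(num_groups)]  # (tokens, score)
    for t in range(steps):
        chosen_at_t = []  # tokens picked by earlier groups at step t
        for g in range(num_groups):
            candidates = []
            for seq, score in groups[g]:
                for tok, lp in toy_logprobs(seq).items():
                    penalty = lam * chosen_at_t.count(tok)  # diversity term
                    candidates.append((seq + [tok], score + lp - penalty))
            candidates.sort(key=lambda x: x[1], reverse=True)
            groups[g] = candidates[:beams_per_group]
            chosen_at_t += [seq[-1] for seq, _ in groups[g]]
    return [(" ".join(seq), round(score, 2))
            for grp in groups for seq, score in grp]

for seq, score in diverse_beam_search():
    print(seq, score)
```

With the toy scores above, the first group greedily prefers "a…" while the penalty pushes the second group toward a different opening token, which is exactly the behavior the diversity-augmented objective is after.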
An attention-based sequence learning model for the task is investigated, along with the effect of encoding sentence- vs. paragraph-level information; results show that the system significantly outperforms a state-of-the-art rule-based system.
A new Unified pre-trained Language Model (UniLM) is presented that can be fine-tuned for both natural language understanding and generation tasks; it compares favorably with BERT on the GLUE benchmark and on the SQuAD 2.0 and CoQA question answering tasks.
A recurrent neural model is proposed that generates natural-language questions from documents, conditioned on answers; the model is fine-tuned using policy gradient techniques to maximize several rewards that measure question quality.
A preliminary study on neural question generation from text is conducted with the SQuAD dataset, and experimental results show that the method can produce fluent and diverse questions.
A new sequence-to-sequence pre-training model called ProphetNet is presented, which introduces a novel self-supervised objective, future n-gram prediction, together with a proposed n-stream self-attention mechanism; at each time step the model predicts the next n tokens simultaneously based on previous context tokens.
A novel method of generating synthetic question answering corpora is introduced by combining models of question generation and answer extraction, and by filtering the results to ensure roundtrip consistency, establishing new state-of-the-art results on SQuAD 2.0 and Natural Questions (NQ).
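Roundtrip-consistency filtering can be sketched as: keep a synthetic (question, answer) pair only if a QA model, asked the generated question over the same passage, recovers the original answer. A minimal sketch with stub models (the stub functions are assumptions for illustration, not the paper's models):

```python
def roundtrip_filter(examples, qg_model, qa_model):
    """Keep (passage, answer) pairs whose generated question,
    when answered by qa_model, returns the original answer."""
    kept = []
    for passage, answer in examples:
        question = qg_model(passage, answer)
        if qa_model(passage, question) == answer:  # roundtrip check
            kept.append((passage, question, answer))
    return kept

# Stub models for illustration only.
def stub_qg(passage, answer):
    return f"Which entity is described here: {answer}?"

def stub_qa(passage, question):
    # "Answers" correctly only when the entity appears in the passage.
    entity = question.split(": ")[-1].rstrip("?")
    return entity if entity in passage else "unknown"

data = [("Paris is the capital of France.", "Paris"),
        ("Paris is the capital of France.", "Berlin")]
print(roundtrip_filter(data, stub_qg, stub_qa))
```

Only the consistent pair survives the filter; the inconsistent ("Berlin") pair is discarded, which is how the method screens noisy synthetic data.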
The QG model, fine-tuned from GPT-2 Small, outperforms several paragraph-level QG baselines on the SQuAD dataset by 0.95 METEOR points; its generated questions are rated as easy to answer, relevant to their context paragraph, and corresponding well to natural human speech.
ERNIE-GEN, an enhanced multi-flow sequence-to-sequence pre-training and fine-tuning framework, is proposed; it bridges the discrepancy between training and inference with an infilling generation mechanism and a noise-aware generation method that make generation closer to human writing patterns.