Wikipedia Biography Dataset
Introduced in "Neural Text Generation from Structured Data with Application to the Biography Domain" (2015)
This dataset gathers 728,321 biographies from English Wikipedia. It is intended for evaluating text generation algorithms. For each article, we provide the first paragraph and the infobox (both tokenized).
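As a rough illustration of how one article could be held in memory, the sketch below pairs a tokenized infobox with a tokenized first paragraph. The `BiographyRecord` class, its field names, the `infobox_token_coverage` helper, and the example record are all hypothetical; they are not the dataset's actual file format or any official loader.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class BiographyRecord:
    """Hypothetical in-memory form of one article (not the on-disk format)."""

    infobox: dict[str, list[str]]  # infobox field name -> tokenized value
    first_paragraph: list[str]     # tokenized first paragraph


def infobox_token_coverage(record: BiographyRecord) -> float:
    """Return the fraction of paragraph tokens that also occur in an infobox value."""
    if not record.first_paragraph:
        return 0.0
    infobox_tokens = {tok for value in record.infobox.values() for tok in value}
    hits = sum(tok in infobox_tokens for tok in record.first_paragraph)
    return hits / len(record.first_paragraph)


# Made-up record, for illustration only.
example = BiographyRecord(
    infobox={"name": ["jane", "doe"], "occupation": ["painter"]},
    first_paragraph=["jane", "doe", "was", "a", "painter", "."],
)
coverage = infobox_token_coverage(example)
```

A simple overlap measure like this gives a quick sense of how much of the paragraph can be copied from the infobox, which is the kind of signal a structured-data-to-text generator relies on.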