3260 papers • 126 benchmarks • 313 datasets
Automatic disentanglement could be used to provide more interpretable results when searching over chat logs, and to help users understand what is happening when they join a channel. Source: Kummerfeld et al.
(Image credit: Papersgraph)
These leaderboards are used to track progress in conversation-disentanglement-14
Use these libraries to find conversation-disentanglement-14 models and implementations
No subtasks available.
A new dataset of 77,563 messages manually annotated with reply-structure graphs that both disentangle conversations and define internal conversation structure is created, which is 16 times larger than all previously released datasets combined, the first to include adjudication of annotation disagreements, and theFirst to include context.
A new model, named Dialogue BERT (DialBERT), is proposed, which integrates local and global semantics in a single stream of messages to disentangle the conversations that mixed together and achieves a state-of-the-art result on the a new dataset proposed by IBM.
Experimental results on the large Movie Dialogue Dataset demonstrate that the proposed approach achieves competitive performance compared to previous supervised methods and can promote the performance on the downstream task of multi-party response selection.
We present a new dataset for studying conversation disentanglement in movies and TV series. While previous work has focused on conversation disentanglement in IRC chatroom dialogues, movies and TV shows provide a space for studying complex pragmatic patterns of floor and topic change in face-to-face multi-party interactions. In this work, we draw on theoretical research in sociolinguistics, sociology, and film studies to operationalize a conversational thread (including the notion of a floor change) in dramatic texts, and use that definition to annotate a dataset of 10,033 dialogue turns (comprising 2,209 threads) from 831 movies. We compare the performance of several disentanglement models on this dramatic dataset, and apply the best-performing model to disentangle 808 movies. We see that, contrary to expectation, average thread lengths do not decrease significantly over the past 40 years, and characters portrayed by actors who are women, while underrepresented, initiate more new conversational threads relative to their speaking time.
Adding a benchmark result helps the community track progress.