Word sense induction (WSI) is widely known as the unsupervised version of word sense disambiguation (WSD). The problem is stated as follows: given a target word (e.g., “cold”) and a collection of sentences that use it (e.g., “I caught a cold”, “The weather is cold”), cluster the sentences according to the different senses/meanings of the word. We do not need to know the sense/meaning of each cluster, but all sentences inside a cluster should use the target word in the same sense. Description from NLP Progress.
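As a minimal illustration of the task itself (not of any particular published system), one can represent each usage of the target word by its surrounding context and cluster those representations. The sentences, the TF-IDF vectorizer, and the cluster count below are all illustrative choices:

```python
# Toy WSI sketch: cluster usages of the ambiguous word "cold" by their
# bag-of-words contexts. Real systems typically use contextual embeddings;
# TF-IDF + k-means keeps this example self-contained.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.cluster import KMeans

sentences = [
    "I caught a cold and stayed in bed",
    "The weather is cold in winter",
    "She has a bad cold and a fever",
    "Bring a coat because the wind is cold",
]

# Represent each usage by its context, i.e. every word except the target.
contexts = [" ".join(w for w in s.lower().split() if w != "cold")
            for s in sentences]

vectors = TfidfVectorizer().fit_transform(contexts)
labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(vectors)

for sentence, label in zip(sentences, labels):
    print(label, sentence)  # usages sharing a label share an induced sense
```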
These leaderboards are used to track progress in Word Sense Induction.
Use these libraries to find Word Sense Induction models and implementations.
This paper proposes the Adaptive Skip-gram model, a nonparametric Bayesian extension of Skip-gram capable of automatically learning the required number of representations for each word at the desired semantic resolution; it derives an efficient online variational learning algorithm for the model and empirically demonstrates its effectiveness on the word sense induction task.
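The variational learning procedure is beyond a short snippet, but the core representational idea, keeping several vectors per word and selecting the one that best matches the current context, can be sketched as follows; all vectors here are random placeholders:

```python
# Illustrative only: multi-prototype representations with context-based sense
# selection, the idea underlying models like Adaptive Skip-gram. The learned
# number of senses and the actual training algorithm are not reproduced here.
import numpy as np

rng = np.random.default_rng(0)
dim = 50
sense_vectors = {"cold": rng.normal(size=(3, dim))}  # 3 placeholder senses
word_vectors = {w: rng.normal(size=dim) for w in ["caught", "fever", "wind"]}

def pick_sense(word, context_words):
    """Return the index of the sense vector that best fits the context."""
    ctx = np.mean([word_vectors[w] for w in context_words], axis=0)
    return int(np.argmax(sense_vectors[word] @ ctx))

print(pick_sense("cold", ["caught", "fever"]))
```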
This paper proposes a simple method to learn a word representation for any given context that requires only the usual single-sense representation, plus coefficients that can be learned in a single pass over the data.
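The exact parameterization is not given here; as a hedged sketch, a context-specific representation can be formed as a coefficient-weighted combination of the word's single static vector and its context vectors, where the fixed `alpha` is only a stand-in for the coefficients the paper learns in one pass:

```python
# Hypothetical illustration: a context-dependent vector built from a single
# static word vector plus context vectors; `alpha` stands in for learned
# coefficients and is not the paper's actual parameterization.
import numpy as np

rng = np.random.default_rng(1)
dim = 50
vocab = {w: rng.normal(size=dim) for w in ["cold", "caught", "fever", "wind"]}

def contextual_vector(target, context_words, alpha=0.6):
    ctx = np.mean([vocab[w] for w in context_words if w in vocab], axis=0)
    return alpha * vocab[target] + (1 - alpha) * ctx

vec = contextual_vector("cold", ["caught", "fever"])
print(vec.shape)  # (50,)
```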
This work extends a previous method to support a dynamic rather than a fixed number of clusters, as other prominent methods do, and proposes a method for interpreting the resulting clusters by associating each with its most informative substitutes.
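A hedged sketch of both ideas: using scikit-learn's `AgglomerativeClustering` with a distance threshold so the number of clusters is data-driven, and labeling each cluster by its most frequent substitutes. The substitute lists are invented toy data, not the paper's pipeline:

```python
# Dynamic cluster count via a distance threshold instead of a fixed k, plus
# cluster interpretation via the most common substitutes per cluster.
from collections import Counter
from sklearn.cluster import AgglomerativeClustering
from sklearn.feature_extraction.text import TfidfVectorizer

substitutes = [  # lexical substitutes proposed for each usage of "cold"
    "flu illness virus", "chilly freezing icy",
    "flu infection virus", "freezing icy frosty",
]
X = TfidfVectorizer().fit_transform(substitutes).toarray()

# n_clusters=None + distance_threshold lets the data decide the cluster count
labels = AgglomerativeClustering(
    n_clusters=None, distance_threshold=1.0
).fit_predict(X)

for c in set(labels):
    words = " ".join(s for s, l in zip(substitutes, labels) if l == c).split()
    print(c, Counter(words).most_common(2))  # cluster's top substitutes
```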
RuDSI is a new benchmark for word sense induction (WSI) in Russian, created via manual annotation and semi-automatic clustering of Word Usage Graphs (WUGs), with no external word senses imposed on annotators.
This paper describes a previously proposed WSI methodology based on a Hierarchical Dirichlet Process (HDP), a nonparametric topic model; the approach requires no parameter tuning, uses the English ukWaC corpus as an external resource, and achieves encouraging results on the shared task.
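In that spirit, a minimal sketch (toy contexts rather than ukWaC, and gensim's `HdpModel` with default settings rather than the paper's exact configuration): treat each usage's context as a pseudo-document, let the HDP infer a data-driven number of topics, and read each usage's dominant topic as its induced sense:

```python
# Hedged HDP-for-WSI sketch using gensim; contexts are toy data.
from gensim.corpora import Dictionary
from gensim.models import HdpModel

contexts = [
    ["caught", "bed", "fever"],
    ["weather", "winter", "wind"],
    ["fever", "sneeze", "medicine"],
    ["snow", "wind", "freezing"],
]

dictionary = Dictionary(contexts)
corpus = [dictionary.doc2bow(ctx) for ctx in contexts]
hdp = HdpModel(corpus, id2word=dictionary, random_state=0)  # no fixed k

for ctx, bow in zip(contexts, corpus):
    topics = hdp[bow]  # (topic_id, probability) pairs for this usage
    sense = max(topics, key=lambda t: t[1])[0] if topics else -1
    print(sense, ctx)  # dominant topic = induced sense
```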
Two new automated semantic evaluations are applied to three distinct latent topic models, revealing that LDA and LSA each have different strengths: LDA is best at learning descriptive topics, while LSA is best at creating a compact semantic representation of documents and words in a corpus.
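For concreteness, both model families can be fit on the same bag-of-words corpus with gensim; the documents below are toy data and the topic counts are arbitrary:

```python
# LDA vs. LSA (LSI) on one toy corpus: LDA exposes descriptive topics, while
# LSA maps documents into a compact latent vector space.
from gensim.corpora import Dictionary
from gensim.models import LdaModel, LsiModel

docs = [
    ["cold", "fever", "medicine", "doctor"],
    ["weather", "cold", "snow", "winter"],
    ["doctor", "patient", "fever", "flu"],
]
dictionary = Dictionary(docs)
corpus = [dictionary.doc2bow(d) for d in docs]

lda = LdaModel(corpus, id2word=dictionary, num_topics=2, random_state=0)
lsa = LsiModel(corpus, id2word=dictionary, num_topics=2)

print(lda.print_topics())  # human-readable word distributions per topic
print(lsa[corpus[0]])      # document 0 as a 2-dimensional latent vector
```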
This paper builds a weighted graph of synonyms extracted from commonly available resources such as Wiktionary, applies word sense induction to deal with ambiguous words, and then clusters the disambiguated version of the input graph into synsets.
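The induction and clustering algorithms themselves are paper-specific; as a stand-in, the final graph-clustering step can be sketched with networkx community detection over a toy weighted synonym graph:

```python
# Not the paper's pipeline: a toy weighted synonym graph partitioned into
# synset-like groups, with greedy modularity communities standing in for the
# paper's own clustering algorithm.
import networkx as nx
from networkx.algorithms.community import greedy_modularity_communities

edges = [
    ("cold", "chilly", 0.9), ("cold", "freezing", 0.8),
    ("chilly", "freezing", 0.7), ("cold", "flu", 0.4),
    ("flu", "influenza", 0.9), ("flu", "grippe", 0.8),
]
G = nx.Graph()
G.add_weighted_edges_from(edges)

for community in greedy_modularity_communities(G, weight="weight"):
    print(sorted(community))  # each community approximates one synset
```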
This paper follows the Skip-gram framework and presents three sememe-encoded models that learn representations of sememes, senses, and words, applying an attention scheme to detect word senses in different contexts.
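The actual models couple sememe, sense, and word embeddings during Skip-gram training; the attention step alone can be sketched as a softmax over candidate sense embeddings conditioned on the context, with all embeddings here as random placeholders:

```python
# Illustrative attention over sense embeddings: the context decides how much
# each candidate sense contributes to the word's representation. Vectors are
# random placeholders, not trained sememe/sense embeddings.
import numpy as np

rng = np.random.default_rng(2)
dim = 16
senses = rng.normal(size=(3, dim))   # candidate sense embeddings of a word
context = rng.normal(size=dim)       # e.g. an averaged context embedding

logits = senses @ context
weights = np.exp(logits - logits.max())
weights /= weights.sum()             # softmax attention over senses
word_in_context = weights @ senses   # attention-weighted sense mixture

print(weights.round(3))
```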
It is shown that word embedding models trained on small but balanced corpora can be superior to those trained on large but noisy data, not only in intrinsic evaluation but also in downstream tasks like word sense induction.