3260 papers • 126 benchmarks • 313 datasets
The main objective of Semantic Similarity is to measure the distance between the semantic meanings of a pair of words, phrases, sentences, or documents. For example, the word “car” is more similar to “bus” than it is to “cat”. The two main approaches to measuring Semantic Similarity are knowledge-based approaches and corpus-based, distributional methods. Source: Visual and Semantic Knowledge Transfer for Large Scale Semi-supervised Object Detection
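In the distributional approach, words and sentences are represented as embedding vectors, and semantic distance is typically measured with cosine similarity. A minimal sketch of the “car is closer to bus than to cat” example, using made-up toy vectors rather than embeddings from a real model:

```python
from math import sqrt

def cosine_similarity(u, v):
    # cos(u, v) = dot(u, v) / (||u|| * ||v||)
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = sqrt(sum(a * a for a in u))
    norm_v = sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

# Toy 3-dimensional "embeddings" (illustrative values only):
car = [0.9, 0.8, 0.1]
bus = [0.8, 0.9, 0.2]
cat = [0.1, 0.2, 0.9]

print(cosine_similarity(car, bus) > cosine_similarity(car, cat))  # True
```

Real systems would obtain the vectors from a trained embedding model; the comparison step is the same.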
(Image credit: Papersgraph)
These leaderboards are used to track progress in Semantic Similarity
Use these libraries to find Semantic Similarity models and implementations
Sentence-BERT (SBERT) is presented: a modification of the pretrained BERT network that uses siamese and triplet network structures to derive semantically meaningful sentence embeddings that can be compared using cosine similarity.
The Tree-LSTM is introduced, a generalization of LSTMs to tree-structured network topologies that outperforms all existing systems and strong LSTM baselines on two tasks: predicting the semantic relatedness of two sentences and sentiment classification.
Experimental results show that ERNIE outperforms other baseline methods, achieving new state-of-the-art results on five Chinese natural language processing tasks including natural language inference, semantic similarity, named entity recognition, sentiment analysis and question answering.
It is shown that introducing a pre-trained multilingual language model reduces the amount of parallel training data required to achieve good performance by 80%, and a model that achieves 83.7% bi-text retrieval accuracy over 112 languages on Tatoeba is released.
The proposed method follows an edge-based approach using a lexical database and gives the highest correlation value for both word and sentence similarity, outperforming other similar models.
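Edge-based methods score word similarity by the length of the path connecting two words in a lexical taxonomy: the shorter the path, the more similar the words. A minimal sketch over a hypothetical toy is-a hierarchy (standing in for a real lexical database such as WordNet; the names and the 1/(1 + length) scaling are illustrative assumptions, not the paper's exact formula):

```python
# Toy is-a taxonomy: child -> parent (hypothetical, for illustration only).
PARENT = {
    "car": "vehicle",
    "bus": "vehicle",
    "vehicle": "entity",
    "cat": "animal",
    "animal": "entity",
}

def ancestors(word):
    # The chain from a word up to the taxonomy root.
    chain = [word]
    while word in PARENT:
        word = PARENT[word]
        chain.append(word)
    return chain

def path_length(a, b):
    # Number of edges on the shortest path between a and b through
    # their lowest common ancestor.
    pa, pb = ancestors(a), ancestors(b)
    for i, node in enumerate(pa):
        if node in pb:
            return i + pb.index(node)
    return len(pa) + len(pb)  # disconnected: worst case

def edge_similarity(a, b):
    # Shorter path => higher similarity, scaled into (0, 1].
    return 1.0 / (1.0 + path_length(a, b))
```

Here `edge_similarity("car", "bus")` exceeds `edge_similarity("car", "cat")` because "car" and "bus" meet at "vehicle" after two edges, while "car" and "cat" only meet at the root.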
This paper describes the effort to assemble a resource for STS in the medical domain, MedSTS, which consists of a total of 174,629 sentence pairs gathered from a clinical corpus at Mayo Clinic, and analyzes the medical concepts in the MedSTS corpus.
The Biomedical Language Understanding Evaluation (BLUE) benchmark is introduced to facilitate research in the development of pre-training language representations in the biomedicine domain and it is found that the BERT model pre-trained on PubMed abstracts and MIMIC-III clinical notes achieves the best results.
A global objective is formulated for learning the embeddings from text corpora and knowledge bases, which adopts a novel margin-based loss that is robust to noisy labels and faithfully models type correlation derived from knowledge bases.
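A margin-based loss of the kind described above pushes the score of a correct (positive) pair above the score of a corrupted (negative) pair by at least a fixed margin; scoring a noisy negative slightly high incurs only a bounded, linear penalty, which is part of what makes such losses robust to label noise. A minimal sketch of the general hinge form (the function name and default margin are illustrative assumptions, not the paper's exact objective):

```python
def margin_loss(score_pos, score_neg, margin=1.0):
    # Hinge-style margin ranking loss: zero once the positive pair
    # beats the negative pair by at least `margin`, linear otherwise.
    return max(0.0, margin - (score_pos - score_neg))

# Positive pair already wins by more than the margin -> no loss:
print(margin_loss(2.0, 0.5))  # 0.0
# Positive pair barely wins -> loss proportional to the margin violation:
print(margin_loss(0.5, 0.4))
```

In training, this loss would be summed over sampled positive/negative pairs and minimized with gradient descent.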
Evaluating different word embedding models trained on a large Portuguese corpus, including both Brazilian and European variants, suggests that word analogies are not appropriate for word embedding evaluation; task-specific evaluations appear to be a better option.
Adding a benchmark result helps the community track progress.