Controversy and Conformity: from Generalized to Personalized Aggressiveness Detection (2021-01-01)

TL;DR

Only a few annotations of the most controversial documents are enough for all the personalization methods to significantly outperform classic, generalized solutions.

Abstract

Content such as hate speech and offensive, toxic, or aggressive documents is perceived differently by different consumers. Such content is commonly identified using classifiers based solely on textual content, which generalize pre-agreed meanings of difficult problems. These models return the same result for every user, which leads to a high misclassification rate, observable especially for contentious, aggressive documents. Both document controversy and user nonconformity require new solutions. Therefore, we propose novel personalized approaches that respect individual beliefs, expressed either by user conformity-based measures or by various embeddings of the users' previous text annotations. We found that only a few annotations of the most controversial documents are enough for all our personalization methods to significantly outperform classic, generalized solutions. The more controversial the content, the greater the gain. The personalized solutions may be used to efficiently filter unwanted aggressive content in a way adjusted to a given person.
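
The abstract names two quantities, document controversy and user conformity, without defining them. As a rough illustration only, the sketch below assumes binary aggressiveness labels, measures controversy as the binary entropy of a document's labels, and measures conformity as the share of a user's labels that match the strict majority label. These definitions, the function names `controversy` and `conformity`, and the toy `annotations` data are assumptions made for this example and are not taken from the paper.

```python
from math import log2

def controversy(labels):
    """Binary entropy of 0/1 labels: 0.0 = full agreement, 1.0 = even split.

    Assumed definition for illustration; not necessarily the paper's measure.
    """
    p = sum(labels) / len(labels)  # fraction of "aggressive" (1) labels
    if p in (0.0, 1.0):
        return 0.0
    return -(p * log2(p) + (1 - p) * log2(1 - p))

def conformity(annotations, user):
    """Share of a user's labels that match the strict majority label.

    Documents with a tied vote are skipped. Returns None when the user has
    no labelled document with a strict majority. Assumed definition.
    """
    matches, counted = 0, 0
    for doc_labels in annotations.values():
        if user not in doc_labels:
            continue
        ones = sum(doc_labels.values())
        zeros = len(doc_labels) - ones
        if ones == zeros:
            continue  # no strict majority, skip this document
        majority = 1 if ones > zeros else 0
        counted += 1
        matches += int(doc_labels[user] == majority)
    return matches / counted if counted else None

# Hypothetical toy data: document id -> {user id -> label (1 = aggressive)}
annotations = {
    "d1": {"u1": 1, "u2": 1, "u3": 1},
    "d2": {"u1": 1, "u2": 0, "u3": 0},
    "d3": {"u1": 1, "u2": 0, "u3": 1, "u4": 0},
}

# Rank documents from most to least controversial under the assumed measure.
ranked = sorted(annotations,
                key=lambda d: controversy(list(annotations[d].values())),
                reverse=True)
```

Under these assumptions, `ranked` places the evenly split document `d3` first, which is the kind of document the abstract suggests is most informative to have a new user annotate.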

Authors

Tomasz Kajdanowicz

Jan Kocoń

Kamil Kanclerz
