UTMN at SemEval-2020 Task 11: A Kitchen Solution to Automatic Propaganda Detection (2020-08-22T00:00:00.000000Z)

TL;DR

A fast solution to propaganda detection at SemEval-2020 Task 11, based on feature adjustment, using per-token vectorization of features and a simple Logistic Regression classifier to quickly test different hypotheses about the data.

Abstract

The article describes a fast solution to propaganda detection at SemEval-2020 Task 11, based on feature adjustment. We use per-token vectorization of features and a simple Logistic Regression classifier to quickly test different hypotheses about our data. We come up with what seems to us the best solution, however, we are unable to align it with the result of the metric suggested by the organizers of the task. We test how our system handles class and feature imbalance by varying the number of samples of two classes (Propaganda and None) in the training set, the size of a context window in which a token is vectorized and combination of vectorization means. The result of our system at SemEval2020 Task 11 is F-score=0.37.

Authors

Anna Glazkova

2 papers

Elena Mikhalkova

1 papers

Nadezhda Ganzherli

1 papers

TL;DR

Abstract

Authors

References40 items

SemEval-2020 Task 11: Detection of Propaganda Techniques in News Articles

American Political Science Review

Poor Man's BERT: Smaller and Faster Transformer Models

Cost-Sensitive BERT for Generalisable Sentence Classification on Imbalanced Data

Fine-Tuned Neural Models for Propaganda Detection at the Sentence and Fragment levels

HuggingFace's Transformers: State-of-the-art Natural Language Processing

Fine-Grained Analysis of Propaganda in News Article

Neural Architectures for Fine-Grained Propaganda Detection in News

Proppy: A System to Unmask Propaganda in Online News

Defending Against Neural Fake News

Sampling the News Producers: A Large News and Feature Data Set for the Study of the Complex Media Landscape

Deep Contextualized Word Representations

Truth of Varying Shades: Analyzing Language in Fake News and Political Fact-Checking

Detecting Intentional Lexical Ambiguity in English Puns

PunFields at SemEval-2017 Task 7: Employing Roget’s Thesaurus in Automatic Pun Recognition and Interpretation

The Development and Psychometric Properties of LIWC2015

An Improved Non-monotonic Transition System for Dependency Parsing

Teaching about Propaganda: An Examination of the Historical Roots of Media Literacy

VADER: A Parsimonious Rule-Based Model for Sentiment Analysis of Social Media Text

Journalism, ideology and linguistics: The paradox of Chomsky’s linguistic legacy and his ‘propaganda model’

Scikit-learn: Machine Learning in Python

Steven Bird, Ewan Klein and Edward Loper: Natural Language Processing with Python, Analyzing Text with the Natural Language Toolkit

Software Framework for Topic Modelling with Large Corpora

Introduction to Psychology: Gateways to Mind and Behavior

Munitions of the mind

Communication Theories: Origins, Methods and Uses in the Mass Media

A Rulebook for Arguments

Critically Reading for Propaganda Techniques in Grade Six.

Propaganda and the European War

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Synthetic Propaganda Embeddings To Train A Linear Projection

Fine-Grained Propaganda Detection with Fine-Tuned BERT

Using NLG for speech synthesis of mathematical sentences

Symbiotic radicalisation strategies: Propaganda tools and neuro linguistic programming

The propaganda model and sociology: understanding the media and society

Reductio ad Hitlerum: Trumping the Judicial Nazi Card

A sentiment vector of the context window : size 4, acquired with NLTK Vader Sentiment Intensity Analyzer (Hutto and Gilbert, 2014; Bird et al.,

Propaganda , volume 8

Childs

An embedding vector of the context window : size 300, acquired with SpaCy “vector” command

Field of Study

Venue Information

Name

Type

URL

Alternate Names