
The Values Encoded in Machine Learning Research

Published in Conference on Fairness, Accountability and Transparency (2021-06-29)


TL;DR

Introduces a method and annotation scheme for studying the values encoded in documents such as research papers, and finds systematic textual evidence that the field's top values are being defined and applied with assumptions and implications that generally support the centralization of power.

Abstract

Machine learning currently exerts an outsized influence on the world, increasingly affecting institutional practices and impacted communities. It is therefore critical that we question vague conceptions of the field as value-neutral or universally beneficial, and investigate what specific values the field is advancing. In this paper, we first introduce a method and annotation scheme for studying the values encoded in documents such as research papers. Applying the scheme, we analyze 100 highly cited machine learning papers published at premier machine learning conferences, ICML and NeurIPS. We annotate key features of papers which reveal their values: their justification for their choice of project, which attributes of their project they uplift, their consideration of potential negative consequences, and their institutional affiliations and funding sources. We find that few of the papers justify how their project connects to a societal need (15%) and far fewer discuss negative potential (1%). Through line-by-line content analysis, we identify 59 values that are uplifted in ML research, and, of these, we find that the papers most frequently justify and assess themselves based on Performance, Generalization, Quantitative evidence, Efficiency, Building on past work, and Novelty. We present extensive textual evidence and identify key themes in the definitions and operationalization of these values. Notably, we find systematic textual evidence that these top values are being defined and applied with assumptions and implications generally supporting the centralization of power. Finally, we find increasingly close ties between these highly cited papers and tech companies and elite universities.
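The abstract's headline percentages come from a simple tally: for each annotated value, count the share of the 100 papers whose text exhibits it. A minimal sketch of that tally, using hypothetical value names and toy data rather than the paper's actual annotation set:

```python
from collections import Counter

# Hypothetical annotations: for each paper, the set of values its text
# was annotated as uplifting. Toy data for illustration only.
annotations = [
    {"Performance", "Novelty", "Efficiency"},
    {"Performance", "Generalization"},
    {"Performance", "Societal need"},
    {"Quantitative evidence", "Building on past work"},
]

def value_frequencies(papers):
    """Return the fraction of papers whose annotations include each value."""
    counts = Counter(value for paper in papers for value in paper)
    n = len(papers)
    return {value: count / n for value, count in counts.items()}

freqs = value_frequencies(annotations)
# With the toy data above, "Performance" appears in 3 of 4 papers (0.75).
```

In the paper itself, the analogous computation over 100 papers and 59 identified values yields figures such as 15% for justifying a societal need and 1% for discussing negative potential.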

Authors

Abeba Birhane

2 Papers

Pratyusha Kalluri

1 Paper

Dallas Card

1 Paper

William Agnew

1 Paper

Ravit Dotan

1 Paper

Michelle Bao

1 Paper

Research Impact

237 Citations
190 References
0 Datasets


Field of Study: Computer Science

Journal: Proceedings of the 2022 ACM Conference on Fairness, Accountability, and Transparency

Venue: Conference on Fairness, Accountability and Transparency (conference)
Venue URL: https://facctconference.org/
Alternate Names: FAccT, Conf Fairness Account Transpar