
Membership Inference Attacks on Machine Learning: A Survey

Published in ACM Computing Surveys (2021-03-14)


TL;DR

This article provides taxonomies for both membership inference attacks and defenses based on their characterizations, discusses their pros and cons, and points out several promising future research directions to inspire researchers who wish to follow this area.

Abstract

Machine learning (ML) models have been widely applied to various applications, including image classification, text generation, audio recognition, and graph data analysis. However, recent studies have shown that ML models are vulnerable to membership inference attacks (MIAs), which aim to infer whether a data record was used to train a target model or not. MIAs on ML models can directly lead to a privacy breach. For example, by identifying that a clinical record was used to train a model associated with a certain disease, an attacker can infer with high confidence that the owner of that record has the disease. In recent years, MIAs have been shown to be effective on various ML models, e.g., classification models and generative models. Meanwhile, many defense methods have been proposed to mitigate MIAs. Although MIAs on ML models form a newly emerging and rapidly growing research area, there has been no systematic survey on this topic yet. In this article, we conduct the first comprehensive survey on membership inference attacks and defenses. We provide taxonomies for both attacks and defenses based on their characterizations, and discuss their pros and cons. Based on the limitations and gaps identified in this survey, we point out several promising future research directions to inspire researchers who wish to follow this area. This survey not only serves as a reference for the research community but also provides a clear description for researchers outside this domain. To further help researchers, we have created an online resource repository, which we will keep updated with future relevant work. Interested readers can find the repository at https://github.com/HongshengHu/membership-inference-machine-learning-literature.
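
To make the attack setting concrete, the following is a minimal, self-contained sketch (not taken from the survey) of the simplest attack family it covers: a confidence-thresholding membership inference attack against an overfit classifier. The scikit-learn model, the synthetic data, and the 0.9 threshold are all illustrative assumptions, not the survey's experimental setup.

```python
# Minimal confidence-thresholding membership inference attack (illustrative sketch).
# Assumes scikit-learn and numpy are available; model, data, and threshold are toy choices.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPClassifier

# Synthetic "private" dataset: half is used to train the target model (members),
# half is held out (non-members).
X, y = make_classification(n_samples=2000, n_features=20, n_informative=10,
                           n_classes=2, random_state=0)
X_member, X_nonmember, y_member, y_nonmember = train_test_split(
    X, y, test_size=0.5, random_state=0)

# An intentionally over-parameterized, unregularized target model: overfitting widens
# the confidence gap between members and non-members, which is what the attack exploits.
target = MLPClassifier(hidden_layer_sizes=(256, 256), max_iter=2000,
                       alpha=0.0, random_state=0).fit(X_member, y_member)

def confidence(model, records):
    """Target model's confidence in its own prediction for each record."""
    return model.predict_proba(records).max(axis=1)

# Attack: guess "member" whenever the prediction confidence exceeds a threshold.
# In practice the threshold would be calibrated, e.g. with shadow models;
# here it is simply fixed for illustration.
threshold = 0.9
pred_member = confidence(target, X_member) > threshold        # ideally mostly True
pred_nonmember = confidence(target, X_nonmember) > threshold  # ideally mostly False

# Balanced attack accuracy: 0.5 means the attacker does no better than random guessing.
attack_accuracy = 0.5 * (pred_member.mean() + (1 - pred_nonmember.mean()))
print(f"Membership inference attack accuracy: {attack_accuracy:.2f}")
```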

Authors

  • Hongsheng Hu
  • Z. Salcic
  • Lichao Sun
  • G. Dobbie
  • P. Yu
  • Xuyun Zhang

Datasets

  • Foursquare
  • MIMIC-III (Medical Information Mart for Intensive Care III)
  • Fashion-MNIST
  • BookCorpus
  • Pubmed
  • MNIST
  • ChestX-ray8
  • CIFAR-10 (Canadian Institute for Advanced Research, 10 classes)
  • CIFAR-100
  • CelebA (CelebFaces Attributes Dataset)
  • Cityscapes
  • Colored MNIST
  • SVHN (Street View House Numbers)
  • RCV1 (Reuters Corpus Volume 1)
  • UTKFace
  • ImageNet

Research Impact

  • Citations: 333
  • References: 267
  • Datasets: 16
  • Authors: 6




Field of Study

  • Computer Science

Journal Information

  • Name: ACM Computing Surveys (CSUR)
  • Volume: 54

Venue Information

  • Name: ACM Computing Surveys
  • Type: journal
  • URL: http://www.acm.org/pubs/surveys/
  • Alternate Names: ACM Comput Surv