3260 papers • 126 benchmarks • 313 datasets
Data augmentation refers to techniques that expand a training set by applying modifications to existing examples. It not only increases the amount of data but also its diversity, and during training it acts as a regularizer that helps avoid overfitting. Data augmentation has proven useful in domains such as computer vision and NLP. In computer vision, common transformations include cropping, flipping, and rotation; in NLP, techniques include word swapping, deletion, and random insertion, among others. Further reading: "A Survey of Data Augmentation Approaches for NLP" and "A survey on Image Data Augmentation for Deep Learning". (Image credit: Albumentations)
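For illustration, a minimal sketch of both kinds of augmentation mentioned above, assuming torchvision is installed; the crop size, flip probability, rotation range, and deletion probability are placeholder choices, not recommended settings.

```python
import random

import torchvision.transforms as T

# Computer vision: compose cropping, flipping, and rotation into one
# training-time pipeline applied to each image on the fly.
cv_augment = T.Compose([
    T.RandomCrop(32, padding=4),     # crop a random 32x32 window after padding
    T.RandomHorizontalFlip(p=0.5),   # flip left-right half of the time
    T.RandomRotation(degrees=15),    # rotate by up to +/-15 degrees
    T.ToTensor(),
])

# NLP: random word deletion, one of the simple text augmentations listed above.
def random_deletion(sentence: str, p: float = 0.1) -> str:
    words = sentence.split()
    kept = [w for w in words if random.random() > p]
    return " ".join(kept) if kept else random.choice(words)
```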
These leaderboards are used to track progress in Data Augmentation.
Use these libraries to find Data Augmentation models and implementations.
This work uses new features: WRC, CSP, CmBN, SAT, Mish activation, Mosaic data augmentation, DropBlock regularization, and CIoU loss, and combines some of them to achieve state-of-the-art results: 43.5% AP on the MS COCO dataset at a real-time speed of ~65 FPS on a Tesla V100.
With simple modifications to MoCo, this note establishes stronger baselines that outperform SimCLR and do not require large training batches, in the hope of making state-of-the-art unsupervised learning research more accessible.
This paper describes a simple procedure called AutoAugment to automatically search for improved data augmentation policies, which achieves state-of-the-art accuracy on CIFAR-10, CIFAR-100, SVHN, and ImageNet (without additional data).
This work presents SpecAugment, a simple data augmentation method for speech recognition that is applied directly to the feature inputs of a neural network (i.e., filter bank coefficients) and achieves state-of-the-art performance on the LibriSpeech 960h and Switchboard 300h tasks, outperforming all prior work.
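A minimal numpy sketch of the frequency- and time-masking part of this idea (it omits time warping, and the mask-width parameters are illustrative rather than the paper's settings).

```python
import numpy as np

def spec_augment(spec, num_freq_masks=2, max_f=8, num_time_masks=2, max_t=20):
    """spec: log-mel spectrogram of shape (num_mel_bins, num_frames); returns a masked copy."""
    spec = spec.copy()
    n_mels, n_frames = spec.shape
    for _ in range(num_freq_masks):                      # mask random frequency bands
        f = np.random.randint(0, max_f + 1)
        f0 = np.random.randint(0, max(1, n_mels - f))
        spec[f0:f0 + f, :] = 0.0
    for _ in range(num_time_masks):                      # mask random spans of frames
        t = np.random.randint(0, max_t + 1)
        t0 = np.random.randint(0, max(1, n_frames - t))
        spec[:, t0:t0 + t] = 0.0
    return spec
```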
A novel training methodology is proposed that consistently outperforms cross entropy on supervised learning tasks across different architectures and data augmentations; the batch contrastive loss, which has recently been shown to be very effective at learning powerful representations in the self-supervised setting, is modified for the fully supervised setting.
SimCSE is presented, a simple contrastive learning framework that greatly advances the state-of-the-art sentence embeddings and regularizes pre-trained embeddings' anisotropic space to be more uniform, and it better aligns positive pairs when supervised signals are available.
This paper shows that the simple regularization technique of randomly masking out square regions of input during training, which is called cutout, can be used to improve the robustness and overall performance of convolutional neural networks.
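A minimal numpy sketch of the masking idea described above; the 8-pixel patch size is an illustrative choice rather than the paper's setting.

```python
import numpy as np

def cutout(image, size=8):
    """Zero out one randomly placed size x size square of an (H, W[, C]) image."""
    img = image.copy()
    h, w = img.shape[:2]
    cy, cx = np.random.randint(h), np.random.randint(w)     # random patch center
    y1, y2 = max(0, cy - size // 2), min(h, cy + size // 2)  # clip the square to the image
    x1, x2 = max(0, cx - size // 2), min(w, cx + size // 2)
    img[y1:y2, x1:x2] = 0.0                                  # mask the region
    return img
```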
The proposed network extends the previous U-Net architecture from Ronneberger et al. by replacing all 2D operations with their 3D counterparts and performs on-the-fly elastic deformations for efficient data augmentation during training.
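A minimal 2D sketch of elastic deformation (numpy/scipy); the paper applies the 3D analogue, and the alpha and sigma values here are illustrative.

```python
import numpy as np
from scipy.ndimage import gaussian_filter, map_coordinates

def elastic_deform(image, alpha=30.0, sigma=4.0):
    """Displace the pixels of a 2D image along a smooth random vector field."""
    h, w = image.shape
    dx = gaussian_filter(np.random.uniform(-1, 1, (h, w)), sigma) * alpha  # smooth x-displacements
    dy = gaussian_filter(np.random.uniform(-1, 1, (h, w)), sigma) * alpha  # smooth y-displacements
    y, x = np.meshgrid(np.arange(h), np.arange(w), indexing="ij")
    coords = np.stack([y + dy, x + dx])              # where to sample each output pixel
    return map_coordinates(image, coords, order=1, mode="reflect")
```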
A new perspective on how to effectively noise unlabeled examples is presented, and it is argued that the quality of noising, specifically noise produced by advanced data augmentation methods, plays a crucial role in semi-supervised learning.
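A minimal PyTorch sketch of the consistency objective behind this idea: the model's predictions on an unlabeled example and on its augmented ("noised") version are pushed to agree. The `model` and `augment` callables are assumed to exist; in the paper the noise comes from advanced augmentation policies rather than this stub.

```python
import torch
import torch.nn.functional as F

def consistency_loss(model, unlabeled_batch, augment):
    with torch.no_grad():
        target = F.softmax(model(unlabeled_batch), dim=-1)             # prediction on the clean input
    log_pred = F.log_softmax(model(augment(unlabeled_batch)), dim=-1)  # prediction on the noised input
    return F.kl_div(log_pred, target, reduction="batchmean")           # make the two distributions agree
```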
This paper introduces EfficientNetV2, a new family of convolutional networks with faster training speed and better parameter efficiency than previous models, found by searching a space enriched with new ops such as Fused-MBConv.