Home Research Papers Datasets State of the Art Pricing

Discover, visualize, and connect AI research papers. Explore the latest trends and insights in artificial intelligence research.

Product

Home
Research Papers
About

Support

Contact
Terms of Service
Privacy Policy

© 2026 Papersgraph. All rights reserved.

image-generation

Image-to-Image Translation

3260 papers • 126 benchmarks • 313 datasets

Image-to-Image Translation is a task in computer vision and machine learning where the goal is to learn a mapping between an input image and an output image, such that the output image can be used to perform a specific task, such as style transfer, data augmentation, or image restoration. ( Image credit: Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks )

(Image credit: Papersgraph)

Benchmarks

These leaderboards are used to track progress in image-generation

Trend

Dataset

Best Model

Actions

SYNTHIA-to-Cityscapes

SYNTHIA-to-Cityscapes

GTAV-to-Cityscapes Labels

GTAV-to-Cityscapes Labels

Libraries

i

Use these libraries to find image-generation models and implementations

eriklindernoren/PyTorch-GAN

8 papers 15,669

Datasets

Cityscapes

KITTI

ADE20K

CelebA-HQ

SYNTHIA

GTA5

Subtasks

Unsupervised Image-To-Image Translation Synthetic-to-Real Translation Multimodal Unsupervised Image-To-Image Translation Cross-View Image-to-Image Translation Cross-View Image-to-Image Translation

Most implemented papers

Deep Residual Learning for Image Recognition

Kaiming He, X. Zhang, Shaoqing Ren•Wed Dec 09 2015

This work presents a residual learning framework to ease the training of networks that are substantially deeper than those used previously, and provides comprehensive empirical evidence showing that these residual networks are easier to optimize, and can gain accuracy from considerably increased depth.

214235

Content

Introduction Benchmarks Datasets Subtasks Libraries Papers

Image-to-Image Translation | State-of-the-Art

Cityscapes Labels-to-Photo

Cityscapes Labels-to-Photo

ADE20K Labels-to-Photos

ADE20K Labels-to-Photos

COCO-Stuff Labels-to-Photos

COCO-Stuff Labels-to-Photos

ADE20K-Outdoor Labels-to-Photos

ADE20K-Outdoor Labels-to-Photos

IXI

IXI

CelebA-HQ

CelebA-HQ

Cityscapes-to-Foggy Cityscapes

Cityscapes-to-Foggy Cityscapes

Cityscapes Photo-to-Labels

Cityscapes Photo-to-Labels

cat2dog

cat2dog

RaFD

RaFD

selfie2anime

selfie2anime

LLVIP

LLVIP

BCI

BCI

horse2zebra

horse2zebra

photo2vangogh

photo2vangogh

zebra2horse

zebra2horse

vangogh2photo

vangogh2photo

SYNTHIA Fall-to-Winter

SYNTHIA Fall-to-Winter

Aerial-to-Map

Aerial-to-Map

selfie-to-anime

selfie-to-anime

anime-to-selfie

anime-to-selfie

AFHQ

AFHQ

Deep-Fashion

Deep-Fashion

Object Transfiguration (sheep-to-giraffe)

Object Transfiguration (sheep-to-giraffe)

ADE-Indoor Labels-to-Photo

ADE-Indoor Labels-to-Photo

photo2portrait

photo2portrait

dog2cat

dog2cat

portrait2photo

portrait2photo

KITTI Object Tracking Evaluation 2012

KITTI Object Tracking Evaluation 2012

Zebra and Horses

Zebra and Horses

Apples and Oranges

Apples and Oranges

2017_test set

2017_test set

BRATS

BRATS

AFHQ (Cat to Dog)

AFHQ (Cat to Dog)

AFHQ (Wild to Dog)

AFHQ (Wild to Dog)

Wenchao-Du/LIR-for-Unsupervised-IR

3 papers 95

yaxingwang/SEMIT

3 papers 52

ganslate-team/ganslate

3 papers 32

3 papers 30

yaxingwang/SDIT

3 papers 27

thuml/Transfer-Learning-Library

2 papers 3,126

open-mmlab/mmgeneration

2 papers 1,796

mindslab-ai/hififace

2 papers 328

kunheek/style-aware-discriminator

2 papers 110

2 papers 63

ExplainableML/UncerGuidedI2I

2 papers 47

ranery/Bayesian-CycleGAN

2 papers 45

2 papers 30

taivu1998/GANime

2 papers 28

icon-lab/pflsynth

2 papers 14

DeepFashion

DeepFashion

Perceptual Similarity

Perceptual Similarity

AFHQ

COCO-Stuff

Fundus to Angiography Generation

Facial Makeup Transfer

Real-to-Cartoon translation

Photo-To-Caricature Translation

Bird View Synthesis

0

Image-to-Image Translation with Conditional Adversarial Networks

Phillip Isola, Alexei A. Efros, Jun-Yan Zhu, Tinghui Zhou•Sun Nov 20 2016

Conditional adversarial networks are investigated as a general-purpose solution to image-to-image translation problems and it is demonstrated that this approach is effective at synthesizing photos from label maps, reconstructing objects from edge maps, and colorizing images, among other tasks.

21622 0

Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks

Phillip Isola, Alexei A. Efros, Jun-Yan Zhu, Taesung Park•Wed Mar 29 2017

This work presents an approach for learning to translate an image from a source domain X to a target domain Y in the absence of paired examples, and introduces a cycle consistency loss to push F(G(X)) ≈ X (and vice versa).

5621 0

U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation

Minjae Kim, Hyeonwoo Kang, Kwanghee Lee•Wed Jul 24 2019

A novel method for unsupervised image- to-image translation, which incorporates a new attention module and a new learnable normalization function in an end-to-end manner, which can translate both images requiring holistic changes and images requiring large shape changes.

618 0

Semantic Image Synthesis With Spatially-Adaptive Normalization

Ming-Yu Liu, Jun-Yan Zhu, Ting-Chun Wang, Taesung Park•Sun Mar 17 2019

S spatially-adaptive normalization is proposed, a simple but effective layer for synthesizing photorealistic images given an input semantic layout that allows users to easily control the style and content of image synthesis results as well as create multi-modal results.

3008 0

High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs

Bryan Catanzaro, Ming-Yu Liu, Jan Kautz, Jun-Yan Zhu, Ting-Chun Wang, Andrew Tao•Wed Nov 29 2017

A new method for synthesizing high-resolution photo-realistic images from semantic label maps using conditional generative adversarial networks (conditional GANs) is presented, which significantly outperforms existing methods, advancing both the quality and the resolution of deep image synthesis and editing.

4278 0

Multimodal Unsupervised Image-to-Image Translation

Serge J. Belongie, Xun Huang, Ming-Yu Liu, Jan Kautz•Wed Apr 11 2018

A Multimodal Unsupervised Image-to-image Translation (MUNIT) framework that assumes that the image representation can be decomposed into a content code that is domain-invariant, and a style code that captures domain-specific properties.

2620 0

StarGAN v2: Diverse Image Synthesis for Multiple Domains

Yunjey Choi, Youngjung Uh, Jaejun Yoo, Jung-Woo Ha•Tue Dec 03 2019

StarGAN v2, a single framework that tackles image-to-image translation models with limited diversity and multiple models for all domains, is proposed and shows significantly improved results over the baselines.

1972 0

Everybody Dance Now

Alexei A. Efros, Caroline Chan, Shiry Ginosar, Tinghui Zhou•Tue Aug 21 2018

This paper presents a simple method for “do as I do” motion transfer: given a source video of a person dancing, it is shown that it can transfer that performance to a novel (amateur) target after only a few minutes of the target subject performing standard moves.

819 0

StarGAN: Unified Generative Adversarial Networks for Multi-domain Image-to-Image Translation

Sunghun Kim, J. Choo, Yunjey Choi, Jung-Woo Ha, Min-Je Choi, M. Kim•Thu Nov 23 2017

A unified model architecture of StarGAN allows simultaneous training of multiple datasets with different domains within a single network, which leads to StarGAN's superior quality of translated images compared to existing models as well as the novel capability of flexibly translating an input image to any desired target domain.

3802 0

Adding a benchmark result helps the community track progress.