Home Research Papers Datasets State of the Art Pricing

Discover, visualize, and connect AI research papers. Explore the latest trends and insights in artificial intelligence research.

Product

Home
Research Papers
About

Support

Contact
Terms of Service
Privacy Policy

© 2026 Papersgraph. All rights reserved.

computer-vision-3

Image Cropping

3260 papers • 126 benchmarks • 313 datasets

Image Cropping is a common photo manipulation process, which improves the overall composition by removing unwanted regions. Image Cropping is widely used in photographic, film processing, graphic design, and printing businesses. Source: Listwise View Ranking for Image Cropping

(Image credit: Papersgraph)

Benchmarks

These leaderboards are used to track progress in image-cropping-3

Trend

Dataset

Best Model

Actions

FLMS

FLMS

Libraries

i

Use these libraries to find image-cropping-3 models and implementations

2 papers 9,322

Datasets

AADB

Flickr Cropping Dataset

Flickr Cropping Dataset

CUHK Image Cropping

CUHK Image Cropping

GNMC

Subtasks

No subtasks available.

Most implemented papers

Kornia: an Open Source Differentiable Computer Vision Library for PyTorch

Dmytro Mishkin, Edgar Riba, D. Ponsa, Ethan Rublee, Gary R. Bradski•Fri Oct 04 2019

Kornia is composed of a set of modules containing operators that can be inserted inside neural networks to train models to perform image transformations, camera calibration, epipolar geometry, and low level image processing techniques, such as filtering and edge detection that operate directly on high dimensional tensor representations.

410

Content

Introduction Benchmarks Datasets Subtasks Libraries Papers

0

See Better Before Looking Closer: Weakly Supervised Data Augmentation Network for Fine-Grained Visual Classification

H. Qi, Tao Hu•Fri Jan 25 2019

Comprehensive experiments in common fine-grained visual classification datasets show that the proposed Weakly Supervised Data Augmentation Network (WS-DAN) surpasses the state-of-the-art methods, which demonstrates its effectiveness.

259 0

A2-RL: Aesthetics Aware Reinforcement Learning for Image Cropping

Kaiqi Huang, Huikai Wu, Junge Zhang, Debang Li•Wed Sep 13 2017

Image cropping aims at improving the aesthetic quality of images by adjusting their composition. Most weakly supervised cropping methods (without bounding box supervision) rely on the sliding window mechanism. The sliding window mechanism requires fixed aspect ratios and limits the cropping region with arbitrary size. Moreover, the sliding window method usually produces tens of thousands of windows on the input image which is very time-consuming. Motivated by these challenges, we firstly formulate the aesthetic image cropping as a sequential decision-making process and propose a weakly supervised Aesthetics Aware Reinforcement Learning (A2-RL) framework to address this problem. Particularly, the proposed method develops an aesthetics aware reward function which especially benefits image cropping. Similar to human's decision making, we use a comprehensive state representation including both the current observation and the historical experience. We train the agent using the actor-critic architecture in an end-to-end manner. The agent is evaluated on several popular unseen cropping datasets. Experiment results show that our method achieves the state-of-the-art performance with much fewer candidate windows and much less time compared with previous weakly supervised methods.

113 0

An End-to-End Neural Network for Image Cropping by Learning Composition from Aesthetic Photos

Haotong Zhang, Peng Lu, Xujun Peng, Xiaofu Jin•Mon Jul 01 2019

A deep learning based framework to learn the objects composition from photos with high aesthetic qualities is proposed, where an anchor region is detected through a convolutional neural network (CNN) with the Gaussian kernel to maintain the interested objects' integrity.

17 0

Image Cropping on Twitter: Fairness Metrics, their Limitations, and the Importance of Representation, Design, and Agency

Kyra Yee, U. Tantipongpipat, Shubhanshu Mishra•Mon May 17 2021

An extensive analysis using formalized group fairness metrics finds systematic disparities in cropping and identifies contributing factors, including the fact that the cropping based on the single most salient point can amplify the disparities because of an effect the authors term argmax bias.

55 0

Resolution enhancement processing on low quality images using swin transformer based on interval dense connection strategy

Ruikang Ju, Chih-Chia Chen, Jen-Shiun Chiang, Yu-Shian Lin, Wei-Han Chen•Wed Mar 15 2023

This research work proposes the Interval Dense Connection Strategy, which connects different blocks according to the newly designed algorithm to improve the model feature reuse, and presents a new model, which is named SwinOIR (Object Image Restoration Using Swin Transformer).

21 0

FiT: Flexible Vision Transformer for Diffusion Model

Wanli Ouyang, Zeyu Lu, Zidong Wang, Di Huang, Chengyue Wu, Xihui Liu, Lei Bai•Sun Feb 18 2024

The Flexible Vision Transformer (FiT), a transformer architecture specifically designed for generating images with unrestricted resolutions and aspect ratios, exhibits remarkable flexibility in resolution extrapolation generation.

76 0

Quantitative Analysis of Automatic Image Cropping Algorithms: A Dataset and Comparative Study

Yi-Ling Chen, Tzu-Wei Huang, Kai-han Chang, Y. Tsai, Hwann-Tzong Chen, Bing-Yu Chen•Wed Jan 04 2017

This work conducts an extensive study on traditional approaches as well as ranking-based croppers trained on various image features, and a new dataset consisting of high quality cropping and pairwise ranking annotations is presented to evaluate the performance of various baselines.

83 0

Learning to Compose with Professional Photographs on the Web

Yi-Ling Chen, Min Sun, Shao-Yi Chien, K. Ma, Jan P. Klopp•Tue Jan 31 2017

This work forms the photo composition problem as a view finding process which successively examines pairs of views and determines their aesthetic preferences, and exploits the rich professional photographs on the web to mine unlimited high-quality ranking samples and demonstrates that an aesthetics-aware deep ranking network can be trained without explicitly modeling any photographic rules.

94 0

Adding a benchmark result helps the community track progress.

Image Cropping | State-of-the-Art