3260 papers • 126 benchmarks • 313 datasets
Generate image based on given human-object interaction input
(Image credit: Papersgraph)
These leaderboards are used to track progress in conditional-image-generation
No benchmarks available.
Use these libraries to find conditional-image-generation models and implementations
No datasets available.
No subtasks available.
A pluggable interaction control model is proposed that extends existing pre-trained T2I diffusion models to enable them being better conditioned on interactions, and outperforms existing baselines by a large margin in HOI detection score, as well as fidelity in FID and KID.
Adding a benchmark result helps the community track progress.