GIE-Bench is a benchmark designed to evaluate text-guided image editing models across two critical dimensions:
Functional correctness — assessed via VQA-style multiple-choice questions Content preservation — evaluated through object-aware masking and image similarity It includes over 1,000 high-quality editing examples across 20 categories and 9 edit types, with masks, instructions, and questions.