3260 papers • 126 benchmarks • 313 datasets
Referring expression segmentation is the task of labeling the pixels of an image or video that belong to the object instance referred to by a natural-language expression. The referring expression (RE) must unambiguously identify a single object in the discourse or scene (the referent).
(Image credit: Open Source)
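As a rough illustration of the task's output, the sketch below represents the result as a binary mask over the image and compares a predicted mask with a reference mask using intersection-over-union (IoU). This page does not name an evaluation metric, so IoU is an assumption here, chosen because it is a common way to compare segmentation masks. The function name `mask_iou`, the tiny 3x4 masks, and the example expression are all hypothetical.

```python
from typing import List

# A mask is a grid of 0/1 values: 1 marks a pixel of the referent, 0 is background.
Mask = List[List[int]]


def mask_iou(pred: Mask, target: Mask) -> float:
    """Intersection-over-union of two binary masks of the same shape."""
    inter = 0
    union = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            if p and t:
                inter += 1
            if p or t:
                union += 1
    # Assumption: two empty masks are treated as a perfect match.
    return inter / union if union else 1.0


# Hypothetical 3x4 image and the expression "the cup on the left".
target = [
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 0, 0],
]
pred = [
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
]

score = mask_iou(pred, target)
print(f"IoU = {score:.2f}")
```

In this toy case the prediction misses one of the four referent pixels, so three pixels overlap out of four in the union.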