Correlation-guided Query-Dependency Calibration in Video Representation Learning for Temporal Grounding (2023-01-01T00:00:00.000000Z)

TL;DR

This paper designs an adaptive cross-attention layer with dummy tokens, and uses a moment-adaptive saliency detector to exploit each video clip’s degrees of text engagement, and validate the superiority of CG-DETR with the state-of-the-art results on various benchmarks for both moment retrieval and highlight detection.

Authors

WonJun Moon

3 papers

Jae-Pil Heo

2 papers

Sangeek Hyun

1 papers

Subeen Lee

2 papers

Correlation-guided Query-Dependency Calibration in Video Representation Learning for Temporal Grounding

TL;DR

Authors

Field of Study

Journal Information

Name

Volume

Venue Information

Name

Type

URL

Alternate Names