Home Research Papers Datasets State of the Art Pricing

Discover, visualize, and connect AI research papers. Explore the latest trends and insights in artificial intelligence research.

Product

Home
Research Papers
About

Support

Contact
Terms of Service
Privacy Policy

© 2026 Papersgraph. All rights reserved.

audio-19

Zero-shot Audio Captioning

3260 papers • 126 benchmarks • 313 datasets

Zero-shot audio captioning aims at automatically generating descriptive textual captions for audio content without any prior training for this task. Audio captioning is commonly concerned with ambient sounds, or sounds produced by a human performing an action.

(Image credit: Papersgraph)

Benchmarks

These leaderboards are used to track progress in zero-shot-audio-captioning-19

Trend

Dataset

Best Model

Actions

AudioCaps

AudioCaps

Clotho

Clotho

Libraries

i

Use these libraries to find zero-shot-audio-captioning-19 models and implementations

Datasets

No datasets available.

Subtasks

No subtasks available.

Most implemented papers

Content

Introduction Benchmarks Datasets Subtasks Libraries Papers

Adding a benchmark result helps the community track progress.

Zero-shot Audio Captioning | State-of-the-Art