3260 papers • 126 benchmarks • 313 datasets
Speaker Diarization is the task of segmenting and co-indexing audio recordings by speaker. The way the task is commonly defined, the goal is not to identify known speakers, but to co-index segments that are attributed to the same speaker; in other words, diarization implies finding speaker boundaries and grouping segments that belong to the same speaker, and, as a by-product, determining the number of distinct speakers. In combination with speech recognition, diarization enables speaker-attributed speech-to-text transcription. Source: Improving Diarization Robustness using Diversification, Randomization and the DOVER Algorithm
(Image credit: Papersgraph)
These leaderboards are used to track progress in speaker-diarization-18
Use these libraries to find speaker-diarization-18 models and implementations
No datasets available.
No subtasks available.
Adding a benchmark result helps the community track progress.