3260 papers • 126 benchmarks • 313 datasets
This task has no description! Would you like to contribute one?
(Image credit: Papersgraph)
These leaderboards are used to track progress in voice-similarity-7
No benchmarks available.
Use these libraries to find voice-similarity-7 models and implementations
No datasets available.
No subtasks available.
The YourTTS model builds upon the VITS model and adds several novel modifications for zero-shot multi-speaker and multilingual training, achieving state-of-the-art (SOTA) results in zero- shot multi- Speaker TTS and results comparable to SOTA in zero -shot voice conversion on the VCTK dataset.
This paper proposes the first intuitive visualisation of pseudonymisation performance for speech signals and two novel metrics for objective assessment that reflect the two, key pseudonymisation requirements of de-identification and voice distinctiveness.
This work proposes a framework for training singer identity encoders to extract representations suitable for various singing-related tasks, such as singing voice similarity and synthesis, and evaluates the quality of the resulting representations on singer similarity and identification tasks across multiple datasets.
Adding a benchmark result helps the community track progress.