3260 papers • 126 benchmarks • 313 datasets
This task has no description! Would you like to contribute one?
(Image credit: Papersgraph)
These leaderboards are used to track progress in audio-denoising-11
Use these libraries to find audio-denoising-11 models and implementations
No datasets available.
No subtasks available.
This work introduces a co-separation training paradigm that permits learning object-level sounds from unlabeled multi-source videos, and obtains state-of-the-art results on visually-guided audio source separation and audio denoising for the MUSIC, AudioSet, and AV-Bench datasets.
This work is the first to learn audio source separation from large-scale "in the wild" videos containing multiple audio sources per video, and obtains state-of-the-art results on visually-aided audio sources separation and audio denoising.
This work shows that speech denoising deep neural networks can be successfully trained utilizing only noisy training audio, and it is revealed that such training regimes achieve superiorDenoising performance over conventional training regimes utilizing clean training audio targets, in cases involving complex noise distributions and low Signal-to-Noise ratios (high noise environments).
This paper proposes a new U-Net based prior that does not impact either the network complexity or convergence behavior of existing convolutional architectures, yet leads to significantly improved restoration, and advocates the use of carefully designed dilation schedules and dense connections in the U- net architecture to obtain powerful audio priors.
Experimental results show that the proposed method not only eliminates the high dependence on clean targets of traditional audio denoising tasks, but also achieves on-par or better performance than other training strategies.
The first to transfer the audio denoising problem into an image segmentation problem and propose a deep visual audio Denoising (DVAD) model, which achieves state-of-the-art performance and can be easily generalized to speech denoisation, audio separation, audio enhancement, and noise estimation.
The Intel Neuromorphic Deep Noise Suppression Challenge (Intel N-DNS Challenge), inspired by the Microsoft DNS Challenge, tackles a ubiquitous and commercially relevant task: real-time audio denoising.
A complex image generation SwinTransformer network is developed and structure similarity and detailed loss functions are imposed to generate high-quality images and an SDR loss is developed to minimize the difference between denoised and clean audios.
Adding a benchmark result helps the community track progress.