Experimental results show that the proposed method not only eliminates the high dependence on clean targets of traditional audio denoising tasks, but also achieves on-par or better performance than other training strategies.
Authors
Lei Li
2 papers
Jiasong Wu
1 papers
Qingchun Li
1 papers
Guanyu Yang
1 papers
L. Senhadji
1 papers
H. Shu
1 papers
References46 items
1
Training Speech Enhancement Systems with Noisy Speech Datasets
2
CAUNet: Context-Aware U-Net for Speech Enhancement in Time Domain
3
Speech Denoising without Clean Training Data: a Noise2Noise Approach
4
MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement
5
Efficient Personalized Speech Enhancement Through Self-Supervised Learning
6
Personalized Speech Enhancement through Self-Supervised Data Augmentation and Purification
7
Noisy-target Training: A Training Strategy for DNN-based Speech Enhancement without Clean Speech
8
Neighbor2Neighbor: Self-Supervised Denoising from Single Noisy Images
9
Training Noisy Single-Channel Speech Separation with Noisy Oracle Sources: A Large Gap and a Small Step
10
Real Time Speech Enhancement in the Waveform Domain
11
Unsupervised Sound Separation Using Mixture Invariant Training
12
Self-supervised Learning for Speech Enhancement
13
HiFi-GAN: High-Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
14
Densely Connected Neural Network with Dilated Convolutions for Real-Time Speech Enhancement in The Time Domain
15
PHASEN: A Phase-and-Harmonics-Aware Speech Enhancement Network
16
Dual-Path RNN: Efficient Long Sequence Modeling for Time-Domain Single-Channel Speech Separation
17
T-GSA: Transformer with Gaussian-Weighted Self-Attention for Speech Enhancement
18
MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement
19
Sergan: Speech Enhancement Using Relativistic Generative Adversarial Networks with Gradient Penalty
20
Improving deep speech denoising by Noisy2Noisy signal mapping
21
Phase-aware Speech Enhancement with Deep Complex U-Net
22
Improved Speech Enhancement with the Wave-U-Net
23
Conv-TasNet: Surpassing Ideal Time–Frequency Magnitude Masking for Speech Separation
24
Wave-U-Net: A Multi-Scale Neural Network for End-to-End Audio Source Separation
25
Time-Frequency Masking-Based Speech Enhancement Using Generative Adversarial Network
26
Noise2Noise: Learning Image Restoration without Clean Data
27
End-to-End Waveform Utterance Enhancement for Direct Evaluation Metrics Optimization by Fully Convolutional Neural Networks
28
Noisy speech database for training speech enhancement algorithms and TTS models
DNN-based enhancement of noisy and reverberant speech
33
Complex Ratio Masking for Monaural Speech Separation
34
Speech Enhancement with LSTM Recurrent Neural Networks and its Application to Noise-Robust ASR
35
U-Net: Convolutional Networks for Biomedical Image Segmentation
36
A Dataset and Taxonomy for Urban Sound Research
37
The voice bank corpus: Design, collection and data analysis of a large regional accent speech database
38
Ideal ratio mask estimation using deep neural networks for robust speech recognition
39
The Diverse Environments Multi-channel Acoustic Noise Database (DEMAND): A database of multichannel environmental noise recordings
40
An Algorithm for Intelligibility Prediction of Time–Frequency Weighted Noisy Speech
41
Subjective comparison and evaluation of speech enhancement algorithms
42
An Experimental Study on Speech Enhancement Based on Deep Neural Networks
43
Evaluation of Objective Quality Measures for Speech Enhancement
44
On binary and ratio time-frequency masks for robust speech recognition
45
Methods for objective and subjective assessment of quality Perceptual evaluation of speech quality ( PESQ ) : An objective method for end-to-end speech quality assessment of narrow-band telephone networks and speech codecs