3260 papers • 126 benchmarks • 313 datasets
This task has no description! Would you like to contribute one?
(Image credit: Papersgraph)
These leaderboards are used to track progress in melody-extraction-4
No benchmarks available.
Use these libraries to find melody-extraction-4 models and implementations
No subtasks available.
A novel streamlined encoder/decoder network that is designed for melody extraction in polyphonic musical audio can achieve result close to the state-of-the-art with much fewer convolutional layers and simpler convolution modules.
An algorithm to detect on variable length audio based on Sound Event Detection by FCN for musical instrument playing technique detection and chooses Erhu, a well-known Chinese bowed-stringed instrument, to experiment with the method.
This article presents a benchmark study of symbolic piano music classification using the masked language modelling approach of the Bidirectional Encoder Representations from Transformers (BERT), and shows that the BERT approach leads to higher classification accuracy than recurrent neural network (RNN)-based baselines.
TONet1, a plug-and-play model that improves both tone and octave perceptions by leveraging a novel input representation and a novel network architecture, is proposed and results show that tone-octave fusion with Tone-CFP can significantly improve the singing voice extraction performance across various datasets.
This paper proposes an input feature modification and a training objective modification based on two assumptions of harmonics in the spectrograms of audio data decay rapidly along the frequency axis to enhance the model's sensitivity on the trailing harmonics.
Adding a benchmark result helps the community track progress.