3260 papers • 126 benchmarks • 313 datasets
Recognizing the genre (e.g. rock, pop, jazz, etc.) of a piece of music.
(Image credit: Papersgraph)
These leaderboards are used to track progress in music-genre-recognition-10
Use these libraries to find music-genre-recognition-10 models and implementations
No datasets available.
No subtasks available.
This work proposes to jointly learn the graph structure and the parameters of graph convolutional networks (GCNs) by approximately solving a bilevel program that learns a discrete probability distribution on the edges of the graph.
This paper develops a novel CNN architecture that takes the multi-scale time-frequency information into considerations, which transfers more suitable semantic features for the decision-making layer to discriminate the genre of the unknown music clip.
It is found that features extracted from harmonic elements can satisfactorily predict music genre for the Brazilian case, as well as features obtained from the Spotify API, also known as the random forest model.
This paper proposes a novel, lightweight method to generate animated graphical interchange format images (GIFs) using the computational resources of a client device that analyzes an acoustic feature from the climax section of an audio file to estimate the timestamp corresponding to the maximum pitch.
Multi-source domain adaptation is a key technique that allows a model to be trained on data coming from various probability distribution. To overcome the challenges posed by this learning scenario, we propose a method for constructing an intermediate domain between sources and target domain, the Wasserstein Barycenter Transport (WBT). This method relies on the barycenter on Wasserstein spaces for aggregating the source probability distributions. Once the sources have been aggregated, they are transported to the target domain using standard Optimal Transport for Domain Adaptation framework. Additionally, we revisit previous single-source domain adaptation tasks in the context of multi-source scenario. In particular, we apply our algorithm to object and face recognition datasets. Moreover, to diversify the range of applications, we also examine the tasks of music genre recognition and music-speech discrimination. The experiments show that our method has similar performance with the existing state-of-the-art.
Adding a benchmark result helps the community track progress.