Audio onset detection: A wavelet packet based approach with recurrent neural networks

Erik Marchi; Giacomo Ferroni; Florian Eyben; Stefano Squartini; Bjorn Schuller

doi:10.1109/IJCNN.2014.6889669

Audio onset detection: A wavelet packet based approach with recurrent neural networks

Marchi, Erik, Ferroni, Giacomo, Eyben, Florian, Squartini, Stefano, Schuller, Bjorn

Źródło

2014 International Joint Conference on Neural Networks (IJCNN) > 3585 - 3591

Abstrakt

This paper concerns the exploitation of multi-resolution time-frequency features via Wavelet Packet Transform to improve audio onset detection. In our approach, Wavelet Packet Energy Coefficients (WPEC) and Auditory Spectral Features (ASF) are processed by Bidirectional Long Short-Term Memory (BLSTM) recurrent neural network that yields the onsets location. The combination of the two feature sets, together with the BLSTM based detector, form an advanced energy-based approach that takes advantage from the multi-resolution analysis given by the wavelet decomposition of the audio input signal. The neural network is trained with a large database of onset data covering various genres and onset types. Due to its data-driven nature, our approach does not require the onset detection method and its parameters to be tuned to a particular type of music. We show a comparison with other types and sizes of recurrent neural networks and we compare results with state-of-the-art methods on the whole onset dataset. We conclude that our approach significantly increase performance in terms of F-measure without any music genres or onset type constraints.

Identyfikatory

e-ISSN książki :	2161-4407
e-ISBN książki :	978-1-4799-1484-5 , 978-1-4799-6627-1
DOI	10.1109/IJCNN.2014.6889669

Autorzy

Marchi, Erik

Machine Intelligence & Signal Processing Group, Technische Universität München, Germany

Ferroni, Giacomo

A3LAB, Department of Information Engineering, Universitá Politecnica delle Marche, Italy

Eyben, Florian

Machine Intelligence & Signal Processing Group, Technische Universität München, Germany

Squartini, Stefano

A3LAB, Department of Information Engineering, Universitá Politecnica delle Marche, Italy

Zobacz wszystkich

Informacje dodatkowe

Zbiór danych: ieee

Wydawca

IEEE

rozdział

Czytaj online
Pobierz
Dodaj do przeczytania
Dodaj do kolekcji
Dodaj do obserwowanych
Podziel się

Eksport do bibliografii


Przypisz innemu użytkownikowi
	×
Niepoprawny email

INFONA - portal komunikacji naukowej

Audio onset detection: A wavelet packet based approach with recurrent neural networks $("#expandableTitles").expandable();

Źródło

Abstrakt

Identyfikatory

Autorzy

Przypisywanie użytkownika

Potwierdzenie anulowania przypisania

Czy jesteś pewien, że chcesz anulować to przypisanie?

Marchi, Erik

Ferroni, Giacomo

Eyben, Florian

Squartini, Stefano

Informacje dodatkowe

Wydawca

Podziel się

Eksport do bibliografii

Zgłaszanie błędu / nadużycia

Nieudane wysłanie zgłoszenia

Ułatwienia dostępu

Audio onset detection: A wavelet packet based approach with recurrent neural networks