Search results for: Giambattista Parascandolo

Items 1 to 6 of 6 results

chapter

Low latency sound source separation using convolutional recurrent neural networks

Gaurav Naithani, Tom Barker, Giambattista Parascandolo, Lars Bramslow, et al.

2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) > 71 - 75

Deep neural networks (DNN) have been successfully employed for the problem of monaural sound source separation achieving state-of-the-art results. In this paper, we propose using a convolutional recurrent neural network (CRNN) architecture for tackling this problem. We focus on a scenario where low algorithmic delay (< 10 ms) is paramount, and relatively little training data is available. We show...

chapter

Convolutional recurrent neural networks for bird audio detection

Emre Cakir, Sharath Adavanne, Giambattista Parascandolo, Konstantinos Drossos, et al.

2017 25th European Signal Processing Conference (EUSIPCO) > 1744 - 1748

Bird sounds possess distinctive spectral structure which may exhibit small shifts in spectrum depending on the bird species and environmental conditions. In this paper, we propose using convolutional recurrent neural networks on the task of automated bird audio detection in real-life environments. In the proposed method, convolutional layers extract high dimensional, local frequency shift invariant...

article

Convolutional Recurrent Neural Networks for Polyphonic Sound Event Detection

Emre Cakir, Giambattista Parascandolo, Toni Heittola, Heikki Huttunen, et al.

IEEE/ACM Transactions on Audio, Speech, and Language Processing > 2017 > 25 > 6 > 1291 - 1303

Sound events often occur in unstructured environments where they exhibit wide variations in their frequency content and temporal structure. Convolutional neural networks (CNNs) are able to extract higher level features that are invariant to local spectral and temporal variations. Recurrent neural networks (RNNs) are powerful in learning the longer term temporal context in the audio signals. CNNs and...

chapter

A convolutional neural network approach for acoustic scene classification

Michele Valenti, Stefano Squartini, Aleksandr Diment, Giambattista Parascandolo, et al.

2017 International Joint Conference on Neural Networks (IJCNN) > 1547 - 1554

This paper presents a novel application of convolutional neural networks (CNNs) for the task of acoustic scene classification (ASC). We here propose the use of a CNN trained to classify short sequences of audio, represented by their log-mel spectrogram. We also introduce a training method that can be used under particular circumstances in order to make full use of small datasets. The proposed system...

chapter

Low-latency sound source separation using deep neural networks

Gaurav Naithani, Giambattista Parascandolo, Tom Barker, Niels Henrik Pontoppidan, et al.

2016 IEEE Global Conference on Signal and Information Processing (GlobalSIP) > 272 - 276

Sound source separation at low-latency requires that each incoming frame of audio data be processed at very low delay, and outputted as soon as possible. For practical purposes involving human listeners, a 20 ms algorithmic delay is the uppermost limit which is comfortable to the listener. In this paper, we propose a low-latency (algorithmic delay < 20 ms) deep neural network (DNN) based source...

chapter

Recurrent neural networks for polyphonic sound event detection in real life recordings

Giambattista Parascandolo, Heikki Huttunen, Tuomas Virtanen

2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 6440 - 6444

In this paper we present an approach to polyphonic sound event detection in real life recordings based on bi-directional long short term memory (BLSTM) recurrent neural networks (RNNs). A single multilabel BLSTM RNN is trained to map acoustic features of a mixture signal consisting of sounds from multiple classes, to binary activity indicators of each event class. Our method is tested on a large database...


INFONA - science communication portal
