Wyniki wyszukiwania dla: Sharath Adavanne

Pozycje od 1 do 5 spośród 5 wyników

rozdział

Automated audio captioning with recurrent neural networks

Konstantinos Drossos, Sharath Adavanne, Tuomas Virtanen

2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) > 374 - 378

2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

We present the first approach to automated audio captioning. We employ an encoder-decoder scheme with an alignment model in between. The input to the encoder is a sequence of log mel-band energies calculated from an audio file, while the output is a sequence of words, i.e. a caption. The encoder is a multi-layered, bi-directional gated recurrent unit (GRU) and the decoder a multi-layered GRU with...

rozdział

Stacked convolutional and recurrent neural networks for bird audio detection

Sharath Adavanne, Konstantinos Drossos, Emre Cakir, Tuomas Virtanen

2017 25th European Signal Processing Conference (EUSIPCO) > 1729 - 1733

2017 25th European Signal Processing Conference (EUSIPCO)

This paper studies the detection of bird calls in audio segments using stacked convolutional and recurrent neural networks. Data augmentation by blocks mixing and domain adaptation using a novel method of test mixing are proposed and evaluated in regard to making the method robust to unseen data. The contributions of two kinds of acoustic features (dominant frequency and log mel-band energy) and their...

rozdział

Convolutional recurrent neural networks for bird audio detection

Emre Cakir, Sharath Adavanne, Giambattista Parascandolo, Konstantinos Drossos, więcej

2017 25th European Signal Processing Conference (EUSIPCO) > 1744 - 1748

2017 25th European Signal Processing Conference (EUSIPCO)

Bird sounds possess distinctive spectral structure which may exhibit small shifts in spectrum depending on the bird species and environmental conditions. In this paper, we propose using convolutional recurrent neural networks on the task of automated bird audio detection in real-life environments. In the proposed method, convolutional layers extract high dimensional, local frequency shift invariant...

rozdział

Assessment of support vector machines and convolutional neural networks to detect snoring using Emfit mattress

Jose M. Perez-Macias, Sharath Adavanne, Jari Viik, Alpo Varri, więcej

2017 39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC) > 2883 - 2886

2017 39th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC)

Snoring (SN) is an essential feature of sleep breathing disorders, such as obstructive sleep apnea (OSA). In this study, we evaluate epoch-based snoring detection methods using an unobtrusive electromechanical film transducer (Emfit) mattress sensor using polysomnography recordings as a reference. Two different approaches were investigated: a support vector machine (SVM) classifier fed with a subset...

rozdział

Sound event detection using spatial features and convolutional recurrent neural network

Sharath Adavanne, Pasi Pertila, Tuomas Virtanen

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 771 - 775

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper proposes to use low-level spatial features extracted from multichannel audio for sound event detection. We extend the convolutional recurrent neural network to handle more than one type of these multichannel features by learning from each of them separately in the initial stages. We show that instead of concatenating the features of each channel into a single feature vector the network...

Opcje filtrowania

Data publikacji

Ustaw własny zakres dat

Słowa kluczowe

FEATURE EXTRACTION (4)
RECURRENT NEURAL NETWORKS (4)
BIRDS (2)
TRAINING (2)
ACOUSTICS (1)
ATTENTION MECHANISM (1)
AUDIO CAPTIONING (1)
CONTEXT (1)
CONVOLUTION (1)
CONVOLUTIONAL RECURRENT NEURAL NETWORK (1)
CORRELATION (1)
DECODING (1)
ELECTRONIC MAIL (1)
EVENT DETECTION (1)
GATED RECURRENT UNIT (1)
GRU (1)
HARMONIC ANALYSIS (1)
LIBRARIES (1)
MEASUREMENT (1)
MULTICHANNEL AUDIO (1)
RNN (1)
SOUND EVENT DETECTION (1)
SPATIAL FEATURES (1)
TIME-FREQUENCY ANALYSIS (1)
TWO DIMENSIONAL DISPLAYS (1)
więcej

INFONA - portal komunikacji naukowej

Wyniki wyszukiwania dla: Sharath Adavanne

Automated audio captioning with recurrent neural networks

Stacked convolutional and recurrent neural networks for bird audio detection

Convolutional recurrent neural networks for bird audio detection

Assessment of support vector machines and convolutional neural networks to detect snoring using Emfit mattress

Sound event detection using spatial features and convolutional recurrent neural network

Dodaj adresata

Anulowanie wysłania wiadomości

Czy na pewno chcesz anulować wysłanie wiadomości?

Wyślij wiadomość

Opcje filtrowania

Data publikacji

Ustawianie zakresu dat

Podaj zakres dat dla filtrowania wyświetlonych wyników. Możesz podać datę początkową, końcową lub obie daty. Daty możesz wpisać ręcznie lub wybrać za pomocą kalendarza.

Słowa kluczowe

Zgłaszanie błędu / nadużycia

Nieudane wysłanie zgłoszenia

Ułatwienia dostępu