Domestic environments are particularly challenging for distant-speech recognition and audio processing in general. Reverberation, background noise and interfering sources, as well as the propagation of acoustic events across adjacent rooms, critically degrade the performance of standard speech processing algorithms. The DIRHA EU project addresses the development of distant-speech interaction with devices and services within the multiple rooms of typical apartments. A corpus of multichannel acoustic data has been created to represent realistic acoustic scenes, of different degrees of complexity, occurring in such an environment. It includes multichannel simulations based on measured impulse responses, as well as real data collected in the same apartment. A basic but fundamental front-end processing task enabling effective ASR is the detection and localization of speech events generated by users, without constraints on their position or orientation within the various rooms. In this paper we describe the acoustic corpus and present a baseline approach to the joint task of speech detection and source localization, using speech-related features such as pitch, combined with features derived from spatial coherence.
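To make the spatial-coherence side of such a front end concrete, a standard building block for localizing a source with a microphone pair is the GCC-PHAT time-delay estimate: the cross-power spectrum of the two channels is whitened so that only phase information remains, and the peak of its inverse transform gives the inter-microphone delay. The sketch below is a generic illustration of this technique, not the paper's actual implementation; the function name and parameters are assumptions for the example.

```python
import numpy as np

def gcc_phat(sig, ref, fs, max_tau=None):
    """Estimate the time delay (seconds) of `sig` relative to `ref` via GCC-PHAT."""
    n = len(sig) + len(ref)
    SIG = np.fft.rfft(sig, n=n)
    REF = np.fft.rfft(ref, n=n)
    R = SIG * np.conj(REF)
    R /= np.abs(R) + 1e-12            # PHAT weighting: discard magnitude, keep phase
    cc = np.fft.irfft(R, n=n)         # generalized cross-correlation
    max_shift = n // 2
    if max_tau is not None:
        max_shift = min(int(fs * max_tau), max_shift)
    # Rearrange so negative lags precede positive lags around lag zero
    cc = np.concatenate((cc[-max_shift:], cc[:max_shift + 1]))
    shift = np.argmax(np.abs(cc)) - max_shift
    return shift / fs

# Synthetic check: delay one channel of white noise by 25 samples at 16 kHz
fs = 16000
rng = np.random.default_rng(0)
x = rng.standard_normal(4096)
delay = 25
y = np.concatenate((np.zeros(delay), x[:-delay]))
tau = gcc_phat(y, x, fs)              # should recover ~25 / 16000 s
```

In a multi-room setup like the one described here, such pairwise delay estimates (and the sharpness of the correlation peak, a proxy for spatial coherence) can be combined across microphone pairs to hypothesize a source position and to help discriminate coherent speech events from diffuse noise.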