Search results for: Isidoros Rodomagoulakis

Items from 1 to 6 out of 6 results

article

Room-localized spoken command recognition in multi-room, multi-microphone environments

Isidoros Rodomagoulakis, Athanasios Katsamanis, Gerasimos Potamianos, Panagiotis Giannoulis, more

Computer Speech & Language > 2017 > 46 > C > 419-443

The paper focuses on the design of a practical system pipeline for always-listening, far-field spoken command recognition in everyday smart indoor environments that consist of multiple rooms equipped with sparsely distributed microphone arrays. Such environments, for example domestic and multi-room offices, present challenging acoustic scenes to state-of-the-art speech recognizers, especially under...

chapter

On the improvement of modulation features using multi-microphone energy tracking for robust distant speech recognition

Isidoros Rodomagoulakis, Petros Maragos

2017 25th European Signal Processing Conference (EUSIPCO) > 558 - 562

2017 25th European Signal Processing Conference (EUSIPCO)

In this work, we investigate robust speech energy estimation and tracking schemes aiming at improved energy-based multiband speech demodulation and feature extraction for multi-microphone distant speech recognition. Based on the spatial diversity of the speech and noise recordings of a multi-microphone setup, the proposed Multichannel, Multiband Demodulation (MMD) scheme includes: 1) energy selection...

chapter

The MOBOT rollator human-robot interaction model and user evaluation process

Eleni Efthimiou, Stavroula-Evita Fotinea, Theodore Goulas, Maria Koutsombogera, more

2016 IEEE Symposium Series on Computational Intelligence (SSCI) > 1 - 8

2016 IEEE Symposium Series on Computational Intelligence (SSCI)

In this paper we discuss the integration of a communication model in the MOBOT assistive robotic platform and its evaluation by target users. The MOBOT platform envisions the development of cognitive robotic assistant prototypes that act proactively, adaptively and interactively with respect to elderly humans with slight walking and cognitive impairments. The respective multimodal action recognition...

chapter

Advances in Large Vocabulary Continuous Speech Recognition in Greek: Modeling and nonlinear features

Isidoros Rodomagoulakis, Gerasimos Potamianos, Petros Maragos

21st European Signal Processing Conference (EUSIPCO 2013) > 1 - 5

2013 21st European Signal Processing Conference (EUSIPCO)

The main goal of this work is the development of an improved Large Vocabulary Continuous Speech Recognition (LVCSR) framework in Greek. Language modeling is carried out in a collection of journalistic text and in the acoustic signal processing, a nonlinear approach is implemented for deriving features of the AM-FM type. Experimentation is carried out in both clean and simulated far-field speech offering...

chapter

Recognitionwith raw canonical phonetic movement and handshape subunits on videos of continuous Sign Language

Stavros Theodorakis, Vassilis Pitsikalis, Isidoros Rodomagoulakis, Petros Maragos

2012 19th IEEE International Conference on Image Processing > 1413 - 1416

2012 19th IEEE International Conference on Image Processing (ICIP 2012)

The visual processing of Sign Language (SL) videos offers multiple interdisciplinary challenges for image processing and recognition. Based on tracking and visual feature extraction, we investigate SL visual phonetic modeling by exploiting statistical subunit (SU) models of movement-position and handshape. We further propose a new framework to construct a data-driven lexicon that retains phonetics'...

chapter

Unsupervised classification of extreme facial events using active appearance models tracking for sign language videos

Epameinondas Antonakos, Vassilis Pitsikalis, Isidoros Rodomagoulakis, Petros Maragos

2012 19th IEEE International Conference on Image Processing > 1409 - 1412

2012 19th IEEE International Conference on Image Processing (ICIP 2012)

We propose an Unsupervised method for Extreme States Classification (UnESC) on feature spaces of facial cues of interest. The method is built upon Active Appearance Models (AAM) face tracking and on feature extraction of Global and Local AAMs. UnESC is applied primarily on facial pose, but is shown to be extendable for the case of local models on the eyes and mouth. Given the importance of facial...

Filter options

Publication date

Set your own date range

INFONA - science communication portal

Search results for: Isidoros Rodomagoulakis

Room-localized spoken command recognition in multi-room, multi-microphone environments

On the improvement of modulation features using multi-microphone energy tracking for robust distant speech recognition

The MOBOT rollator human-robot interaction model and user evaluation process

Advances in Large Vocabulary Continuous Speech Recognition in Greek: Modeling and nonlinear features

Recognitionwith raw canonical phonetic movement and handshape subunits on videos of continuous Sign Language

Unsupervised classification of extreme facial events using active appearance models tracking for sign language videos

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Data set

Reporting an error / abuse

Sending the report failed

Accessibility options