A multimodal interface can enable more natural and effective human-computer interaction. In this paper, we present an isolated-word recognizer that fuses speech with natural visual gestures. Audio and visual signals can be fused either at the class level or at the feature level. Our system performs fusion at the feature level and supports 10 natural gestures. One of the most difficult problems in feature-level fusion is synchronization between the audio and visual features. To address this problem, we propose a modified time delay neural network (TDNN) architecture with a dedicated fusion layer and optimize the parameters of this recognition model. Experimental results show that the system outperforms automatic speech recognition (ASR) alone under various signal-to-noise ratio (SNR) conditions.
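
To make the idea of feature-level fusion concrete, the following is a minimal NumPy sketch: synchronized audio and visual feature frames are concatenated per frame, and a time-delay layer (a windowed linear transform, the building block of a TDNN) maps each context window of fused frames to a hidden representation. All dimensions, the window size, and the simple per-frame concatenation are illustrative assumptions, not the paper's exact architecture.

```python
import numpy as np

rng = np.random.default_rng(0)

T, D_AUDIO, D_VISUAL = 50, 13, 6   # frames; audio/visual feature dims (assumed)
CONTEXT = 5                        # time-delay window: 5 consecutive frames (assumed)
HIDDEN = 32                        # fusion-layer units (assumed)

audio = rng.standard_normal((T, D_AUDIO))    # e.g. per-frame acoustic features
visual = rng.standard_normal((T, D_VISUAL))  # e.g. per-frame gesture features

# Feature-level fusion: concatenate synchronized frames before classification.
fused = np.concatenate([audio, visual], axis=1)       # (T, D_AUDIO + D_VISUAL)

# Time-delay layer: each output frame sees a CONTEXT-frame window of the
# fused features, implemented as a matrix product over stacked windows.
W = rng.standard_normal((CONTEXT * fused.shape[1], HIDDEN)) * 0.1
b = np.zeros(HIDDEN)

windows = np.stack([fused[t:t + CONTEXT].ravel()
                    for t in range(T - CONTEXT + 1)])  # (T-CONTEXT+1, CONTEXT*D)
hidden = np.tanh(windows @ W + b)                      # (T-CONTEXT+1, HIDDEN)

print(hidden.shape)  # → (46, 32)
```

Because the window spans both modalities at once, the layer can learn cross-modal timing relationships, which is the property that makes a TDNN a natural choice for the synchronization problem described above.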