In this paper, we demonstrate the importance of face-voice correlation for audio-visual person recognition. We evaluate a system that exploits the correlation between audio and visual features during speech against audio-only, video-only, and audio-visual systems that treat audio and visual features independently, neglecting the interdependency between a person's spoken utterance and the associated facial movements. Experiments on the VidTIMIT dataset show that the proposed multimodal scheme achieves a lower error rate than all comparison conditions and is more robust against replay attacks. The simplicity of the fusion technique also permits the use of a single classifier, which greatly simplifies system design and enables a straightforward real-time DSP implementation.
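
As a minimal illustration of the feature-level fusion idea summarized above (a hypothetical sketch with synthetic data, not the paper's actual features or classifier), synchronized audio and visual feature vectors can be concatenated per frame so that a single classifier models the joint audio-visual vector, and hence the cross-modal correlation:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical per-frame features: a 13-dim audio vector (e.g. MFCCs) and
# a 10-dim visual vector (e.g. lip-region measurements), time-synchronized.
def make_person(audio_mean, visual_mean, n_frames=200):
    audio = rng.normal(audio_mean, 1.0, size=(n_frames, 13))
    visual = rng.normal(visual_mean, 1.0, size=(n_frames, 10))
    # Feature-level fusion: concatenate the synchronized streams so one
    # classifier sees the joint audio-visual vector for each frame.
    return np.hstack([audio, visual])

# Toy enrollment data for two identities (assumed names, for illustration).
train = {name: make_person(m, m) for name, m in [("alice", 0.0), ("bob", 2.0)]}

# A single classifier for both modalities: nearest centroid on fused vectors.
centroids = {name: feats.mean(axis=0) for name, feats in train.items()}

def identify(fused_frames):
    # Average the fused frames over the utterance, then pick the
    # enrolled identity whose centroid is closest.
    mean_vec = fused_frames.mean(axis=0)
    return min(centroids, key=lambda n: np.linalg.norm(mean_vec - centroids[n]))

probe = make_person(2.0, 2.0, n_frames=50)
print(identify(probe))  # prints "bob"
```

Because fusion happens at the feature level, only one classifier is trained and evaluated, which is the property the abstract credits with simplifying a real-time DSP implementation.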