Modular BDPCA based visual feature representation for lip-reading

Guanyong Wu; Jie Zhu

doi:10.1109/ICIP.2008.4712008

Source

2008 15th IEEE International Conference on Image Processing > 1328 - 1331

Abstract

Most of the appearance based visual feature extraction methods in the lip-reading system treat the mouth image in a whole manner. However, the vision of speech process is three dimensional and treating the mouth image as a whole may lose the speech information. Motivated by the bidirectional PCA (BDPCA) and decomposition methods used in the face recognition domain, in this paper, a modular bidirectional PCA (MBDPCA) based visual feature extraction method was presented. In this method, the original mouth image sequences are divided into smaller sub-images, and two approaches are compared to build the covariance matrix: one is using all the sub-image sets together to build a global covariance matrix; the other is using the different sub-image sets independently to build the local covariance matrices. Then the BDPCA is applied to each sub-image set. Experimental results show that the MBDPCA method has a better performance than both the conventional PCA and BDPCA methods; moreover, further experimental results demonstrate that our lip-reading system provides significant enhancement of robustness in noisy environments compared to the audio-only speech recognition.

Identifiers

book ISSN :	1522-4880
book ISBN :	978-1-4244-1765-0
book e-ISBN :	978-1-4244-1764-3
DOI	10.1109/ICIP.2008.4712008

Keywords

principal component analysis covariance matrices face recognition feature extraction image sequences audio-visual speech recognition modular BDPCA based visual feature representation visual feature extraction methods lip-reading system speech process speech information decomposition methods face recognition domain original mouth image sequences global covariance matrix local covariance matrices audio-only speech recognition Covariance matrix Speech recognition Visualization Accuracy lip-reading

Additional information

Data set: ieee

Publisher

IEEE

INFONA - science communication portal

Modular BDPCA based visual feature representation for lip-reading

Source

Abstract

Identifiers

Authors

Guanyong Wu

Jie Zhu

Keywords

Additional information

Publisher


Assign to other user
	×
Wrong email address

INFONA - science communication portal

Modular BDPCA based visual feature representation for lip-reading $("#expandableTitles").expandable();

Source

Abstract

Identifiers

Authors

User assignment

Assignment remove confirmation

You're going to remove this assignment. Are you sure?

Guanyong Wu

Jie Zhu

Keywords

Additional information

Publisher

Share

Export to bibliography

Reporting an error / abuse

Sending the report failed

Accessibility options

Modular BDPCA based visual feature representation for lip-reading