Automatic detection of pathological voice is a challenging task in speech processing. Appropriate acoustic cues can be used to differentiate normal voices from pathological voices. We propose a method that represents each speech utterance with three types of speech signal representation: a cross-correlation matrix, a Gaussian distribution, and a linear subspace. Various kernels are applied to these representations to measure similarity and difference. Four classifiers, i.e., k-nearest neighbors (KNN), kernel partial least squares, kernel SVM, and logistic regression, are compared in terms of classification performance. Finally, a simple fusion of the classifiers learned from the different acoustic representations is carried out at the score level to enhance performance. The classifiers are evaluated on the Interspeech 2012 challenge development and test sets, and their contributions within the fusion scheme are studied. The fusion system attains an accuracy of 78.0% on the test set, an absolute gain of 9.1% over the challenge baseline of 68.9%.
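The score-level fusion mentioned above can be sketched as follows. This is a minimal illustration assuming a simple (weighted) average of per-classifier scores followed by a threshold; the pairing of representations with classifiers, the scores, and the threshold are illustrative assumptions, not the paper's exact configuration.

```python
import numpy as np

def fuse_scores(score_lists, weights=None):
    """Fuse per-classifier scores (one array per classifier) by weighted averaging."""
    scores = np.vstack(score_lists)            # shape: (n_classifiers, n_samples)
    if weights is None:
        weights = np.ones(len(score_lists)) / len(score_lists)
    return np.asarray(weights) @ scores        # weighted mean score per sample

# Toy scores from three classifiers, each trained on a different representation
# (the representation/classifier pairings below are hypothetical)
s_corr  = np.array([0.9, 0.2, 0.6])   # cross-correlation matrix + kernel SVM
s_gauss = np.array([0.8, 0.4, 0.3])   # Gaussian distribution + kernel PLS
s_subsp = np.array([0.7, 0.1, 0.5])   # linear subspace + KNN

fused = fuse_scores([s_corr, s_gauss, s_subsp])
labels = (fused >= 0.5).astype(int)   # 1 = pathological, 0 = normal
print(labels)                          # → [1 0 0]
```

With equal weights this reduces to a plain average; unequal weights could be tuned on the development set to emphasize the stronger representations.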