Non-negative matrix factorization as noise-robust feature extractor for speech recognition

B Schuller; F Weninger; M Wöllmer; Y Sun; G Rigoll

doi:10.1109/ICASSP.2010.5495567

Source

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 4562-4565

Abstract

We introduce a novel approach for noise-robust feature extraction in speech recognition, based on non-negative matrix factorization (NMF). While NMF has previously been used for speech denoising and speaker separation, we directly extract time-varying features from the NMF output. To this end, we extend basic unsupervised NMF to a hybrid supervised/unsupervised algorithm. We present a Dynamic Bayesian Network (DBN) architecture that can exploit these features in a Tandem manner together with the maximum likelihood phoneme estimate of a bidirectional long short-term memory (BLSTM) recurrent neural network. We show that adding NMF features to spelling recognition systems can increase word accuracy by up to 7% absolute in a noisy car environment.
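The abstract does not spell out the hybrid supervised/unsupervised algorithm, so the following is only a minimal sketch of one common way such a scheme can be set up, not the authors' implementation. It assumes a non-negative spectrogram V, a set of pre-trained basis columns that stay fixed (the supervised part), and a few extra basis columns learned from the signal itself (the unsupervised part), using the standard multiplicative updates for the Euclidean cost. The function name `hybrid_nmf`, its parameters, and the synthetic data are all hypothetical.

```python
import numpy as np


def hybrid_nmf(V, W_fixed, n_free, n_iter=200, eps=1e-9, seed=0):
    """Approximate V ~= W @ H with non-negative factors.

    Assumption for illustration: the first columns of W are given
    (W_fixed) and never updated; n_free extra columns are learned.
    """
    rng = np.random.default_rng(seed)
    n_bins, n_frames = V.shape
    n_fixed = W_fixed.shape[1]

    # Fixed (supervised) bases followed by randomly initialized free bases.
    W = np.hstack([W_fixed, rng.random((n_bins, n_free)) + eps])
    H = rng.random((n_fixed + n_free, n_frames)) + eps

    for _ in range(n_iter):
        # Multiplicative update of all activations (Euclidean cost).
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        # Multiplicative update restricted to the free basis columns.
        H_free = H[n_fixed:]
        W[:, n_fixed:] *= (V @ H_free.T) / ((W @ H) @ H_free.T + eps)
    return W, H


# Synthetic stand-in for a magnitude spectrogram (hypothetical data).
rng = np.random.default_rng(1)
W_true = rng.random((40, 5))
H_true = rng.random((5, 100))
V = W_true @ H_true

W, H = hybrid_nmf(V, W_fixed=W_true[:, :3], n_free=2)
rel_err = np.linalg.norm(V - W @ H) / np.linalg.norm(V)
print(f"relative reconstruction error: {rel_err:.4f}")
```

In this sketch each row of H is a time-varying activation track, one value per frame, which is the kind of quantity that could serve as a feature stream; how the paper actually derives its features from the NMF output is not stated in this record.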

Identifiers

Book ISSN: 1520-6149
Book ISBN: 978-1-4244-4295-9
Book e-ISBN: 978-1-4244-4296-6
DOI: 10.1109/ICASSP.2010.5495567

Keywords

word processing; belief networks; feature extraction; matrix decomposition; maximum likelihood estimation; recurrent neural nets; signal denoising; speech recognition; noisy car environment; nonnegative matrix factorization; noise robust feature extractor; speech denoising; speaker separation; time varying feature extraction; unsupervised NMF; hybrid supervised-unsupervised algorithm; dynamic Bayesian network architecture; bidirectional long short term memory recurrent neural network; spelling recognition system; Indexes; Speech; Noise; Noise measurement; Mel frequency cepstral coefficient; Long Short-Term Memory; Non-Negative Matrix Factorization; Noise robustness; Dynamic Bayesian Networks

Additional information

Data set: ieee

Publisher

IEEE
