Latent topic modeling of word vicinity information for speech recognition

Kuan-Yu Chen; Hsuan-Sheng Chiu; Berlin Chen

doi:10.1109/ICASSP.2010.5494942

Source

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 5394 - 5397

Abstract

Topic language models, mostly revolving around the discovery of “word-document” co-occurrence dependence, have attracted significant attention and shown good performance in a wide variety of speech recognition tasks over the years. In this paper, a new topic language model, named word vicinity model (WVM), is proposed to explore the co-occurrence relationship between words, as well as the long-span latent topical information for language model adaptation. A search history is modeled as a composite WVM model for predicting a decoded word. The underlying characteristics and different kinds of model structures are extensively investigated, while the performance of WVM is thoroughly analyzed and verified by comparison with a few existing topic language models. Moreover, we also present a new modeling approach to our recently proposed word topic model (WTM), and design an efficient way to simultaneously extract “word-document” and “word-word” co-occurrence characteristics through the sharing of the same set of latent topics. Experiments on broadcast news transcription seem to demonstrate the utility of the presented models.

Identifiers

book ISSN :	1520-6149
book ISBN :	978-1-4244-4295-9
book e-ISBN :	978-1-4244-4296-6
DOI	10.1109/ICASSP.2010.5494942

Keywords

speech recognition natural language processing long span latent topical information latent topic modeling word vicinity information topic language models word vicinity model Adaptation model History Training Predictive models Speech Mathematical model broadcast news transcription topic language model

Additional information

Data set: ieee

Publisher

IEEE

INFONA - science communication portal

Latent topic modeling of word vicinity information for speech recognition

Source

Abstract

Identifiers

Authors

Kuan-Yu Chen

Hsuan-Sheng Chiu

Chen, B.

Keywords

Additional information

Publisher


Assign to other user
	×
Wrong email address

INFONA - science communication portal

Latent topic modeling of word vicinity information for speech recognition $("#expandableTitles").expandable();

Source

Abstract

Identifiers

Authors

User assignment

Assignment remove confirmation

You're going to remove this assignment. Are you sure?

Kuan-Yu Chen

Hsuan-Sheng Chiu

Chen, B.

Keywords

Additional information

Publisher

Share

Export to bibliography

Reporting an error / abuse

Sending the report failed

Accessibility options

Latent topic modeling of word vicinity information for speech recognition