Vietnamese large vocabulary continuous speech recognition

Ngoc Thang Vu; T. Schultz

doi:10.1109/ASRU.2009.5373424

Source

2009 IEEE Workshop on Automatic Speech Recognition&Understanding > 333 - 338

Abstract

We report on our recent efforts toward a large vocabulary Vietnamese speech recognition system. In particular, we describe the Vietnamese text and speech database recently collected as part of our GlobalPhone corpus. The data was complemented by a large collection of text data crawled from various Vietnamese websites. To bootstrap the Vietnamese speech recognition system we used our Rapid Language Adaptation scheme applying a multilingual phone inventory. After initialization we investigated the peculiarities of the Vietnamese language and achieved significant improvements by implementing different tone modeling schemes, extended by pitch extraction, handling multiwords to address the monosyllable structure of Vietnamese, and featuring language modeling based on 5-grams. Furthermore, we addressed the issue of dialectal variations between South and North Vietnam by creating dialect dependent pronunciations and including dialect in the context decision tree of the recognizer. Our currently best recognition system achieves a word error rate of 11.7% on read newspaper speech.

Identifiers

book ISBN :	978-1-4244-5478-5
book e-ISBN :	978-1-4244-5479-2
DOI	10.1109/ASRU.2009.5373424

Keywords

speech recognition database management systems decision trees context decision tree Vietnamese large vocabulary continuous speech recognition Vietnamese text speech database GlobalPhone corpus Vietnamese websites rapid language adaptation scheme multilingual phone inventory pitch extraction Speech Data models Biological system modeling Adaptation model Training Context modeling

Additional information

Data set: ieee

Publisher

IEEE

INFONA - science communication portal

Vietnamese large vocabulary continuous speech recognition

Source

Abstract

Identifiers

Authors

Ngoc Thang Vu

Schultz, T.

Keywords

Additional information

Publisher


Assign to other user
	×
Wrong email address

INFONA - science communication portal

Vietnamese large vocabulary continuous speech recognition $("#expandableTitles").expandable();

Source

Abstract

Identifiers

Authors

User assignment

Assignment remove confirmation

You're going to remove this assignment. Are you sure?

Ngoc Thang Vu

Schultz, T.

Keywords

Additional information

Publisher

Share

Export to bibliography

Reporting an error / abuse

Sending the report failed

Accessibility options

Vietnamese large vocabulary continuous speech recognition