Search results for: F. Chen

Items from 1 to 5 out of 5 results

chapter

Speech recognition of under-resourced languages using mismatched transcriptions

Van Hai Do, Nancy F. Chen, Boon Pang Lim, Mark Hasegawa-Johnson

2016 International Conference on Asian Language Processing (IALP) > 112 - 115

2016 International Conference on Asian Language Processing (IALP)

Mismatched crowdsourcing is a technique to derive speech transcriptions using crowd-workers unfamiliar with the language being spoken. This technique is especially useful for under-resourced languages since it is hard to hire native transcribers. In this paper, we demonstrate that using mismatched transcription for adaptation improves performance of speech recognition under limited matched training...

chapter

A many-to-one phone mapping approach for cross-lingual speech recognition

Van Hai Do, Nancy F. Chen, Boon Pang Lim, Mark Hasegawa-Johnson

2016 IEEE RIVF International Conference on Computing & Communication Technologies, Research, Innovation, and Vision for the Future (RIVF) > 120 - 124

2016 IEEE RIVF International Conference on Computing & Communication Technologies, Research, Innovation, and Vision for the Future (RIVF)

This paper presents a novel method for acoustic modeling of an under-resourced language by “mapping” from acoustic models of well-resourced languages. The proposed method can be considered as a “many-to-one mapping” method where one speech unit in the target language is built as a linear combination of the source speech unit models and hence we can explicitly observe the relationship of the source...

article

Characterizing Phonetic Transformations and Acoustic Differences Across English Dialects

Nancy F. Chen, Sharon W. Tam, Wade Shen, Joseph P. Campbell

IEEE/ACM Transactions on Audio, Speech, and Language Processing > 2014 > 22 > 1 > 110 - 124

In this work, we propose a framework that automatically discovers dialect-specific phonetic rules. These rules characterize when certain phonetic or acoustic transformations occur across dialects. To explicitly characterize these dialect-specific rules, we adapt the conventional hidden Markov model to handle insertion and deletion transformations. The proposed framework is able to convert pronunciation...

chapter

Multi-class Model M

Ahmad Emami, Stanley F. Chen

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5516 - 5519

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Model M, a novel class-based exponential language model, has been shown to significantly outperform word n-gram models in state-of-the-art machine translation and speech recognition systems. The model was motivated by the observation that shrinking the sum of the parameter magnitudes in an exponential language model leads to better performance on unseen data. Being a class-based language model, Model...

chapter

Informative dialect recognition using context-dependent pronunciation modeling

Nancy F. Chen, Wade Shen, Joseph P. Campbell, Pedro A. Torres-Carrasquillo

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4396 - 4399

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We propose an informative dialect recognition system that learns phonetic transformation rules, and uses them to identify dialects. A hidden Markov model is used to align reference phones with dialect-specific pronunciations to characterize when and how often substitutions, insertions, and deletions occur. Decision tree clustering is used to find context-dependent phonetic rules. We ran recognition...

Filter options

Keywords:
ADAPTATION MODELS
Publication language:
English

Publication date

Set your own date range

Publication type

book (4)
article (1)

INFONA - science communication portal

Search results for: F. Chen

Speech recognition of under-resourced languages using mismatched transcriptions

A many-to-one phone mapping approach for cross-lingual speech recognition

Characterizing Phonetic Transformations and Acoustic Differences Across English Dialects

Multi-class Model M

Informative dialect recognition using context-dependent pronunciation modeling

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options