F. Chen

chapter

Speech recognition of under-resourced languages using mismatched transcriptions

Van Hai Do, Nancy F. Chen, Boon Pang Lim, Mark Hasegawa-Johnson

2016 International Conference on Asian Language Processing (IALP) > 112 - 115

2016 International Conference on Asian Language Processing (IALP)

Mismatched crowdsourcing is a technique to derive speech transcriptions using crowd-workers unfamiliar with the language being spoken. This technique is especially useful for under-resourced languages since it is hard to hire native transcribers. In this paper, we demonstrate that using mismatched transcription for adaptation improves performance of speech recognition under limited matched training...

chapter

A many-to-one phone mapping approach for cross-lingual speech recognition

Van Hai Do, Nancy F. Chen, Boon Pang Lim, Mark Hasegawa-Johnson

2016 IEEE RIVF International Conference on Computing & Communication Technologies, Research, Innovation, and Vision for the Future (RIVF) > 120 - 124

2016 IEEE RIVF International Conference on Computing & Communication Technologies, Research, Innovation, and Vision for the Future (RIVF)

This paper presents a novel method for acoustic modeling of an under-resourced language by “mapping” from acoustic models of well-resourced languages. The proposed method can be considered as a “many-to-one mapping” method where one speech unit in the target language is built as a linear combination of the source speech unit models and hence we can explicitly observe the relationship of the source...

chapter

Low-resource keyword search strategies for tamil

Nancy F. Chen, Chongjia Ni, I-Fan Chen, Sunil Sivadas, more

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5366 - 5370

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We propose strategies for a state-of-the-art keyword search (KWS) system developed by the SINGA team in the context of the 2014 NIST Open Keyword Search Evaluation (OpenKWS14) using conversational Tamil provided by the IARPA Babel program. To tackle low-resource challenges and the rich morphological nature of Tamil, we present highlights of our current KWS system, including: (1) Submodular optimization...

chapter

Unsupervised data selection and word-morph mixed language model for tamil low-resource keyword search

Chongjia Ni, Cheung-Chi Leung, Lei Wang, Nancy F. Chen, more

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4714 - 4718

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper considers an unsupervised data selection problem for the training data of an acoustic model and the vocabulary coverage of a keyword search system in low-resource settings. We propose to use Gaussian component index based n-grams as acoustic features in a submodular function for unsupervised data selection. The submodular function provides a near-optimal solution in terms of the objective...

chapter

Simulating the spectral properties of iron-bearing regions of Mars using the SPLITS model

Gladimir V.G. Baranoski, Bradley W. Kimmel, Tenn F. Chen, Erik Miranda

2014 IEEE Geoscience and Remote Sensing Symposium > 3013 - 3016

IGARSS 2014 - 2014 IEEE International Geoscience and Remote Sensing Symposium

The mineralogy and environmental history of Mars are been extensively investigated through remote sensing observations paired with laboratory and in situ experiments. A significant portion of these experiments is being devoted to the identification and quantification of different iron oxides present in the Martian terrains. Although such experiments can provide valuable information regarding the presence...

chapter

Multi-class Model M

Ahmad Emami, Stanley F. Chen

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5516 - 5519

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Model M, a novel class-based exponential language model, has been shown to significantly outperform word n-gram models in state-of-the-art machine translation and speech recognition systems. The model was motivated by the observation that shrinking the sum of the parameter magnitudes in an exponential language model leads to better performance on unseen data. Being a class-based language model, Model...

INFONA - science communication portal

Search results for: F. Chen

Speech recognition of under-resourced languages using mismatched transcriptions

A many-to-one phone mapping approach for cross-lingual speech recognition

Low-resource keyword search strategies for tamil

Unsupervised data selection and word-morph mixed language model for tamil low-resource keyword search

Simulating the spectral properties of iron-bearing regions of Mars using the SPLITS model

Multi-class Model M

Filter options

Publication date

Keywords

INFONA - science communication portal

Search results for: F. Chen

Speech recognition of under-resourced languages using mismatched transcriptions

A many-to-one phone mapping approach for cross-lingual speech recognition

Low-resource keyword search strategies for tamil

Unsupervised data selection and word-morph mixed language model for tamil low-resource keyword search

Simulating the spectral properties of iron-bearing regions of Mars using the SPLITS model

Multi-class Model M

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options