Search results for: F. Chen

Items from 1 to 11 out of 11 results

chapter

Efficient methods to train multilingual bottleneck feature extractors for low resource keyword search

Chongjia Ni, Cheung-Chi Leung, Lei Wang, Nancy F. Chen, more

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5650 - 5654

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Training a bottleneck feature (BNF) extractor with multilingual data has been common in low resource keyword search. In a low resource application, the amount of transcribed target language data is limited while there are usually plenty of multilingual data. In this paper, we investigated two methods to train efficient multilingual BNF extractors for low resource keyword search. One method is to use...

chapter

Using tone-based extended recognition network to detect non-native Mandarin tone mispronunciations

Wei Li, Sabato Marco Siniscalchi, Nancy F. Chen, Chin-Hui Lee

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 4

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

In this paper, we investigate a DNN tone-based extended recognition network (ERN) approach to Mandarin tone recognition and tone mispronunciation detection. Given a toneless syllable sequence, a tone-based ERN is constructed by assigning five different tones to each toneless syllable, obtaining a fully expanded tonal syllable network. Next, Viterbi decoding is carried out on the tone-based ERN to...

chapter

Computer-assisted pronunciation training: From pronunciation scoring towards spoken language learning

Nancy F. Chen, Haizhou Li

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 7

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

This paper reviews the research approaches used in computer-assisted pronunciation training (CAPT), addresses the existing challenges, and discusses emerging trends and opportunities. To complement existing work, our analysis places more emphasis on pronunciation teaching and learning (as opposed to pronunciation assessment), prosodic error detection (as opposed to phonetic error detection), and research...

chapter

Speech recognition of under-resourced languages using mismatched transcriptions

Van Hai Do, Nancy F. Chen, Boon Pang Lim, Mark Hasegawa-Johnson

2016 International Conference on Asian Language Processing (IALP) > 112 - 115

2016 International Conference on Asian Language Processing (IALP)

Mismatched crowdsourcing is a technique to derive speech transcriptions using crowd-workers unfamiliar with the language being spoken. This technique is especially useful for under-resourced languages since it is hard to hire native transcribers. In this paper, we demonstrate that using mismatched transcription for adaptation improves performance of speech recognition under limited matched training...

chapter

A many-to-one phone mapping approach for cross-lingual speech recognition

Van Hai Do, Nancy F. Chen, Boon Pang Lim, Mark Hasegawa-Johnson

2016 IEEE RIVF International Conference on Computing & Communication Technologies, Research, Innovation, and Vision for the Future (RIVF) > 120 - 124

2016 IEEE RIVF International Conference on Computing & Communication Technologies, Research, Innovation, and Vision for the Future (RIVF)

This paper presents a novel method for acoustic modeling of an under-resourced language by “mapping” from acoustic models of well-resourced languages. The proposed method can be considered as a “many-to-one mapping” method where one speech unit in the target language is built as a linear combination of the source speech unit models and hence we can explicitly observe the relationship of the source...

chapter

Low-resource keyword search strategies for tamil

Nancy F. Chen, Chongjia Ni, I-Fan Chen, Sunil Sivadas, more

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5366 - 5370

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We propose strategies for a state-of-the-art keyword search (KWS) system developed by the SINGA team in the context of the 2014 NIST Open Keyword Search Evaluation (OpenKWS14) using conversational Tamil provided by the IARPA Babel program. To tackle low-resource challenges and the rich morphological nature of Tamil, we present highlights of our current KWS system, including: (1) Submodular optimization...

chapter

A keyword-aware grammar framework for LVCSR-based spoken keyword search

I-Fan Chen, Chongjia Ni, Boon Pang Lim, Nancy F. Chen, more

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5196 - 5200

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper, we proposed a method to realize the recently developed keyword-aware grammar for LVCSR-based keyword search using weight finite-state automata (WFSA). The approach creates a compact and deterministic grammar WFSA by inserting keyword paths to an existing n-gram WFSA. Tested on the evalpart1 data of the IARPA Babel OpenKWS13 Vietnamese and OpenKWS14 Tamil limitedlanguage pack tasks,...

chapter

Unsupervised data selection and word-morph mixed language model for tamil low-resource keyword search

Chongjia Ni, Cheung-Chi Leung, Lei Wang, Nancy F. Chen, more

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4714 - 4718

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper considers an unsupervised data selection problem for the training data of an acoustic model and the vocabulary coverage of a keyword search system in low-resource settings. We propose to use Gaussian component index based n-grams as acoustic features in a submodular function for unsupervised data selection. The submodular function provides a near-optimal solution in terms of the objective...

article

Converting Neural Network Language Models into Back-off Language Models for Efficient Decoding in Automatic Speech Recognition

Ebru Arisoy, Stanley F. Chen, Bhuvana Ramabhadran, Abhinav Sethy

IEEE/ACM Transactions on Audio, Speech, and Language Processing > 2014 > 22 > 1 > 184 - 192

Neural network language models (NNLMs) have achieved very good performance in large-vocabulary continuous speech recognition (LVCSR) systems. Because decoding with NNLMs is computationally expensive, there is interest in developing methods to approximate NNLMs with simpler language models that are suitable for fast decoding. In this work, we propose an approximate method for converting a feedforward...

chapter

Distributed training of large scale exponential language models

Abhinav Sethy, Stanley F. Chen, Bhuvana Ramabhadran

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5520 - 5523

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Shrinkage-based exponential language models, such as the recently introduced Model M, have provided significant gains over a range of tasks [1]. Training such models requires a large amount of computational resources in terms of both time and memory. In this paper, we present a distributed training algorithm for such models based on the idea of cluster expansion [2]. Cluster expansion allows us to...

chapter

Context and observation driven latent variable model for human pose estimation

A. Gupta, T. Chen, F. Chen, D. Kimber, more

2008 IEEE Conference on Computer Vision and Pattern Recognition > 1 - 8

2008 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Current approaches to pose estimation and tracking can be classified into two categories: generative and discriminative. While generative approaches can accurately determine human pose from image observations, they are computationally expensive due to search in the high dimensional human pose space. On the other hand, discriminative approaches do not generalize well, but are computationally efficient...

Filter options

Keywords:
TRAINING

Publication date

Set your own date range

Publication type

book (10)
article (1)

Keywords

SPEECH (8)
ACOUSTICS (6)
SPEECH RECOGNITION (6)
DATA MODELS (4)
KEYWORD SEARCH (3)
KEYWORD SPOTTING (3)
TRAINING DATA (3)
ACTIVE LEARNING (2)
ADAPTATION MODELS (2)
COMPUTATIONAL MODELING (2)
DECODING (2)
FEATURE EXTRACTION (2)
HIDDEN MARKOV MODELS (2)
HISTORY (2)
LANGUAGE MODELING (2)
OPTIMIZATION (2)
SPOKEN TERM DETECTION (2)
AGGLUTINATIVE LANGUAGES (1)
APPROXIMATION METHODS (1)
ARTIFICIAL NEURAL NETWORKS (1)
AUTOMATA (1)
BACK-OFF LANGUAGE MODELS (1)
BUILDINGS (1)
COMPUTER ASSISTED LANGUAGE LEARNING (CALL) (1)
CONTEXT (1)
CONTEXT MODELING (1)
DATA AUGMENTATION (1)
DATA MINING (1)
DEEP NEURAL NETWORK (DNN) (1)
DISTANCE MEASUREMENT (1)
DISTRIBUTED TRAINING (1)
ENTROPY (1)
ESTIMATION (1)
EXPONENTIAL N-GRAM MODELS (1)
GAUSSIAN PROCESS LATENT VARIABLE MODEL (1)
GAUSSIAN PROCESSES (1)
GESTURE RECOGNITION (1)
GRAMMAR (1)
GRAMMAR NETWORK (1)
HUMAN POSE ESTIMATION (1)
IMAGE OBSERVATIONS (1)
IMAGE PROCESSING (1)
INDEXES (1)
INFLECTIVE LANGUAGES (1)
INTEGRATED LEARNING (1)
JOINTS (1)
KERNEL (1)
LANGUAGE IDENTIFICATION (1)
MACHINE LEARNING (1)
MEMORY MANAGEMENT (1)
MISMATCHED TRANSCRIPTION (1)
MODEL ADAPTATION (1)
MORPHOLOGY (1)
MULTILINGUAL DATA SELECTION (1)
NEURAL NETWORK LANGUAGE MODELS (1)
PARAMETERIZED GESTURES (1)
POSE ESTIMATION (1)
POSE TRACKING (1)
PREDICTIVE MODELS (1)
PRONUNCIATION TUTORING (1)
RECURRENT NEURAL NETWORK (1)
RECURRENT NEURAL NETWORKS (1)
SECOND LANGUAGE LEARNING (1)
SEMI-SUPERVISED LEARNING (1)
SPEECH AND LANGUAGE TECHNOLOGIES IN EDUCATION (1)
SPOKEN TERM DETECTION (STD) (1)
STRESS (1)
SUBMODULAR OPTIMIZATION (1)
UNDER-RESOURCED LANGUAGE (1)
UNDER-RESOURCED LANGUAGES (1)
UNSUPERVISED LEARNING (1)
VOCABULARY (1)
WEIGHTED FINITE-STATE AUTOMATON (1)
more

INFONA - science communication portal

Search results for: F. Chen

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options