Search results for: Bin Ma

Items from 1 to 7 out of 7 results

chapter

Efficient methods to train multilingual bottleneck feature extractors for low resource keyword search

Chongjia Ni, Cheung-Chi Leung, Lei Wang, Nancy F. Chen, et al.

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5650 - 5654

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Training a bottleneck feature (BNF) extractor with multilingual data has been common in low resource keyword search. In a low resource application, the amount of transcribed target language data is limited while there is usually plenty of multilingual data. In this paper, we investigated two methods to train efficient multilingual BNF extractors for low resource keyword search. One method is to use...

chapter

Submodular data selection with acoustic and phonetic features for automatic speech recognition

Chongjia Ni, Lei Wang, Haibo Liu, Cheung-Chi Leung, et al.

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4629 - 4633

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper, we propose to use acoustic feature based submodular function optimization to select a subset of untranscribed data for manual transcription, and retrain the initial acoustic model with the additional transcribed data. The acoustic features are obtained from an unsupervised Gaussian mixture model. We also integrate the acoustic features with the phonetic features, which are obtained...

chapter

An acoustic segment modeling approach to query-by-example spoken term detection

Haipeng Wang, Cheung-Chi Leung, Tan Lee, Bin Ma, et al.

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5157 - 5160

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

The framework of posteriorgram-based template matching has been shown to be successful for query-by-example spoken term detection (STD). This framework employs a tokenizer to convert query examples and test utterances into frame-level posteriorgrams, and applies dynamic time warping to match the query posteriorgrams with test posteriorgrams to locate possible occurrences of the query term. It is not...

chapter

Framewise Phone Classification Using Weighted Fuzzy Classification Rules

Omid Dehzangi, Bin Ma, Eng Siong Chng, Haizhou Li

2010 20th International Conference on Pattern Recognition > 4186 - 4189

2010 20th International Conference on Pattern Recognition (ICPR 2010)

Our aim in this paper is to propose a rule-weight learning algorithm in fuzzy rule-based classifiers. The proposed algorithm is presented in two modes: first, all training examples are assumed to be equally important and the algorithm attempts to minimize the error-rate of the classifier on the training data by adjusting the weight of each fuzzy rule in the rule-base, and second, a weight is assigned...

chapter

Soft margin estimation of Gaussian mixture model parameters for spoken language recognition

Donglai Zhu, Bin Ma, Haizhou Li

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 4990 - 4993

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

This paper extends our previous work on large margin estimation (LME) of GMM parameters with extended Baum-Welch (EBW) for spoken language recognition. To overcome the problem in the LME that negative samples in the training set are not used in parameter estimation, we propose a soft margin estimation (SME) method in this paper. The soft margin is scaled by a loss function measuring the distance between...

chapter

Subspace construction and selection for speaker recognition

Yanhua Long, Wu Guo, Bin Ma, Eng Siong Chng, et al.

2009 7th International Conference on Information, Communications and Signal Processing (ICICS) > 1 - 4

2009 7th International Conference on Information, Communications & Signal Processing (ICICS)

In this paper, we propose a subspace construction and selection strategy (SUBS) for speaker recognition with limited training and testing speech data. Based on the individual Gaussian distributions of Gaussian mixture model (GMM), each speaker's characteristic subspace is constructed by training an SVM using the corresponding Gaussian mean vectors from the GMMs of both enrollment and imposter speakers...

chapter

A Lattice-Based Phonotactic Language Recognition System with CMLLR Adaptation and Its Implementation Issues

Cheung-Chi Leung, Rong Tong, Bin Ma, Haizhou Li

2009 International Conference on Asian Language Processing > 285 - 288

2009 International Conference on Asian Language Processing (IALP 2009)

This paper presents a "non-complicated" automatic spoken language recognition system which can be effectively implemented using publicly available toolkits (such as HTK, SRILM and SVM-Light) and corpus resources (such as Switchboard, CallFriend, OHSU and NIST LRE07 speech corpora). This system involves two context-independent phone recognizers, a vector space modelling classifier and an equal weight...



INFONA - science communication portal

