Search results for: Haizhou Li

Items from 1 to 5 out of 5 results

chapter

Low-resource keyword search strategies for tamil

Nancy F. Chen, Chongjia Ni, I-Fan Chen, Sunil Sivadas, more

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5366 - 5370

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We propose strategies for a state-of-the-art keyword search (KWS) system developed by the SINGA team in the context of the 2014 NIST Open Keyword Search Evaluation (OpenKWS14) using conversational Tamil provided by the IARPA Babel program. To tackle low-resource challenges and the rich morphological nature of Tamil, we present highlights of our current KWS system, including: (1) Submodular optimization...

article

Optimization Algorithms and Applications for Speech and Language Processing

Stephen J. Wright, Dimitri Kanevsky, Li Deng, Xiaodong He, more

IEEE Transactions on Audio, Speech, and Language Processing > 2013 > 21 > 11 > 2231 - 2243

Optimization techniques have been used for many years in the formulation and solution of computational problems arising in speech and language processing. Such techniques are found in the Baum-Welch, extended Baum-Welch (EBW), Rprop, and GIS algorithms, for example. Additionally, the use of regularization terms has been seen in other applications of sparse optimization. This paper outlines a range...

chapter

Classifier subset selection and fusion for speaker verification

Filip Sedlak, Tomi Kinnunen, Ville Hautamaki, Kong-Aik Lee, more

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4544 - 4547

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

State-of-the-art speaker verification systems consists of a number of complementary subsystems whose outputs are fused, to arrive at more accurate and reliable verification decision. In speaker verification, fusion is typically implemented as a linear combination of the subsystem scores. Parameters of the linear model are commonly estimated using the logistic regression method, as implemented in the...

chapter

Soft margin estimation of Gaussian mixture model parameters for spoken language recognition

Donglai Zhu, Bin Ma, Haizhou Li

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 4990 - 4993

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

This paper extends our previous work on large margin estimation (LME) of GMM parameters with extend Baum-Welch (EBW) for spoken language recognition. To overcome the problem in the LME that negative samples in the training set are not used in parameter estimation, we propose a soft margin estimation (SME) method in this paper. The soft margin is scaled by a loss function measuring the distance between...

chapter

Refining Unit Boundaries for Mandarin Text-to-Speech Database

Minghui Dong, Ling Cen, P. Chan, Haizhou Li

2009 International Conference on Asian Language Processing > 245 - 248

2009 International Conference on Asian Language Processing (IALP 2009)

In unit selection based text-to-speech (TTS) synthesis, the accurate position of the unit boundaries in the unit selection database is one of the factors that determine the quality of the synthesized speech. To ensure the accuracy of the boundary positions, developers often have to manually verify the speech boundaries that are generated by automatic speech recognition techniques. In order to reduce...

Filter options

Keywords:
OPTIMIZATION

Publication date

Set your own date range

Publication type

book (4)
article (1)

Keywords

SPEECH (4)
SPEECH RECOGNITION (4)
TRAINING (4)
HIDDEN MARKOV MODELS (2)
MEL FREQUENCY CEPSTRAL COEFFICIENT (2)
NIST (2)
ACCURACY (1)
ACOUSTICS (1)
ACTIVE LEARNING (1)
AGGLUTINATIVE LANGUAGES (1)
AUTOMATIC SPEECH RECOGNITION TECHNIQUES (1)
CLASSIFICATION BOUNDARY (1)
CLASSIFIER SELECTION (1)
COMPUTATIONAL MODELING (1)
DATA MODELS (1)
DATABASE MANAGEMENT SYSTEMS (1)
DATABASES (1)
DEEP NEURAL NETWORK (DNN) (1)
EBW ALGORITHM (1)
ESTIMATION (1)
EXTENDED BAUM-WELCH (1)
FRAME-SHIFT METHOD (1)
GAUSSIAN MIXTURE MODEL PARAMETERS (1)
GAUSSIAN PROCESSES (1)
INFLECTIVE LANGUAGES (1)
KEYWORD SEARCH (1)
KEYWORD SPOTTING (1)
LARGE MARGIN ESTIMATION (1)
LINEAR FUSION (1)
LOSS FUNCTION (1)
MANDARIN TEXT-TO-SPEECH DATABASE (1)
MATHEMATICAL MODEL (1)
MORPHOLOGY (1)
NATURAL LANGUAGE PROCESSING (1)
NIST LANGUAGE RECOGNITION EVALUATION TASK (1)
OPTIMISATION (1)
OPTIMIZATION METHODS (1)
PARAMETER ESTIMATION (1)
PENALTY FUNCTION (1)
SEMI-SUPERVISED LEARNING (1)
SME CONSTRAINED OPTIMIZATION (1)
SOFT MARGIN ESTIMATION (1)
SOFT MARGIN ESTIMATION METHOD (1)
SPEAKER RECOGNITION (1)
SPEECH PROCESSING (1)
SPEECH SYNTHESIS (1)
SPOKEN LANGUAGE RECOGNITION (1)
SPOKEN TERM DETECTION (STD) (1)
SUPPORT VECTOR MACHINES (1)
TRAINING DATA (1)
UNDER-RESOURCED LANGUAGES (1)
UNIT BOUNDARY (1)
UNIT SELECTION (1)
UNIT SELECTION BASED TEXT-TO-SPEECH SYNTHESIS (1)
UNSUPERVISED LEARNING (1)
more

INFONA - science communication portal

Search results for: Haizhou Li

Low-resource keyword search strategies for tamil

Optimization Algorithms and Applications for Speech and Language Processing

Classifier subset selection and fusion for speaker verification

Soft margin estimation of Gaussian mixture model parameters for spoken language recognition

Refining Unit Boundaries for Mandarin Text-to-Speech Database

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options