Search results for: Xiang Xiao

Items from 1 to 5 out of 5 results

chapter

Maximum a posteriori linear regression for speaker recognition

Xiang Zhang, Haipeng Wang, Xiang Xiao, Jianping Zhang, more

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 4542 - 4545

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

Recently, using maximum likelihood linear regression (MLLR) transforms as the features for SVM based speaker recognition has been proposed. This can achieve performance comparable to that obtained with state-of-the-art approaches. In this paper, we focus on calculating the transforms based on a GMM universal background model (UBM). Rather than estimating the transforms using maximum likelihood criterion,...

chapter

Combining MAP and MLLR Approaches for SVM Based Speaker Recognition with a Multi-class MLLR Technique

Haipeng Wang, Xiang Zhang, Xiang Xiao, Jianping Zhang, more

2009 Second International Symposium on Information Science and Engineering > 447 - 450

Second International Symposium on Information Science and Engineering (ISISE 2009)

Gaussian mixture models with an universal background model (UBM) have been the standard method for speaker recognition. Typically, maximum a posteriori (MAP) or maximum likelihood linear regression (MLLR) is used to adapt the means of the UBM. Together with the SVM modeling technique, these approaches can achieve excellent performance. MLLR is quite efficient when the amount of adaptation data is...

chapter

A Hierarchical System Design for Language Identification

Haipeng Wang, Xiang Xiao, Xiang Zhang, Jianping Zhang, more

2009 Second International Symposium on Information Science and Engineering > 443 - 446

Second International Symposium on Information Science and Engineering (ISISE 2009)

Token-based approaches have proven quite effective for spoken language identification (LID). Traditionally, Speech utterances are first decoded into token sequences, and then LID tasks are performed on these token sequences by either n-gram language models or support vector machines. In this paper, we propose a hierarchical system design, which utilizes a group of bayesian logistic regression models...

chapter

Harmonic Structure Features for Robust Speaker Recognition against Channel Effect

Chuan Cao, Xiang Xiao, Ming Li, Jian Liu, more

2009 Second International Symposium on Information Science and Engineering > 451 - 454

Second International Symposium on Information Science and Engineering (ISISE 2009)

This paper proposes a novel feature set for robust speaker recognition, which is based on the harmonic structure of speech signals. Channel modulation effects are supposed to be weakened in the harmonic structure features, and furthermore the influence introduced by channel variability could be diminished to a certain degree. Though experiment results show that the raw performance of the harmonic...

chapter

Speaker Recognition using a Kind of Novel Phonotactic Information

Xiang Zhang, Xiang Xiao, Haipeng Wang, Hongbin Suo, more

2008 6th International Symposium on Chinese Spoken Language Processing > 1 - 4

2008 6th International Symposium on Chinese Spoken Language Processing

In this paper, we present a new modeling approach for speaker recognition, which uses a kind of novel phonotactic information as the feature for S VM modeling. Gaussian mixture models (GMMs) have been proven extremely successful for text- independent speaker recognition. The GMM universal background model (UBM) is a speaker-independent model, each component of which can be considered to be modeling...

Filter options

Keywords:
SPEECH

Publication date

Set your own date range

Keywords

SPEAKER RECOGNITION (4)
NIST (3)
ADAPTATION MODEL (2)
FEATURE EXTRACTION (2)
GAUSSIAN MIXTURE MODELS (2)
GAUSSIAN PROCESSES (2)
MLLR (2)
SPEECH PROCESSING (2)
TRAINING (2)
TRANSFORMS (2)
BAYES METHODS (1)
BAYESIAN LOGISTIC REGRESSION MODEL (1)
BAYESIAN LOGISTIC REGRESSION MODELS (1)
BAYESIAN METHODS (1)
CEPSTRAL ANALYSIS (1)
CHANNEL EFFECT (1)
CHANNEL EFFECT REDUCTION (1)
CHANNEL VARIABILITY (1)
COMBINING MAP (1)
COMPUTATIONAL MODELING (1)
GMM UNIVERSAL BACKGROUND MODEL (1)
HARMONIC ANALYSIS (1)
HARMONIC STRUCTURE (1)
HARMONIC STRUCTURE FEATURES (1)
HARMONICS (1)
HIERARCHICAL SYSTEM DESIGN (1)
HIERARCHICAL SYSTEMS (1)
LANGUAGE IDENTIFICATION (1)
LID TASKS (1)
LOGISTICS (1)
MAPLR (1)
MATHEMATICAL MODEL (1)
MAXIMUM A POSTERIORI (1)
MAXIMUM LIKELIHOOD ESTIMATION (1)
MAXIMUM LIKELIHOOD LINEAR REGRESSION (1)
MLLR APPROACHES (1)
MODULATION (1)
MULTICLASS MLLR TECHNIQUE (1)
N-GRAM LANGUAGE MODELS (1)
NATURAL LANGUAGE PROCESSING (1)
NISR LRE 2007 DATABASES (1)
PERIODIC STRUCTURES (1)
PHONOTACTIC INFORMATION (1)
POSTERIOR PROBABILITIES (1)
REGRESSION ANALYSIS (1)
ROBUST SPEAKER RECOGNITION (1)
SCORE FUSION (1)
SCORE FUSION APPROACH (1)
SCORE GENERATORS (1)
SCORE MERGER (1)
SENSOR FUSION (1)
SINUSOIDAL RESYNTHESIS (1)
SPEECH RECOGNITION (1)
SPEECH SIGNALS (1)
SPEECH UTTERANCES (1)
SPOKEN LANGUAGE IDENTIFICATION (1)
SUPPORT VECTOR MACHINE (1)
SVM (1)
SVM BASED SPEAKER RECOGNITION (1)
SVM MODELING (1)
TOKEN-BASED APPROACH (1)
TRAINING DATA (1)
UBM (1)
UNIVERSAL BACKGROUND MODEL (1)
more

INFONA - science communication portal

Search results for: Xiang Xiao

Maximum a posteriori linear regression for speaker recognition

Combining MAP and MLLR Approaches for SVM Based Speaker Recognition with a Multi-class MLLR Technique

A Hierarchical System Design for Language Identification

Harmonic Structure Features for Robust Speaker Recognition against Channel Effect

Speaker Recognition using a Kind of Novel Phonotactic Information

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options