Search results for: Haizhou Li

Items from 1 to 4 out of 4 results

chapter

A GMM supervector Kernel with the Bhattacharyya distance for SVM based speaker recognition

Chang Huai You, Kong Aik Lee, Haizhou Li

2009 IEEE International Conference on Acoustics, Speech and Signal Processing > 4221 - 4224

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

Gaussian mixture model (GMM) supervector is one of the effective techniques in text independent speaker recognition. In our previous work, we introduce the GMM-UBM mean interval (GUMI) concept based on the Bhattacharyya distance. Subsequently GUMI kernel was successfully used in conjunction with support vector machine (SVM) for speaker recognition. Besides the first order statistics, it is generally...

chapter

Cluster criterion functions in spectral subspace and their application in speaker clustering

Trung Hieu Nguyen, Haizhou Li, Eng Siong Chng

2009 IEEE International Conference on Acoustics, Speech and Signal Processing > 4085 - 4088

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

In this paper, we propose two cluster criterion functions which aim to maximize the separation between intra-cluster distances and inter-cluster distances. These criteria can automatically deduce the desired number of clusters based on their extremized values. We then propose an algorithm to apply our criterion functions in conjunction with spectral clustering. By exploiting the characteristic of...

article

An SVM Kernel With GMM-Supervector Based on the Bhattacharyya Distance for Speaker Recognition

Chang Huai You, Kong Aik Lee, Haizhou Li

IEEE Signal Processing Letters > 2009 > 16 > 1 > 49 - 52

Gaussian mixture model (GMM) and support vector machine (SVM) have become popular classifiers in text-independent speaker recognition. A GMM-supervector characterizes a speaker's voice with the parameters of GMM, which include mean vectors, covariance matrices, and mixture weights. GMM-supervector SVM benefits from both GMM and SVM frameworks to achieve the state-of-the-art performance. Conventional...

chapter

Predicting Spectral and Prosodic Parameters for Unit Selection in Speech Synthesis

Minghui Dong, Haizhou Li

2008 6th International Symposium on Chinese Spoken Language Processing > 1 - 4

2008 6th International Symposium on Chinese Spoken Language Processing

We usually build a prosody model to predict the prosodic parameters, which will be used as part of the criteria for unit selection. Spectral appropriateness of units is usually ensured by using identities of context units, which are linguistic symbols. With looking into the spectral properties of the actual signal, the spectral mismatches are often perceived in the synthetic speech. In this paper,...

Filter options

Keywords:
DISTANCE MEASUREMENT

Publication date

Set your own date range

Publication type

book (3)
article (1)

Keywords

NIST (3)
SPEAKER RECOGNITION (3)
BHATTACHARYYA DISTANCE (2)
DATA MINING (2)
GAUSSIAN MIXTURE MODEL (2)
GAUSSIAN PROCESSES (2)
KERNEL (2)
SPEECH (2)
SUPERVECTOR (2)
SUPPORT VECTOR MACHINE (2)
SUPPORT VECTOR MACHINES (2)
ACOUSTICS (1)
AGGLOMERATIVE HIERARCHICAL SPEAKER DIARIZATION SYSTEM (1)
ALGORITHM DESIGN AND ANALYSIS (1)
CLUSTER CRITERION FUNCTIONS (1)
CLUSTERING ALGORITHMS (1)
COVARIANCE ANALYSIS (1)
COVARIANCE MATRICES (1)
COVARIANCE STATISTICAL VECTOR (1)
CRITERION FUNCTION (1)
DENSITY ESTIMATION ROBUST ALGORITHM (1)
ERROR ANALYSIS (1)
GMM SUPERVECTOR KERNEL (1)
GMM-UBM MEAN INTERVAL (1)
GUMI KERNEL (1)
HIDDEN MARKOV MODELS (1)
INTER-CLUSTER DISTANCES (1)
INTRA-CLUSTER DISTANCES (1)
KULLBACK-LEIBLER KERNEL (1)
MEAN STATISTICAL VECTOR (1)
MEL FREQUENCY CEPSTRAL COEFFICIENT (1)
MFCC (1)
NATIONAL INSTITUTE OF STANDARDS AND TECHNOLOGY (NIST) EVALUATION (1)
NIST EVALUATION (1)
PATTERN CLUSTERING (1)
PREDICTIVE MODELS (1)
SPEAKER CLUSTERING (1)
SPEAKER DIARIZATION (1)
SPEAKER VERIFICATION (1)
SPECTRAL CLUSTERING (1)
SPECTRAL SUBSPACE (1)
SPEECH SYNTHESIS (1)
SVM CLASSIFIER (1)
SVM KERNEL (1)
TESTING (1)
TEXT INDEPENDENT SPEAKER RECOGNITION (1)
TEXT-INDEPENDENT SPEAKER RECOGNITION (1)
TRAINING (1)
UNIT SELECTION (1)
VECTORS (1)
more

INFONA - science communication portal

Search results for: Haizhou Li

A GMM supervector Kernel with the Bhattacharyya distance for SVM based speaker recognition

Cluster criterion functions in spectral subspace and their application in speaker clustering

An SVM Kernel With GMM-Supervector Based on the Bhattacharyya Distance for Speaker Recognition

Predicting Spectral and Prosodic Parameters for Unit Selection in Speech Synthesis

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options