Search results for: Bin Ma

Items from 1 to 8 out of 8 results

chapter

Factor analysis based spatial correlation modeling for speaker verification

Er-Yu Wang, Wu Guo, Li-Rong Dai, Kong-Aik Lee, more

2010 7th International Symposium on Chinese Spoken Language Processing > 166 - 170

7th International Symposium on Chinese Spoken Language Processing (ISCSLP 2010)

Gaussian mixture models (GMMs) are commonly used in text-independent speaker verification for modeling the spectral distribution of speech. Recent studies have shown the effectiveness of characterizing speaker information using the mean super-vector obtained by concatenating the mean vectors of the GMM. This paper proposes to use the spatial correlation captured by the covariance matrix of the mean...

chapter

Soft margin estimation of Gaussian mixture model parameters for spoken language recognition

Donglai Zhu, Bin Ma, Haizhou Li

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 4990 - 4993

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

This paper extends our previous work on large margin estimation (LME) of GMM parameters with extend Baum-Welch (EBW) for spoken language recognition. To overcome the problem in the LME that negative samples in the training set are not used in parameter estimation, we propose a soft margin estimation (SME) method in this paper. The soft margin is scaled by a loss function measuring the distance between...

chapter

Joint map adaptation of feature transformation and Gaussian Mixture Model for speaker recognition

Donglai Zhu, Bin Ma, Haizhou Li

2009 IEEE International Conference on Acoustics, Speech and Signal Processing > 4045 - 4048

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

This paper extends our previous work on feature transformation-based support vector machines for speaker recognition by proposing a joint MAP adaptation of feature transformation (FT) and Gaussian Mixture Models (GMM) parameters. In the new approach, the prior probability density functions (PDFs) of FT and GMM parameters are jointly estimated using the background data under the maximum likelihood...

chapter

An Efficient Feature Selection Method for Speaker Recognition

Hanwu Sun, Bin Ma, Haizhou Li

2008 6th International Symposium on Chinese Spoken Language Processing > 1 - 4

2008 6th International Symposium on Chinese Spoken Language Processing

In this paper, a new feature selection method for speaker recognition is proposed to keep the high quality speech frames for speaker modelling and to remove noisy and corrupted speech frames. In order to obtain robust voice activity detection in variety of acoustic conditions, the spectral subtraction algorithm is adopted to estimate the frame power. An energy based frame selection algorithm is then...

chapter

Self-Organized Clustering for Feature Mapping in Language Recognition

Chang Huai You, Kong Aik Lee, Bin Ma, Haizhou Li

2008 6th International Symposium on Chinese Spoken Language Processing > 1 - 4

2008 6th International Symposium on Chinese Spoken Language Processing

In this paper, we propose a self-organized clustering method for feature mapping to compensate the channel variation in spoken language recognition. The self-organized clustering is realized by transforming the utterances into the Gaussian mixture model (GMM) supervectors and categorizing the supervectors through k-mean algorithm. Based on the language-dependent cluster-of-utterance information of...

chapter

Discriminative learning for optimizing detection performance in spoken language recognition

Donglai Zhu, Haizhou Li, Bin Ma, Chin-Hui Lee

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 4161 - 4164

ICASSP 2008. IEEE International Conference on Acoustic, Speech and Signal Processes

We propose novel approaches for optimizing the detection performance in spoken language recognition. Two objective functions are designed to directly relate model parameters to two performance metrics of interest, the detection cost function and the area under the detection-error-tradeoff curve, respectively. Both metrics are approximated with differentiable functions of model parameters by using...

chapter

A Generalized Feature Transformation Approach for Channel Robust Speaker Verification

Donglai Zhu, Bin Ma, Haizhou Li, Qiang Huo

2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '7 > 4 > IV-61 - IV-64

2007 IEEE International Conference on Acoustics, Speech, and Signal Processing

In this paper we propose a generalized feature transformation approach to compensating for channel variation in speaker verification (SV) applications. Channel-dependent (CD) piecewise linear transformations are used for feature compensation. CD transformation parameters are estimated together with a channel-independent (CI) root Gaussian mixture model (GMM) from training data with a variety of channel...

chapter

Chinese Dialect Identification Using Tone Features Based on Pitch Flux

Bin Ma, Donglai Zhu, Rong Tong

2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings > 1 > I

2006 IEEE International Conference on Acoustics, Speech, and Signal Processing

This paper presents a method to extract tone relevant features based on pitch flux from continuous speech signal. The autocorrelations of two adjacent frames are calculated and the covariance between them is estimated to extract multi-dimensional pitch flux features. These features, together with MFCCs, are modeled in a 2-stream GMM models, and are tested in a 3-dialect identification task for Chinese...

Filter options

Keywords:
GAUSSIAN PROCESSES

Publication date

Set your own date range

Keywords

SPEAKER RECOGNITION (4)
SPEECH (4)
SPEECH RECOGNITION (4)
FEATURE EXTRACTION (3)
GAUSSIAN MIXTURE MODEL (3)
NIST (3)
SPOKEN LANGUAGE RECOGNITION (3)
SUPPORT VECTOR MACHINES (3)
ESTIMATION (2)
MAXIMUM LIKELIHOOD ESTIMATION (2)
SUPPORT VECTOR MACHINE (2)
TRAINING (2)
ADAPTATION MODEL (1)
CHANNEL COMPENSATION (1)
CHANNEL ROBUST SPEAKER VERIFICATION (1)
CHANNEL-DEPENDENT PIECEWISE LINEAR TRANSFORMATIONS (1)
CHANNEL-INDEPENDENT GMM (1)
CHINESE DIALECT IDENTIFICATION (1)
CLASS MISCLASSIFICATION MEASURE (1)
CLASSIFICATION BOUNDARY (1)
CLUSTERING ALGORITHMS (1)
CONTINUOUS SPEECH SIGNAL (1)
CORRELATION (1)
COVARIANCE MATRICES (1)
COVARIANCE MATRIX (1)
DATABASES (1)
DETECTION COST FUNCTION (1)
DETECTION ERROR TRADEOFF (1)
DETECTION PERFORMANCE OPTIMIZATION (1)
DETECTION-ERROR-TRADEOFF CURVE (1)
DIFFERENTIABLE FUNCTIONS (1)
DISCRIMINATIVE LEARNING (1)
EBW ALGORITHM (1)
EQUAL ERROR RATE (1)
EXTENDED BAUM-WELCH (1)
FACTOR ANALYSIS (1)
FEATURE COMPENSATION (1)
FEATURE MAPPING (1)
FEATURE MAPPING PARAMETER (1)
FEATURE SELECTION (1)
FEATURE TRANSFORMATION (1)
FROBENIUS ANGLE (1)
GAUSSIAN MIXTURE MODEL PARAMETERS (1)
GAUSSIAN MIXTURE MODEL SUPERVECTOR (1)
GENERALIZED FEATURE TRANSFORMATION (1)
GENERALIZED FEATURE TRANSFORMATION APPROACH (1)
GENERALIZED PROBABILISTIC DESCENT ALGORITHM (1)
GMM MODELS (1)
GMM-UBM SPEAKER RECOGNITION SYSTEM (1)
HIDDEN MARKOV MODELS (1)
INNER PRODUCT CLASSIFIER (1)
JOINT MAP ADAPTATION (1)
JOINTS (1)
K-MEAN ALGORITHM (1)
KERNEL (1)
LANGUAGE RECOGNITION EVALUATION (1)
LANGUAGE-DEPENDENT CLUSTER-OF-UTTERANCE INFORMATION (1)
LARGE MARGIN ESTIMATION (1)
LEARNING (ARTIFICIAL INTELLIGENCE) (1)
LOADING (1)
LOG EUCLIDEAN INNER PRODUCT CLASSIFIER (1)
LOG-EUCLIDEAN DISTANCE (1)
LOSS FUNCTION (1)
MAXIMUM A POSTERIORI (1)
MAXIMUM LIKELIHOOD (1)
MAXIMUM LIKELIHOOD CRITERIA (1)
MAXIMUM LIKELIHOOD CRITERION (1)
MAXIMUM LIKELIHOOD TRAINING APPROACH (1)
MEAN SUPER VECTOR (1)
MEASUREMENT (1)
MICROPHONES (1)
MODEL PARAMETERS (1)
MULTIDIMENSIONAL PITCH FLUX FEATURES (1)
NATIONAL INSTITUTE OF STANDARDS AND TECHNOLOGY (1)
NATURAL LANGUAGE PROCESSING (1)
NATURAL LANGUAGES (1)
NIST LANGUAGE RECOGNITION EVALUATION CORPORA (1)
NIST LANGUAGE RECOGNITION EVALUATION TASK (1)
NOISE (1)
NOISE MEASUREMENT (1)
OPTIMISATION (1)
OPTIMIZATION (1)
PARAMETER ESTIMATION (1)
PATTERN CLASSIFICATION (1)
PATTERN CLUSTERING (1)
PENALTY FUNCTION (1)
PROBABILITY DENSITY FUNCTION (1)
PROBABILITY DENSITY FUNCTIONS (1)
ROBUST VOICE ACTIVITY DETECTION (1)
ROOT GAUSSIAN MIXTURE MODEL (1)
SELF-ORGANISING FEATURE MAPS (1)
SELF-ORGANIZED CLUSTERING METHOD (1)
SIGNAL TO NOISE RATIO (1)
SME CONSTRAINED OPTIMIZATION (1)
SMOOTHING FUNCTION (1)
SOFT MARGIN ESTIMATION (1)
SOFT MARGIN ESTIMATION METHOD (1)
SPATIAL CORRELATION MODELING (1)
SPEAKER MODELLING (1)
more

INFONA - science communication portal

Search results for: Bin Ma

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options