Search results for: Liang He

Items from 1 to 7 out of 7 results

chapter

Deep neural networks based speaker modeling at different levels of phonetic granularity

Yao Tian, Liang He, Meng Cai, Wei-Qiang Zhang, more

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5440 - 5444

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Recently, a hybrid deep neural network/i-vector framework has been proved effective for speaker verification, where the DNN trained to predict tied-triphone states (senones) is used to produce frame alignments for sufficient statistics extraction. In this work, in order to better understand the impact of different phonetic precision to speaker verification tasks, three levels of phonetic granularity...

chapter

A study of variational method for text-independent speaker recognition

Liang He, Yao Tian, Yi Liu, Fang Dong, more

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP) > 1 - 5

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP)

An i-vector has become the state-of-the-art algorithm for text-independent recognition. Most of related works take the extraction of the i-vector as a black-box by using some open software (e.g. Kaldi, Alize) and focus on the vector-based back-end algorithms, such as length normalization, WCCN, or PLDA. In this paper, we study the variational method and present a concise derivation for the i-vector...

chapter

PRISM: A statistical modeling framework for text-independent speaker verification

Liang He, Jia Liu

2015 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP) > 529 - 533

2015 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP)

This paper presents a statistical modeling framework termed as PRISM for text-independent speaker verification. We decompose the verification task into three subtasks: PRobability density estimation, Information metric and Subspace/Manifold learning (PRISM). Subsequently, we take advantages of variational maximum likelihood estimation, Fisher information metric and discriminant locality preserving...

chapter

Stacked bottleneck features for speaker verification

Yao Tian, Liang He, Jia Liu

2015 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP) > 514 - 518

2015 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP)

i-Vector modeling has shown to be effective for text independent speaker verification. It represents each utterance as a low-dimensional vector using factor analysis with a GMM supervector. In order to capture more complex speaker statistics, this paper proposes a new feature representation other than i-vectors for speaker verification using neural networks. In this work, stacked bottleneck features...

chapter

Discriminant local information distance preserving projection for text-independent speaker recognition

Liang He, Jia Li

2012 8th International Symposium on Chinese Spoken Language Processing > 349 - 352

2012 8th International Symposium on Chinese Spoken Language Processing (ISCSLP 2012)

A novel method is presented based on a statistical manifold for text-independent speaker recognition. After feature extraction, speaker recognition becomes a sequence classification problem. By discarding time information, the core task is the comparison of multiple sample sets. Each set is assumed to be governed by a probability density function (PDF). We estimate the PDFs and place the estimated...

chapter

Channel compensation technology in differential GSV-SVM speaker verification system

Liang He, Wei-Qiang Zhang, Yuxiang Shan, Jia Liu

APCCAS 2008 - 2008 IEEE Asia Pacific Conference on Circuits and Systems > 221 - 224

APCCAS 2008 - 2008 IEEE Asia Pacific Conference on Circuits and Systems (APCCAS)

Channel variability is the major cause of performance degradation in text-independent speaker verification. Compensation technology in feature, model or score domain has been widely applied to baseline systems to mitigate mismatch. Newly proposed Gaussian mixture models super vector-support vector machine (GMM-SVM or GSV-SVM) baseline system has proven successful through integrating advantages of...

chapter

Auditory features with vocal track length normalization for language identification

Weiqiang Zhang, Jia Liu, Liang He

2008 International Conference on Audio, Language and Image Processing > 66 - 70

2008 International Conference on Audio, Language and Image Processing

This paper reports on a novel feature, auditory cepstrum coefficient (ACC) with vocal tract length normalization (VTLN), for language identification (LID). The ACC feature is based on the auditory characteristics of human ear and the VTLN technology compensates the speaker variability. The detailed implementation of ACC feature with VTLN in frequency domain is given. Experimental results show that...

Filter options

Data set:
ieee
Keywords:
NIST

Publication date

Set your own date range

Keywords

FEATURE EXTRACTION (4)
SPEECH (3)
CEPSTRAL ANALYSIS (2)
ESTIMATION (2)
MANIFOLDS (2)
NEURAL NETWORKS (2)
SPEAKER RECOGNITION (2)
SPEAKER VERIFICATION (2)
SPEECH RECOGNITION (2)
TEXT-INDEPENDENT SPEAKER RECOGNITION (2)
TRAINING DATA (2)
VECTORS (2)
ACOUSTICS (1)
ADAPTATION MODEL (1)
ANALYTICAL MODELS (1)
AUDITORY CEPSTRUM COEFFICIENT FEATURE EXTRACTION (1)
BAND PASS FILTERS (1)
BAYES METHODS (1)
BOTTLENECK FEATURE (1)
CHANNEL COMPENSATION TECHNOLOGY (1)
COVARIANCE MATRICES (1)
DATABASES (1)
DEEP NEURAL NETWORK (1)
DEEP NEURAL NETWORKS (1)
DISCRIMINANT LOCAL PRESERVING PROJECTION (1)
EIGENVALUES AND EIGENFUNCTIONS (1)
ENTROPY (1)
FILTER BANK (1)
FISHER INFORMATION (1)
FISHER INFORMATION METRIC (1)
FREQUENCY DOMAIN ANALYSIS (1)
FREQUENCY-DOMAIN ANALYSIS (1)
GAUSSIAN CHANNELS (1)
GAUSSIAN MIXTURE MODEL-SUPER VECTOR-SUPPORT VECTOR MACHINE (1)
GAUSSIAN MIXTURE MODELS (1)
GMM SUPERVECTOR (1)
GSV-SVM SPEAKER VERIFICATION SYSTEM (1)
I-VECTOR (1)
INFORMATION GEOMETRY (1)
KERNEL (1)
LANGUAGE IDENTIFICATION (1)
LINEAR PROGRAMMING (1)
MANGANESE (1)
MANIFOLD LEARNING (1)
MAP (1)
MATHEMATICAL MODEL (1)
MAXIMUM A POSTERIORI MODEL (1)
MAXIMUM LIKELIHOOD ESTIMATION (1)
MEASUREMENT (1)
MEL FREQUENCY CEPSTRAL COEFFICIENT (1)
MEL-FREQUENCY CEPSTRUM COEFFICIENT (1)
NIST SRE 06 CORPUS (1)
PHONETIC GRANULARITY (1)
PROBABILITY DENSITY FUNCTION (1)
SPEAKER VARIABILITY COMPENSATION (1)
SUBSPACE (1)
SUPPORT VECTOR MACHINE CLASSIFICATION (1)
SUPPORT VECTOR MACHINES (1)
TELEPHONE SETS (1)
TEXT-INDEPENDENT SPEAKER VERIFICATION (1)
TOTAL VARIABILITY MODEL (1)
TRAINING (1)
UNIVERSAL BACKGROUND MODEL (1)
VARATIONAL ESTIMATION (1)
VARIATIONAL METHOD (1)
VOCAL TRACK LENGTH NORMALIZATION (1)
more

INFONA - science communication portal

Search results for: Liang He

Deep neural networks based speaker modeling at different levels of phonetic granularity

A study of variational method for text-independent speaker recognition

PRISM: A statistical modeling framework for text-independent speaker verification

Stacked bottleneck features for speaker verification

Discriminant local information distance preserving projection for text-independent speaker recognition

Channel compensation technology in differential GSV-SVM speaker verification system

Auditory features with vocal track length normalization for language identification

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options