Search results for: Yonghong Yan

Items from 1 to 17 out of 17 results

chapter

Factor analysis of Laplacian approach for speaker recognition

Jinchao Yang, Chunyan Liang, Lin Yang, Hongbin Suo, more

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4221 - 4224

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

In this study, we introduce a new factor analysis of Laplacian approach to speaker recognition under the support vector machine (SVM) framework. The Laplacian-projected supervector from our proposed Laplacian approach, which finds an embedding that preserves local information by locality preserving projections (LPP), is believed to contain speaker dependent information. The proposed method was compared...

chapter

An automatic data-driven technique for selecting background dataset in GMM-SVM speaker verification system

Jinchao Yang, Haipeng Wang, Jianping Zhang, Yonghong Yan

2010 International Conference on Audio, Language and Image Processing > 85 - 89

2010 International Conference on Audio, Language and Image Processing (ICALIP)

In this paper, we propose an automatic data-driven technique for selecting proper background dataset. By the technique, impostor confidence(IC) is proposed as a metric and more discriminative background dataset is automatically chose by impostor confidence(IC) to train more discriminative model. Experiment results on NIST 2008 SRE corpus in GMM-SVM speaker verification system show that the proposed...

chapter

A SVM-Based Audio Event Detection System

Li Lu, Fengpei Ge, Qingwei Zhao, Yonghong Yan

2010 International Conference on Electrical and Control Engineering > 292 - 295

2010 International Conference on Electrical and Control Engineering (ICECE 2010)

This paper proposes a SVM-based method to deal with the problem of detecting audio events(cheering and applause) by audio analysis. In our framework, a sliding window is first used to pre-segment the audio stream into short segments by moving from start to the end. Second, various kinds of audio features are extracted to represent different audio sounds in each segment. Third, SVM(super vector machine)...

chapter

Support Vector Machine for Chinese Part-Of-Speech Tagging in Speech Synthesis Systems

Xiang Wang, Jianping Zhang, Yonghong Yan

2010 International Conference on Biomedical Engineering and Computer Science > 1 - 4

International Conference on Biomedical Engineering and Computer Science (ICBECS 2010)

The paper presents a support vector machine based Part-Of-Speech tagging on Chinese database which is part of our speech synthesis system. The model can be classified as SVM model and uses many sequential features to predict the POS tag. The text database was download from the internet with 1,280,000 words and 33 parts of Speech. The total accuracy of our experiments is 99.31%.

chapter

Maximum a posteriori linear regression for speaker recognition

Xiang Zhang, Haipeng Wang, Xiang Xiao, Jianping Zhang, more

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 4542 - 4545

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

Recently, using maximum likelihood linear regression (MLLR) transforms as the features for SVM based speaker recognition has been proposed. This can achieve performance comparable to that obtained with state-of-the-art approaches. In this paper, we focus on calculating the transforms based on a GMM universal background model (UBM). Rather than estimating the transforms using maximum likelihood criterion,...

chapter

Speech Emotion Recognition Using Both Spectral and Prosodic Features

Yu Zhou, Yanqing Sun, Jianping Zhang, Yonghong Yan

2009 International Conference on Information Engineering and Computer Science > 1 - 4

2009 International Conference on Information Engineering and Computer Science. ICIECS 2009

In this paper, we propose a speech emotion recognition system using both spectral and prosodic features. Most traditional systems have focused on spectral features or prosodic features. Since both the spectral and the prosodic features contain emotion information, it is believed that the combining of spectral features and prosodic features will improve the performance of the emotion recognition system...

chapter

Combining MAP and MLLR Approaches for SVM Based Speaker Recognition with a Multi-class MLLR Technique

Haipeng Wang, Xiang Zhang, Xiang Xiao, Jianping Zhang, more

2009 Second International Symposium on Information Science and Engineering > 447 - 450

Second International Symposium on Information Science and Engineering (ISISE 2009)

Gaussian mixture models with an universal background model (UBM) have been the standard method for speaker recognition. Typically, maximum a posteriori (MAP) or maximum likelihood linear regression (MLLR) is used to adapt the means of the UBM. Together with the SVM modeling technique, these approaches can achieve excellent performance. MLLR is quite efficient when the amount of adaptation data is...

chapter

A Hierarchical System Design for Language Identification

Haipeng Wang, Xiang Xiao, Xiang Zhang, Jianping Zhang, more

2009 Second International Symposium on Information Science and Engineering > 443 - 446

Second International Symposium on Information Science and Engineering (ISISE 2009)

Token-based approaches have proven quite effective for spoken language identification (LID). Traditionally, Speech utterances are first decoded into token sequences, and then LID tasks are performed on these token sequences by either n-gram language models or support vector machines. In this paper, we propose a hierarchical system design, which utilizes a group of bayesian logistic regression models...

chapter

Harmonic Structure Features for Robust Speaker Recognition against Channel Effect

Chuan Cao, Xiang Xiao, Ming Li, Jian Liu, more

2009 Second International Symposium on Information Science and Engineering > 451 - 454

Second International Symposium on Information Science and Engineering (ISISE 2009)

This paper proposes a novel feature set for robust speaker recognition, which is based on the harmonic structure of speech signals. Channel modulation effects are supposed to be weakened in the harmonic structure features, and furthermore the influence introduced by channel variability could be diminished to a certain degree. Though experiment results show that the raw performance of the harmonic...

chapter

An Mandarin Pronunciation Quality Assessment System Using Two Kinds of Acoustic Models

Fengpei Ge, Li Lu, Changliang Liu, Fuping Pan, more

2009 International Conference on Research Challenges in Computer Science > 68 - 72

2009 International Conference on Research Challenges in Computer Science (ICRCCS 2009)

This paper presents our Mandarin pronunciation quality assessment system for the examination of Putonghua Shuiping Kaoshi (PSK) and investigates some measures to improve the assessment accuracy. In this paper, a selective speaker adaptation method is studied. In the adaptation module, we select well pronounced speech as the adaptation data, and adopt Maximum Likelihood Linear Regression (MLLR) to...

chapter

Automatic Detection of Pathological Voices Using GMM-SVM Method

Xiang Wang, Jianping Zhang, Yonghong Yan

2009 2nd International Conference on Biomedical Engineering and Informatics > 1 - 4

2009 2nd International Conference on Biomedical Engineering and Informatics (BMEI)

Modern lifestyle has increased the risk of pathological voices problems. So the therapy of pathological people attracts more attention of people. Meanwhile, acoustic features have been used widely in the therapy of voice disordered people. Classification of Normal and Pathological people is also an auxiliary therapy operation. MFCC has been proved to be a useful feature with traditional classifier...

chapter

Automatic Detection of Pathological Voices Using GMM-MLLR Approach

Xiang Wang, Jianping Zhang, Yonghong Yan

2009 2nd International Conference on Biomedical Engineering and Informatics > 1 - 4

2009 2nd International Conference on Biomedical Engineering and Informatics (BMEI)

Modern lifestyles have increased the risk of suffering some kind of voice disorders. It is estimated that nearly 19% of the population have suffered from dysphonic voicing. It is very important to detect pathological voices automatically. Many classification methods have been used to detect the pathological voices automatically and got good results. In this paper, we focus on the automatic detection...

chapter

SVM Based Speaker Recognition Using Maximum a posteriori Linear Regression

Xiang Zhang, Qingwei Zhao, Yonghong Yan

2009 International Conference on Electronic Computer Technology > 438 - 442

2009 International Conference on Electronic Computer Technology. ICECT 2009

Maximum likelihood linear regression (MLLR) is a widely used technique for speaker adaptation in large vocabulary speech recognition system. Recently, using MLLR transforms as features for SVM based speaker recognition tasks has been proposed, achieving performance comparable to that obtained with cepstral features. In this paper, we focus on calculating the transforms based on a GMM universal background...

chapter

Using Eigenvoice Coefficients as Features in Speaker Recognition

Haipeng Wang, Qingwei Zhao, Yonghong Yan

2009 International Conference on Electronic Computer Technology > 262 - 266

2009 International Conference on Electronic Computer Technology. ICECT 2009

Eigenvoice speaker adaptation has been shown to be effective in recent years. In this paper, we propose to use eigenvoice coefficients as features for speaker recognition. We use a simplified version of probabilistic subspace adaptation (PSA) to estimate eigenvoice coefficients, and the coefficients are concatenated to construct supervectors of support vector machines. This approach significantly...

chapter

Speaker Recognition using a Kind of Novel Phonotactic Information

Xiang Zhang, Xiang Xiao, Haipeng Wang, Hongbin Suo, more

2008 6th International Symposium on Chinese Spoken Language Processing > 1 - 4

2008 6th International Symposium on Chinese Spoken Language Processing

In this paper, we present a new modeling approach for speaker recognition, which uses a kind of novel phonotactic information as the feature for S VM modeling. Gaussian mixture models (GMMs) have been proven extremely successful for text- independent speaker recognition. The GMM universal background model (UBM) is a speaker-independent model, each component of which can be considered to be modeling...

chapter

A study on singing performance evaluation criteria for untrained singers

Chuan Cao, Ming Li, Jian Liu, Yonghong Yan

2008 9th International Conference on Signal Processing > 1475 - 1478

2008 9th International Conference on Signal Processing (ICSP 2008)

This paper describes a study of subjective criteria for untrained singerspsila singing voice quality evaluation, focusing on the perceptual aspects that have relatively strong acoustic implications. And the correlation among the individual perceptual criteria is also investigated. A SVM regression method is applied to find the importance of every evaluation criterion. Experiments on a 200 singing...

chapter

The Design of Backend Classifiers in PPRLM System for Language Identification

Hongbin Suo, Ming Li, Tantan Liu, Ping Lu, more

Third International Conference on Natural Computation (ICNC 2007) > 1 > 678 - 682

2007 3rd International Conference on Natural Computation

The design approach for classifying the backend features of the PPRLM (Parallel Phone Recognition and Language Modeling) system is demonstrated in this paper. A variety of features and their combinations extracted by language dependent recognizers were evaluated based on the National Institute of Standards and Technology (NIST) Language Recognition Evaluation (LRE) 2003 corpus. Three well-known classifiers:...

Filter options

Keywords:
SUPPORT VECTOR MACHINES

Publication date

Set your own date range

Keywords

SPEECH (14)
SPEAKER RECOGNITION (8)
GAUSSIAN PROCESSES (7)
ADAPTATION MODEL (6)
SUPPORT VECTOR MACHINE (6)
TRAINING (6)
FEATURE EXTRACTION (5)
GAUSSIAN MIXTURE MODEL (5)
REGRESSION ANALYSIS (5)
SPEECH PROCESSING (5)
MAXIMUM LIKELIHOOD ESTIMATION (4)
NIST (4)
SVM (4)
AUDIO SIGNAL PROCESSING (3)
MATHEMATICAL MODEL (3)
MAXIMUM LIKELIHOOD LINEAR REGRESSION (3)
NATURAL LANGUAGE PROCESSING (3)
SPEECH RECOGNITION (3)
TRANSFORMS (3)
ACCURACY (2)
ACOUSTICS (2)
CEPSTRAL ANALYSIS (2)
COMPUTATIONAL MODELING (2)
DATA MINING (2)
GAUSSIAN MIXTURE MODELS (2)
HIDDEN MARKOV MODELS (2)
KERNEL (2)
LANGUAGE IDENTIFICATION (2)
MLLR (2)
PATHOLOGY (2)
POSTERIOR PROBABILITIES (2)
SYSTEM PERFORMANCE (2)
UNIVERSAL BACKGROUND MODEL (2)
ACOUSTIC FEATURES (1)
ACOUSTIC MODEL (1)
ACOUSTIC MODELS (1)
ACOUSTIC SIGNAL DETECTION (1)
AUDIO ANALYSIS (1)
AUDIO DETECTION TASK (1)
AUDIO FEATURE EXTRACTION (1)
AUDIO SOUNDS (1)
AUDIO STREAM (1)
AUDIO STREAMING (1)
AUTOMATIC DATA DRIVEN TECHNIQUE (1)
AUTOMATIC DETECTION (1)
AUTOMATIC PATHOLOGICAL VOICE DETECTION (1)
AUXILIARY THERAPY OPERATION (1)
AVERAGE CORRELATION COEFFICIENT (1)
BACKEND CLASSIFIERS (1)
BACKEND FEATURES (1)
BACKGROUND NOISE (1)
BAYES METHODS (1)
BAYESIAN LOGISTIC REGRESSION MODEL (1)
BAYESIAN LOGISTIC REGRESSION MODELS (1)
BAYESIAN METHODS (1)
BRIGHTNESS (1)
CHANNEL EFFECT (1)
CHANNEL EFFECT REDUCTION (1)
CHANNEL VARIABILITY (1)
CHINESE DATABASE (1)
CHINESE PART-OF-SPEECH TAGGING (1)
CLASSIFICATION ALGORITHMS (1)
COMBINING MAP (1)
CORRELATION (1)
DATA MODELS (1)
DATABASES (1)
DISCRIMINATIVE BACKGROUND DATASET (1)
DYNAMIC RANGE (1)
DYSPHONIC VOICING (1)
EIGENVALUES AND EIGENFUNCTIONS (1)
EIGENVOICE COEFFICIENTS (1)
EMOTION RECOGNITION (1)
EQUATIONS (1)
EVENT DETECTION (1)
FACE (1)
FACTOR ANALYSIS (1)
FALSE ALARMS (1)
FEATURE NORMALIZATION METHOD (1)
FEATURE VECTOR (1)
FEEDFORWARD NEURAL NETS (1)
FEEDFORWARD NEURAL NETWORK (1)
GAUSSIAN DISTRIBUTION (1)
GMM (1)
GMM SUPER VECTOR (1)
GMM UNIVERSAL BACKGROUND MODEL (1)
GMM-BASED AUDIO EVENT DETECTION SYSTEM (1)
GMM-MLLR APPROACH (1)
GMM-SVM METHOD (1)
GMM-SVM SPEAKER VERIFICATION (1)
HARMONIC ANALYSIS (1)
HARMONIC STRUCTURE (1)
HARMONIC STRUCTURE FEATURES (1)
HARMONICS (1)
HIERARCHICAL SYSTEM DESIGN (1)
HIERARCHICAL SYSTEMS (1)
HUMANS (1)
IMPOSTOR CONFIDENCE (1)
INTEGRATED CIRCUIT MODELING (1)
LABELING (1)
more

INFONA - science communication portal

Search results for: Yonghong Yan

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options