Search results for: Jianping Zhang

Items from 1 to 6 out of 6 results

chapter

Perceptual MVDR-based cepstral coefficients (PMCCs) for speaker recognition

Chunyan Liang, Xiang Zhang, Lin Yang, Jianping Zhang, et al.

IEEE 10th INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS > 1386 - 1389

2010 10th International Conference on Signal Processing (ICSP 2010)

Acoustic feature extraction from speech is a fundamental part of both automatic speech recognition and automatic speaker recognition. Mel-frequency cepstral coefficients (MFCCs) are widely used in both of these research directions. A new feature extraction technique named perceptual MVDR-based cepstral coefficients (PMCCs) has been demonstrated to perform better in automatic speech recognition...

chapter

Speech Emotion Recognition Using Both Spectral and Prosodic Features

Yu Zhou, Yanqing Sun, Jianping Zhang, Yonghong Yan

2009 International Conference on Information Engineering and Computer Science > 1 - 4

2009 International Conference on Information Engineering and Computer Science. ICIECS 2009

In this paper, we propose a speech emotion recognition system using both spectral and prosodic features. Most traditional systems have focused on either spectral features or prosodic features. Since both the spectral and the prosodic features contain emotion information, it is believed that combining spectral and prosodic features will improve the performance of the emotion recognition system...

chapter

Combining MAP and MLLR Approaches for SVM Based Speaker Recognition with a Multi-class MLLR Technique

Haipeng Wang, Xiang Zhang, Xiang Xiao, Jianping Zhang, et al.

2009 Second International Symposium on Information Science and Engineering > 447 - 450

Second International Symposium on Information Science and Engineering (ISISE 2009)

Gaussian mixture models with a universal background model (UBM) have been the standard method for speaker recognition. Typically, maximum a posteriori (MAP) or maximum likelihood linear regression (MLLR) is used to adapt the means of the UBM. Together with the SVM modeling technique, these approaches can achieve excellent performance. MLLR is quite efficient when the amount of adaptation data is...

chapter

Automatic Detection of Pathological Voices Using GMM-MLLR Approach

Xiang Wang, Jianping Zhang, Yonghong Yan

2009 2nd International Conference on Biomedical Engineering and Informatics > 1 - 4

2009 2nd International Conference on Biomedical Engineering and Informatics (BMEI)

Modern lifestyles have increased the risk of voice disorders. It is estimated that nearly 19% of the population have suffered from dysphonic voicing, so it is very important to detect pathological voices automatically. Many classification methods have been used for this task and have achieved good results. In this paper, we focus on the automatic detection...

chapter

High Quality Voice Conversion through Phoneme-Based Linear Mapping Functions with STRAIGHT for Mandarin

Kun Liu, Jianping Zhang, Yonghong Yan

Fourth International Conference on Fuzzy Systems and Knowledge Discovery (FSKD 2007) > 4 > 410 - 414

2007 International Conference on Fuzzy Systems and Knowledge Discovery

A novel voice conversion system using phoneme-based linear mapping functions on main vowel phonemes is proposed in this paper. Our voice conversion algorithm has the following three improvements. First, instead of using all the vocal tract resonance (VTR) vectors within a phoneme, we use the VTR vector at the steady state of each phoneme to train a phoneme-based GMM. Second, different linear...

chapter

High Quality Voice Conversion through Combining Modified GMM and Formant Mapping for Mandarin

Kun Liu, Jianping Zhang, Yonghong Yan

2007 Second International Conference on Digital Telecommunications (ICDT'07) > 10

Second International Conference on Digital Telecommunications, ICDT 2007

A novel voice conversion system using formant mapping based on a modified GMM technique is proposed in this paper. Compared with the traditional GMM technique, our modified GMM technique automatically selects the stable frames in each vowel phoneme for parameter extraction, which avoids using parameters from the transition part. With the spectral parameters extracted from the stable frames, phoneme-based...

Filter options

Keywords:
GAUSSIAN PROCESSES

Keywords

GAUSSIAN MIXTURE MODEL (5)
SPEECH (3)
SUPPORT VECTOR MACHINES (3)
ACOUSTIC SIGNAL PROCESSING (2)
ADAPTATION MODEL (2)
FEATURE EXTRACTION (2)
GMM (2)
MANDARIN (2)
MAXIMUM LIKELIHOOD ESTIMATION (2)
MAXIMUM LIKELIHOOD LINEAR REGRESSION (2)
NIST (2)
SPEAKER RECOGNITION (2)
ACOUSTIC FEATURE EXTRACTION (1)
ACOUSTIC SIGNAL DETECTION (1)
ADAPTIVE INTERPOLATION (1)
AUDIO SIGNAL PROCESSING (1)
AUTOMATIC DETECTION (1)
AUTOMATIC SPEAKER RECOGNITION (1)
AUTOMATIC SPEECH RECOGNITION (1)
CEPSTRAL ANALYSIS (1)
CHANNEL BANK FILTERS (1)
CHANNEL VARIABILITY EFFECT (1)
COMBINING MAP (1)
COMPUTATIONAL MODELING (1)
DATA MINING (1)
DATABASES (1)
DYSPHONIC VOICING (1)
EMOTION RECOGNITION (1)
EQUATIONS (1)
FFT SPECTRUM (1)
FORMANT FREQUENCY (1)
FORMANT MAPPING (1)
GAUSSIAN MIXTURE MODELS (1)
GMM SUPER VECTOR (1)
GMM-MLLR APPROACH (1)
HIDDEN MARKOV MODELS (1)
HIGH QUALITY VOICE CONVERSION SYSTEM (1)
INTERVIEWS (1)
JFA (1)
JOINT FACTOR ANALYSIS (1)
KERNEL (1)
LOADING (1)
MATHEMATICAL MODEL (1)
MAXIMUM A POSTERIORI (1)
MEDICAL SIGNAL DETECTION (1)
MEL FREQUENCY CEPSTRAL COEFFICIENT (1)
MEL-FREQUENCY CEPSTRAL COEFFICIENT (1)
MEL-SCALED FILTERBANK (1)
MFCC (1)
MINIMUM VARIANCE DISTORTIONLESS RESPONSE (1)
MLLR (1)
MLLR APPROACHES (1)
MULTICLASS MLLR TECHNIQUE (1)
MVDR (1)
PATHOLOGICAL VOICES (1)
PATHOLOGY (1)
PERCEPTUAL MVDR-BASED CEPSTRAL COEFFICIENT (1)
PHONEME-BASED GMM MODEL (1)
PHONEME-BASED LINEAR MAPPING FUNCTION (1)
PMCC (1)
PROSODIC FEATURES (1)
REGRESSION ANALYSIS (1)
SPECTRAL ANALYSIS (1)
SPECTRAL ENVELOPE (1)
SPECTRAL FEATURES (1)
SPECTRAL PARAMETER EXTRACTION (1)
SPEECH EMOTION RECOGNITION (1)
SPEECH MANIPULATION (1)
SPEECH MANIPULATION FRAMEWORK (1)
SPEECH PROCESSING (1)
SPEECH RE-SYNTHESIS (1)
SPEECH RECOGNITION (1)
SPEECH REPRESENTATION (1)
SPEECH SYNTHESIS (1)
SPEECH TRANSFORMATION (1)
SUPPORT VECTOR MACHINE (1)
SVM (1)
SVM BASED SPEAKER RECOGNITION (1)
TEST DATABASE (1)
TRANSFORMS (1)
UBM (1)
UNIVERSAL BACKGROUND MODEL (1)
VOCAL TRACT RESONANCE VECTOR (1)
VOICE CONVERSION (1)
VOICE CONVERSION ALGORITHM (1)
VOICE CONVERSION SYSTEM (1)
VOICE DISORDERS (1)
VOWEL PHONEME (1)
VOWEL PHONEMES (1)
WEIGHTED SPECTROGRAM (1)

INFONA - science communication portal
