Search results for: Jie Zhu

Items from 1 to 6 out of 6 results

chapter

Blind Speech Dereverberation Based on a Statistical Model

Xulei Bao, Jie Zhu, Zhen Huang

2012 IEEE International Conference on Multimedia and Expo > 467 - 472

2012 IEEE International Conference on Multimedia and Expo (ICME)

We present an algorithm to dereverberate single-channel audio signals in both noisy and noise-free acoustical environments. Recently, model-based dereverberation that uses a statistical model for room impulse responses (RIRs) has been considered a fairly attractive approach for reverberant speech, since existing model-based estimators show that the late reverberant spectral variance (LRSV) is linearly related...

chapter

A Novel Signal-Processing Strategy Based on Bark Wavelet Transform for Cochlear Implants

Yan Shen, Zhi Tao, Ji-Hua Gu, Xiao-Jun Zhang, et al.

2011 First International Workshop on Complexity and Data Mining > 10 - 13

2011 First International Workshop on Complexity and Data Mining (IWCDM)

Cochlear implants are one of the most promising medical applications, aiming to restore deaf patients' hearing as fully as possible. How to improve voice quality by using effective algorithms has become a bottleneck. In our research, a new speech signal processing scheme based on the Bark Wavelet Transform (BWT) is presented. This signal-processing strategy can non-uniformly separate the time-frequency space, which is similar...

chapter

An improved method for predicting fundamental frequency contour in mandarin text-to-speech system with a small corpus

Liang Wang, Jie Zhu, Yao Lv

TENCON 2010 - 2010 IEEE Region 10 Conference > 751 - 754

2010 IEEE Region 10 Conference (TENCON 2010)

In this paper, a method to predict the fundamental frequency contour is proposed for a Mandarin text-to-speech system with a small corpus. First of all, in order to avoid large modifications to the speech clips, two kinds of corpora, a tonal syllable corpus and a high-frequency word corpus, are established. Afterwards, we apply two rules to predict the pitch contour of speech. Firstly, the traditional Fujisaki model...

chapter

An efficient multistage Rover method for Automatic Speech recognition

Haihua Xu, Jie Zhu, Guanyong Wu

2009 IEEE International Conference on Multimedia and Expo > 894 - 897

2009 IEEE International Conference on Multimedia and Expo (ICME)

In this paper, we implemented a multistage recognizer output voting error reduction (ROVER) method for better automatic speech recognition (ASR). The first-stage ROVER combines three recognizers, trained respectively with the maximum likelihood estimation (MLE), minimum phone error (MPE), and recently proposed boosted maximum mutual information (BMMI) criteria. After that the...

chapter

Minimum phone error based stream weight training for mandarin audio-visual Speech recognition

Guanyong Wu, Jie Zhu, Haihua Xu

2009 IEEE International Conference on Multimedia and Expo > 902 - 905

2009 IEEE International Conference on Multimedia and Expo (ICME)

Stream weight training is one of the key issues in bimodal integration for audio-visual speech recognition. In this paper, audio-only and video-only HMM classifiers are combined to perform audio-visual speech recognition. More specifically, a discriminative training method is provided, in which the state-dependent stream weights are trained based on lattice rescoring by the minimum phone...

chapter

Towards more efficient and accurate methods for Mandarin LVCSR discriminative training

Haihua Xu, Jie Zhu

2008 IEEE International Conference on Multimedia and Expo > 981 - 984

2008 IEEE International Conference on Multimedia and Expo (ICME)

Discriminative training for Mandarin large vocabulary continuous speech recognition (LVCSR) has improved remarkably in the speech community in recent years. However, much work still needs further investigation. In this work, we focus on improvements to two aspects of discriminative training, in particular those related to the minimum phone error (MPE) training method in Mandarin speech recognition. One...

