Sequence-to-sequence models have shown success in end-to-end speech recognition. However, these models have used only shallow acoustic encoder networks. In our work, we successively train very deep convolutional networks to add expressive power and better generalization to end-to-end ASR models. We apply network-in-network principles, batch normalization, residual connections and convolutional...
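The ingredients named in this abstract (batch normalization, residual connections, and network-in-network 1x1 convolutions) can be combined into a minimal NumPy sketch of one encoder block. All shapes, the layer ordering, and the function names below are illustrative assumptions, not the paper's exact architecture:

```python
import numpy as np

def conv1d(x, w):
    # x: (time, channels_in); w: (kernel, channels_in, channels_out).
    # Same-padded convolution over the time axis.
    k, cin, cout = w.shape
    pad = k // 2
    xp = np.pad(x, ((pad, pad), (0, 0)))
    out = np.zeros((x.shape[0], cout))
    for t in range(x.shape[0]):
        out[t] = np.tensordot(xp[t:t + k], w, axes=([0, 1], [0, 1]))
    return out

def batch_norm(x, eps=1e-5):
    # Normalize each channel over the time axis (inference-style sketch,
    # no learned scale/shift).
    mu = x.mean(axis=0, keepdims=True)
    var = x.var(axis=0, keepdims=True)
    return (x - mu) / np.sqrt(var + eps)

def residual_block(x, w3, w1):
    # 3-tap conv -> ReLU -> batch norm -> 1x1 "network-in-network" conv,
    # then a residual (skip) connection back to the block input.
    h = np.maximum(conv1d(x, w3), 0)
    h = batch_norm(h)
    h = conv1d(h, w1)        # 1x1 conv mixes channels only
    return x + h             # residual connection
```

Stacking many such blocks is what lets the encoder grow deep while remaining trainable, since the skip connections keep gradients flowing.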
We propose the prediction-adaptation-correction RNN (PAC-RNN), in which a correction DNN estimates the state posterior probability based on both the current frame and the prediction made from the past frames by a prediction DNN. The result from the correction DNN is fed back to the prediction DNN to make better predictions for the future frames. In the PAC-RNN, we can consider that, given the new, current...
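The predict-and-correct loop described in this abstract can be sketched as follows. This is a minimal illustration of the feedback structure only: both "DNNs" are reduced to single linear layers, and all names and dimensions are assumptions for the sketch, not the paper's model:

```python
import numpy as np

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

def pac_rnn(frames, W_corr, W_pred, n_states):
    # frames: (T, feat_dim). At each step, the correction net sees the
    # current observation concatenated with the prediction net's forecast;
    # the correction output is fed back so the prediction net can forecast
    # the next frame's state posterior.
    pred = np.zeros(n_states)               # initial (uninformative) prediction
    posteriors = []
    for x in frames:
        corr_in = np.concatenate([x, pred])
        post = softmax(W_corr @ corr_in)    # "correction DNN" (one layer here)
        pred = softmax(W_pred @ post)       # "prediction DNN" forecasts ahead
        posteriors.append(post)
    return np.array(posteriors)
```

The recurrence arises purely from feeding the correction output back through the prediction step, which is the defining feature of the PAC structure.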
This paper presents a discriminative training (DT) approach to irrelevant variability normalization (IVN) based training of feature transforms and hidden Markov models for large vocabulary continuous speech recognition. A speaker-clustering based method is used for acoustic sniffing and maximum mutual information (MMI) is used as a training criterion. Combined with unsupervised adaptation of feature...
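The MMI criterion mentioned in this abstract maximizes the log posterior of the reference transcription against all competing hypotheses. A minimal per-utterance sketch, with the function name and the flat list of competitor scores being illustrative assumptions (real systems sum over lattices):

```python
import numpy as np

def mmi_utterance_score(ref_logscore, all_logscores):
    # MMI objective for one utterance: the reference hypothesis' joint
    # log-score (acoustic + language model), normalized over all
    # competing hypotheses via log-sum-exp.
    denom = np.logaddexp.reduce(all_logscores)
    return ref_logscore - denom
```

Training raises this quantity, i.e. it pushes probability mass toward the reference and away from confusable competitors, rather than merely maximizing the reference likelihood as in ML training.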
Current Statistical Machine Translation (SMT) systems translate one sentence at a time, ignoring any document-level information. Consequently, translation models are learned only at the sentence level, and document contexts are generally overlooked. In this paper, we introduce document topics to help the SMT system produce target sentences. First, the parallel training corpus with underlying document...
In HMM-based speech synthesis, we usually use complex, context-dependent models to characterize prosodically and linguistically rich speech units. It is therefore difficult to prepare training data that can cover all combinatorial possibilities of contexts. A common approach to coping with this insufficient-training-data problem is to build a clustered tree via the MDL criterion. However, an MDL-based...
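The MDL-based tree clustering referred to here trades likelihood gain against model complexity: a node split is accepted only when the log-likelihood improvement exceeds a description-length penalty for the added parameters. A minimal sketch of that standard stopping rule, with all argument names as illustrative assumptions:

```python
import numpy as np

def mdl_split_gain(ll_parent, ll_left, ll_right, n_params_per_node, n_frames):
    # Accept a tree split only if the log-likelihood gain from splitting
    # exceeds the MDL description-length penalty for the extra parameters:
    # penalty = (1/2) * (added parameters) * log(data size).
    gain = (ll_left + ll_right) - ll_parent
    penalty = 0.5 * n_params_per_node * np.log(n_frames)
    return gain - penalty  # split if positive
```

Because the penalty grows with the number of parameters but only logarithmically with the amount of data, the criterion automatically limits tree size on sparse contexts, which is exactly the behavior the abstract goes on to critique.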
We present a Bayesian evidence framework, which can learn both the prior distributions and posterior distributions from data, for continuous-density hidden Markov models (CDHMMs). The goal of this study is to build regularized CDHMMs that improve model generalization and achieve desirable recognition performance on unseen test speech. Under this framework, we develop an EM iterative procedure...
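The EM alternation underlying such procedures can be illustrated on a much simpler model. The toy below fits a two-component 1D Gaussian mixture; it shows only the generic E-step/M-step structure, not the paper's evidence-based procedure for CDHMMs, and all names are assumptions of the sketch:

```python
import numpy as np

def em_gmm_1d(x, n_iter=50):
    # Minimal EM for a two-component 1D Gaussian mixture.
    mu = np.array([x.min(), x.max()])      # spread-apart initialization
    var = np.array([x.var(), x.var()])
    pi = np.array([0.5, 0.5])
    for _ in range(n_iter):
        # E-step: posterior responsibility of each component per sample
        dens = pi * np.exp(-0.5 * (x[:, None] - mu) ** 2 / var) \
               / np.sqrt(2 * np.pi * var)
        resp = dens / dens.sum(axis=1, keepdims=True)
        # M-step: re-estimate weights, means, variances from responsibilities
        nk = resp.sum(axis=0)
        pi = nk / len(x)
        mu = (resp * x[:, None]).sum(axis=0) / nk
        var = (resp * (x[:, None] - mu) ** 2).sum(axis=0) / nk + 1e-6
    return pi, mu, var
```

A CDHMM replaces this closed-form M-step with state-level statistics accumulated by the forward-backward algorithm, and the Bayesian treatment additionally updates hyperparameters of the priors.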