Recently, we have proposed a general adaptation scheme for deep neural networks based on discriminant condition codes and applied it to supervised speaker adaptation in speech recognition based on either the frame-level cross-entropy or the sequence-level maximum mutual information training criterion [1, 2, 3, 4]. In this case, each condition code is associated with one speaker in the data, which is thus called...
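The condition-code idea can be illustrated with a toy forward pass (a minimal sketch with made-up dimensions and a two-layer network; none of the names or sizes come from [1, 2, 3, 4]): a per-speaker code vector is appended to the acoustic input, and during adaptation only the codes would be re-estimated while the base network stays fixed.

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative dimensions (hypothetical, not from the paper):
feat_dim, code_dim, hidden_dim, out_dim = 40, 8, 64, 10

# One condition code per speaker; in adaptation, only these vectors
# would be updated while the shared network weights stay fixed.
speaker_codes = {"spk1": rng.normal(size=code_dim),
                 "spk2": rng.normal(size=code_dim)}

# Base network weights (would come from supervised training in practice).
W1 = rng.normal(size=(hidden_dim, feat_dim + code_dim)) * 0.1
b1 = np.zeros(hidden_dim)
W2 = rng.normal(size=(out_dim, hidden_dim)) * 0.1
b2 = np.zeros(out_dim)

def forward(x, speaker):
    """Forward pass with the speaker's condition code appended to the input."""
    z = np.concatenate([x, speaker_codes[speaker]])
    h = np.maximum(0.0, W1 @ z + b1)      # ReLU hidden layer
    logits = W2 @ h + b2
    e = np.exp(logits - logits.max())     # softmax over output classes
    return e / e.sum()

frame = rng.normal(size=feat_dim)
p = forward(frame, "spk1")                # posterior over out_dim classes
```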
Recently, it has been reported that context-dependent deep neural networks (DNNs) have achieved unprecedented gains in many challenging ASR tasks, including the well-known Switchboard task. In this paper, we first investigate DNNs for several large vocabulary speech recognition tasks. Our results have confirmed that DNNs can consistently achieve about a 25–30% relative error reduction over the best...
Convolutional Neural Networks (CNNs) have shown success in achieving translation invariance for many image processing tasks. This success is largely attributed to the use of local filtering and max-pooling in the CNN architecture. In this paper, we propose to apply CNNs to speech recognition within the framework of the hybrid NN-HMM model. We propose to use local filtering and max-pooling in the frequency domain...
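Local filtering and max-pooling along the frequency axis can be sketched as follows (a minimal NumPy toy with made-up dimensions, not the paper's CNN): each small filter slides over frequency only, and pooling keeps the largest activation in each band, which gives some tolerance to small spectral shifts.

```python
import numpy as np

rng = np.random.default_rng(1)

n_freq, filt_width, n_filters, pool = 40, 8, 4, 2

spectrum = rng.normal(size=n_freq)                  # one frame of filter-bank features
filters = rng.normal(size=(n_filters, filt_width))  # local filters shared across frequency

# Local filtering: each filter slides along the frequency axis only.
n_pos = n_freq - filt_width + 1
conv = np.empty((n_filters, n_pos))
for k in range(n_filters):
    for i in range(n_pos):
        conv[k, i] = filters[k] @ spectrum[i:i + filt_width]
act = np.maximum(0.0, conv)                         # nonlinearity

# Max-pooling in frequency: keep the largest activation per band of `pool`
# positions; small spectral shifts then change the output little.
pooled = act[:, : (n_pos // pool) * pool].reshape(n_filters, -1, pool).max(axis=2)
```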
In this paper, we have proposed a new method to construct an auxiliary function for the discriminative training of HMMs in speech recognition. The new auxiliary function serves as a first-order approximation of the original objective function and, more importantly, remains a lower bound of it as well. Furthermore, the trust region (TR) method in [1] is applied to find...
We present a novel discriminative training algorithm for n-gram language models for use in large vocabulary continuous speech recognition. The algorithm uses large margin estimation (LME) to build an objective function for maximizing the minimum margin between correct transcriptions and their competing hypotheses, which are encoded as word graphs generated from the Viterbi decoding process. The nonlinear...
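The minimum-margin quantity that LME maximizes can be sketched with a toy computation (illustrative made-up scores; a real system would read the competing hypotheses from the word graphs produced by Viterbi decoding):

```python
def min_margin(utterances):
    """Minimum margin over a training set: for each utterance, the score of
    the correct transcription minus the best competing-hypothesis score."""
    margins = []
    for ref_score, competitor_scores in utterances:
        margins.append(ref_score - max(competitor_scores))
    return min(margins)

# Hypothetical log-scores: (reference score, list of competitor scores).
data = [(-120.0, [-125.0, -130.5]),   # margin  5.0: correctly recognized
        (-98.5,  [-97.0, -101.2])]    # margin -1.5: a competitor wins
m = min_margin(data)                  # → -1.5
```

LME would then adjust the model parameters so this minimum margin grows, pushing the worst-separated utterance away from its competitors.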
This paper presents a novel discriminative training algorithm for n-gram language models for use in large vocabulary continuous speech recognition. The algorithm uses Maximum Mutual Information Estimation (MMIE) to build an objective function that involves a metric computed between correct transcriptions and their competing hypotheses, which are encoded as word graphs generated from the Viterbi decoding...
In this paper, we present a new fast optimization method to solve large margin estimation (LME) of continuous density hidden Markov models (CDHMMs) for speech recognition based on second order cone programming (SOCP). SOCP is a class of nonlinear convex optimization problems which can be solved very efficiently. In this work, we have formulated the LME of CDHMMs as an SOCP problem and proposed two...
In this paper, we present a new optimization method for MMIE-based discriminative training of HMMs in speech recognition. In our method, the MMIE training of Gaussian mixture HMMs is formulated as a so-called trust region problem, where a quadratic objective function is minimized under a spherical constraint, so that an efficient global optimization method for the trust region problem can be used...
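The trust region subproblem named here, a quadratic minimized under a spherical constraint, admits an efficient global solution via the Lagrange multiplier of the constraint. Below is a minimal sketch by bisection on that multiplier (an illustrative solver, not the paper's implementation; the degenerate "hard case" is ignored):

```python
import numpy as np

def trust_region_solve(A, b, delta, tol=1e-10):
    """Globally minimize 0.5 x^T A x + b^T x subject to ||x|| <= delta.

    Uses the standard characterization x(lam) = -(A + lam*I)^{-1} b with
    lam >= max(0, -eigmin(A)), and bisects on lam until ||x(lam)|| = delta.
    The hard case (b orthogonal to the bottom eigenvector) is not handled.
    """
    n = len(b)
    eigmin = np.linalg.eigvalsh(A)[0]
    # Interior solution if A is positive definite and the Newton step fits.
    if eigmin > 0:
        x = np.linalg.solve(A, -b)
        if np.linalg.norm(x) <= delta:
            return x
    lam_lo = max(0.0, -eigmin) + 1e-12
    lam_hi = lam_lo + 1.0
    # Grow lam_hi until the step is inside the sphere; ||x(lam)|| decreases in lam.
    while np.linalg.norm(np.linalg.solve(A + lam_hi * np.eye(n), -b)) > delta:
        lam_hi *= 2.0
    for _ in range(200):
        lam = 0.5 * (lam_lo + lam_hi)
        x = np.linalg.solve(A + lam * np.eye(n), -b)
        if np.linalg.norm(x) > delta:
            lam_lo = lam
        else:
            lam_hi = lam
        if lam_hi - lam_lo < tol:
            break
    return x
```

Even when A is indefinite (as a quadratic model of an MMIE objective may be), the boundary solution found this way is the global minimizer of the subproblem.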
Recently, we proposed a novel optimization algorithm called constrained line search (CLS) to train Gaussian mean vectors of HMMs in the MMI sense. In this paper, we extend and re-formulate it in a more general framework. The new CLS can optimize any discriminative objective functions including MMI, MCE, MPE/MWE etc. Also, closed-form solutions to update all Gaussian mixture parameters, including means,...
In this paper, we propose a novel constrained line search to optimize the MMI objective function for training discriminative HMMs. In our method, the MMI estimation is cast as a constrained maximization problem, where the Kullback-Leibler divergence between the models before and after parameter adjustment is introduced as a constraint during optimization. Then, based on the idea of line search, we show...
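The KL-constrained step can be sketched for a single diagonal-covariance Gaussian mean (an illustrative simplification, not the paper's full derivation): a step along the gradient is shrunk so that the divergence between the old and new model stays within a trust radius `rho`. For two Gaussians sharing the same diagonal variance, the KL divergence reduces to a scaled squared distance between means, so it grows quadratically with the step size.

```python
import numpy as np

def kl_same_cov(mu_old, mu_new, var):
    """KL divergence between two diagonal Gaussians sharing variance `var`."""
    d = mu_new - mu_old
    return 0.5 * np.sum(d * d / var)

def constrained_mean_update(mu, var, grad, rho):
    """Move the mean along the ascent direction, then shrink the step so
    that KL(old || new) stays within the trust radius rho."""
    step = grad                             # full step along the gradient
    kl = kl_same_cov(mu, mu + step, var)
    if kl > rho:
        step = step * np.sqrt(rho / kl)     # KL is quadratic in the step size
    return mu + step

mu = np.array([0.0, 0.0])
var = np.array([1.0, 4.0])
grad = np.array([3.0, 2.0])                 # illustrative gradient values
new_mu = constrained_mean_update(mu, var, grad, rho=0.5)
```

After shrinking, the new mean sits exactly on the KL trust boundary whenever the unconstrained step would have overshot it.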
We propose to use minimum divergence (MD), where acoustic similarity between HMMs is characterized by Kullback-Leibler divergence (KLD), for discriminative training. The MD objective function is defined as a posterior-weighted divergence measured over the whole training set. Different from our earlier work, where KLD-based acoustic similarity is pre-computed for all initial models and stays invariant in the...
Our previous study on maximum relative margin estimation (MRME) of HMM (C. Liu et al., 2005) demonstrated its advantage over the standard minimum classification error (MCE) training. In this paper, we report our recent improvement on MRME. Specifically, two novel approaches are proposed to handle recognition errors in training sets for the MRME. One is a new training criterion based on a combination...
In this paper, we propose a new optimization method, i.e., constrained joint optimization method, to solve the minimax optimization problem in large margin estimation (LME) of continuous density hidden Markov model (CDHMM) for speech recognition. First, we mathematically analyze the definition of margin and introduce some theoretically-sound constraints into the minimax optimization to guarantee the...
Based on the principle of large margin classifiers, we recently proposed two novel training methods, namely large margin estimation (LME) [8] and maximum relative margin estimation (MRME) [9], for speech recognition. In LME or MRME, HMM parameters are estimated to maximize the minimum margin among all training utterances. However, their original formulation is limited to isolated-word ASR tasks. In this...
In this paper, we propose a dynamic in-search discriminative training approach for large-scale HMMs in large vocabulary speech recognition. A previously proposed data selection method is used to choose competing hypotheses dynamically during the Viterbi beam search procedure. In particular, all active word-ending paths are examined during search against the reference transcription to identify competing...