Search results for: Herve Bourlard

Items from 1 to 4 out of 4 results

chapter

Low-rank and sparse soft targets to learn better DNN acoustic models

Pranay Dighe, Afsaneh Asaei, Herve Bourlard

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5265 - 5269

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Conventional deep neural networks (DNN) for speech acoustic modeling rely on Gaussian mixture models (GMM) and hidden Markov model (HMM) to obtain binary class labels as the targets for DNN training. Subword classes in speech recognition systems correspond to context-dependent tied states or senones. The present work addresses some limitations of GMM-HMM senone alignments for DNN training. We hypothesize...

chapter

Using KL-divergence and multilingual information to improve ASR for under-resourced languages

David Imseng, Herve Bourlard, Philip N. Garner

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4869 - 4872

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

Setting out from the point of view that automatic speech recognition (ASR) ought to benefit from data in languages other than the target language, we propose a novel Kullback-Leibler (KL) divergence based method that is able to exploit multilingual information in the form of universal phoneme posterior probabilities conditioned on the acoustics. We formulate a means to train a recognizer on several...

chapter

Posterior features for template-based ASR

Serena Soldo, Mathew Magimai.-Doss, Joel Pinto, Herve Bourlard

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4864 - 4867

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper investigates the use of phoneme class conditional probabilities as features (posterior features) for template-based ASR. Using 75 words and 600 words task-independent and speaker-independent setup on Phonebook database, we investigate the use of different posterior distribution estimators, different distance measures that are better suited for posterior distributions, and different training...

chapter

Analysis of phone posterior feature space exploiting class-specific sparsity and MLP-based similarity measure

Afsaneh Asaei, Benjamin Picart, Herve Bourlard

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 4886 - 4889

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

Class posterior distributions have recently been used quite successfully in Automatic Speech Recognition (ASR), either for frame or phone level classification or as acoustic features, which can be further exploited (usually after some “ad hoc” transformations) in different classifiers (e.g., in Gaussian Mixture based HMMs). In the present paper, we show preliminary results showing that it may be possible...

Filter options

Keywords:
TRAINING DATA

Publication date

Set your own date range

Keywords

HIDDEN MARKOV MODELS (3)
SPEECH RECOGNITION (3)
ACCURACY (2)
AUTOMATIC SPEECH RECOGNITION (2)
SPEECH (2)
ACOUSTIC VECTORS (1)
CLASS POSTERIOR DISTRIBUTIONS (1)
CLASS-SPECIFIC SPARSITY (1)
DICTIONARIES (1)
ENCODING (1)
FAST TRAINING (1)
KNN CLASSIFIER (1)
KNN PHONE CLASSIFICATION RATES (1)
KULLBACK-LEIBLER DIVERGENCE (1)
MEASUREMENT (1)
MLP-BASED SIMILARITY MEASURE (1)
MULTILAYER PERCEPTRONS (1)
MULTILINGUAL SPEECH RECOGNITION (1)
NEURAL NETWORK FEATURES (1)
PATTERN CLASSIFICATION (1)
PHONE POSTERIOR FEATURE SPACE (1)
POSTERIOR FEATURE SPACE (1)
POSTERIOR FEATURES (1)
POSTERIOR SPACE PROPERTIES (1)
POSTERIOR-BASED METRICS (1)
PRINCIPAL COMPONENT ANALYSIS (1)
PRINCIPLE COMPONENT ANALYSIS (1)
SOFT TARGETS (1)
SPARSE CODING (1)
SUPPORT VECTOR MACHINE CLASSIFICATION (1)
TEMPLATE-BASED APPROACH (1)
TIMIT DATABASE (1)
UNTRANSCRIBED DATA (1)
VOCABULARY (1)
more

INFONA - science communication portal

Search results for: Herve Bourlard

Low-rank and sparse soft targets to learn better DNN acoustic models

Using KL-divergence and multilingual information to improve ASR for under-resourced languages

Posterior features for template-based ASR

Analysis of phone posterior feature space exploiting class-specific sparsity and MLP-based similarity measure

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options