Search results for: H. Ney

Items from 1 to 5 out of 5 results

chapter

Feature selection for log-linear acoustic models

S. Wiesler, A. Richard, Y. Kubo, R. Schlüter, et al.

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5324 - 5327

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Log-linear acoustic models have been shown to be competitive with Gaussian mixture models in speech recognition. Their high training time can be reduced by feature selection. We compare a simple univariate feature selection algorithm with ReliefF - an efficient multivariate algorithm. An alternative to feature selection is ℓ₁-regularized training, which leads to sparse models. We observe that this...

chapter

Evaluation of automatic transcription systems for the judicial domain

J. Lööf, D. Falavigna, R. Schlüter, D. Giuliani, et al.

2010 IEEE Spoken Language Technology Workshop > 206 - 211

2010 IEEE Spoken Language Technology Workshop (SLT 2010)

This paper describes two different automatic transcription systems developed for judicial application domains for the Polish and Italian languages. The judicial domain requires to cope with several factors which are known to be critical for automatic speech recognition, such as: background noise, reverberation, spontaneous and accented speech, overlapped speech, cross channel effects, etc. The two...

chapter

Investigations on features for log-linear acoustic models in continuous speech recognition

S. Wiesler, M. Nussbaum-Thom, G. Heigold, R. Schlüter, et al.

2009 IEEE Workshop on Automatic Speech Recognition & Understanding > 52 - 57

2009 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU 2009)

Hidden Markov Models with Gaussian Mixture Models as emission probabilities (GHMMs) are the underlying structure of all state-of-the-art speech recognition systems. Using Gaussian mixture distributions follows the generative approach where the class-conditional probability is modeled, although for classification only the posterior probability is needed. Though being very successful in related tasks...

chapter

Modified MPE/MMI in a transducer-based framework

G. Heigold, R. Schlüter, H. Ney

2009 IEEE International Conference on Acoustics, Speech and Signal Processing > 3749 - 3752

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

In this paper we show how common training criteria like for example MPE or MMI can be extended to incorporate a margin term. In addition, a transducer-based training implementation is presented, which covers a large variety of discriminative training criteria for ASR, including the standard MMI, MPE, and MCE criteria, as well as the modifications to these criteria presented here. The modified criteria...

chapter

Audio segmentation for speech recognition using segment features

D. Rybach, C. Gollan, R. Schlüter, H. Ney

2009 IEEE International Conference on Acoustics, Speech and Signal Processing > 4197 - 4200

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

Audio segmentation is an essential preprocessing step in several audio processing applications with a significant impact e.g. on speech recognition performance. We introduce a novel framework which combines the advantages of different well known segmentation methods. An automatically estimated log-linear segment model is used to determine the segmentation of an audio stream in a holistic way by a...

Filter options

Keywords:
ACOUSTICS

Keywords

TRAINING (4)
OPTIMIZATION (2)
POLYNOMIALS (2)
SPEECH (2)
ℓ₁-REGULARIZATION (1)
ACCURACY (1)
ACOUSTIC MODELING (1)
ADAPTATION MODEL (1)
AUDIO PROCESSING (1)
AUDIO SEGMENTATION (1)
AUDIO STREAM (1)
AUDIO STREAMING (1)
AUTOMATIC SPEECH RECOGNITION (1)
AUTOMATIC SPEECH RECOGNITION SYSTEM (1)
AUTOMATIC TRANSCRIPTION (1)
AUTOMATIC TRANSCRIPTION SYSTEM (1)
BROADCAST NEWS TRANSCRIPTION (1)
CLASS-CONDITIONAL PROBABILITY (1)
COMPLEXITY THEORY (1)
CONTEXT (1)
CONTINUOUS SPEECH RECOGNITION SYSTEM (1)
CROSS-CHANNEL EFFECTS (1)
DATA DEPENDENT SPARSE FEATURES (1)
DECODING (1)
DOMAIN ADAPTATION (1)
EMISSION PROBABILITIES (1)
ERROR ANALYSIS (1)
FASTENERS (1)
FEATURE EXTRACTION (1)
FEATURE SELECTION (1)
FINITE STATE MACHINES (1)
FINITE STATE TRANSDUCER-BASED TRAINING FRAMEWORK (1)
GAUSSIAN MIXTURE MODELS (1)
ITALIAN LANGUAGES (1)
JUDICIAL APPLICATION DOMAINS (1)
JUDICIAL DOMAIN (1)
LARGE MARGIN (1)
LEARNING (ARTIFICIAL INTELLIGENCE) (1)
LOG-LINEAR ACOUSTIC MODELS (1)
LOG-LINEAR MODELS (1)
LOG-LINEAR SEGMENT MODEL (1)
MAXIMUM A POSTERIORI DECODING (1)
MAXIMUM LIKELIHOOD ESTIMATION (1)
MCE (1)
MEL FREQUENCY CEPSTRAL COEFFICIENT (1)
MICROPHONES (1)
MMI (1)
MPE (1)
NATURAL LANGUAGE PROCESSING (1)
POLISH LANGUAGES (1)
POSTERIOR PROBABILITY (1)
RELIEFF (1)
SEGMENT FEATURES (1)
SIGNAL CLASSIFICATION (1)
SPEECH CODING (1)
STATISTICAL DISTRIBUTIONS (1)
SUPPORT VECTOR MACHINES (1)
SVM (1)
TRAINING CRITERIA (1)
WALL STREET JOURNAL CORPUS (1)
WEIGHTED FINITE STATE TRANSDUCER (1)

INFONA - science communication portal
