Search results for: S. Thomas

Items from 1 to 6 out of 6 results

chapter

Speech recognitionwith segmental conditional random fields: A summary of the JHU CLSP 2010 Summer Workshop

G. Zweig, P. Nguyen, D. Van Compernolle, K. Demuynck, more

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5044 - 5047

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper summarizes the 2010 CLSP Summer Workshop on speech recognition at Johns Hopkins University. The key theme of the workshop was to improve on state-of-the-art speech recognition systems by using Segmental Conditional Random Fields (SCRFs) to integrate multiple types of information. This approach uses a state-of-the-art baseline as a springboard from which to add a suite of novel features...

chapter

Alpha-Numerical Sequences Extraction in Handwritten Documents

S Thomas, Clément Chatelain, L Heutte, T Paquet

2010 12th International Conference on Frontiers in Handwriting Recognition > 232 - 237

2010 12th International Conference on Frontiers in Handwriting Recognition (ICFHR 2010)

In this paper, we introduce an alpha-numerical sequences extraction system (keywords, numerical fields or alpha-numerical sequences) in unconstrained handwritten documents. Contrary to most of the approaches presented in the literature, our system relies on a global handwriting line model describing two kinds of information : i) the relevant information and ii) the irrelevant information represented...

chapter

An Information Extraction Model for Unconstrained Handwritten Documents

S Thomas, C Chatelain, L Heutte, T Paquet

2010 20th International Conference on Pattern Recognition > 3412 - 3415

2010 20th International Conference on Pattern Recognition (ICPR 2010)

In this paper, a new information extraction system by statistical shallow parsing in unconstrained handwritten documents is introduced. Unlike classical approaches found in the literature as keyword spotting or full document recognition, our approach relies on a strong and powerful global handwriting model. A entire text line is considered as an indivisible entity and is modeled with Hidden Markov...

chapter

A novel estimation of feature-space MLLR for full-covariance models

A Ghoshal, D Povey, M Agarwal, P Akyazi, more

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 4310 - 4313

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

In this paper we present a novel approach for estimating feature-space maximum likelihood linear regression (fMLLR) transforms for full-covariance Gaussian models by directly maximizing the likelihood function by repeated line search in the direction of the gradient. We do this in a pre-transformed parameter space such that an approximation to the expected Hessian is proportional to the unit matrix...

chapter

Multilingual acoustic modeling for speech recognition based on subspace Gaussian Mixture Models

L Burget, P Schwarz, M Agarwal, P Akyazi, more

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 4334 - 4337

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

Although research has previously been done on multilingual speech recognition, it has been found to be very difficult to improve over separately trained systems. The usual approach has been to use some kind of “universal phone set” that covers multiple languages. We report experiments on a different approach to multilingual speech recognition, in which the phone sets are entirely distinct but the...

chapter

Phoneme recognition using spectral envelope and modulation frequency features

S. Thomas, S. Ganapathy, H. Hermansky

2009 IEEE International Conference on Acoustics, Speech and Signal Processing > 4453 - 4456

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

We present a new feature extraction technique for phoneme recognition that uses short-term spectral envelope and modulation frequency features. These features are derived from sub-band temporal envelopes of speech estimated using frequency domain linear prediction (FDLP). While spectral envelope features are obtained by the short-term integration of the sub-band envelopes, the modulation frequency...

Filter options

Keywords:
HIDDEN MARKOV MODELS

Publication date

Set your own date range

Keywords

FEATURE EXTRACTION (4)
SPEECH RECOGNITION (4)
SPEECH (3)
ACOUSTICS (2)
DATA MINING (2)
DATABASES (2)
GAUSSIAN PROCESSES (2)
HANDWRITING RECOGNITION (2)
INFORMATION EXTRACTION (2)
INFORMATION RETRIEVAL (2)
MATHEMATICAL MODEL (2)
SHALLOW PARSING MODEL (2)
TRAINING (2)
ACOUSTIC SIGNAL PROCESSING (1)
ADAPTATION MODEL (1)
ALPHA NUMERICAL SEQUENCES EXTRACTION (1)
COMPUTATIONAL MODELING (1)
CRF (1)
DATA MODELS (1)
DETECTORS (1)
DISCRETE COSINE TRANSFORMS (1)
DOCUMENT HANDLING (1)
EQUATIONS (1)
ESTIMATION (1)
FEATURE-SPACE MAXIMUM LIKELIHOOD LINEAR REGRESSION TRANSFORMS (1)
FEATURE-SPACE MLLR (1)
FREQUENCY DOMAIN LINEAR PREDICTION (1)
FREQUENCY MODULATION (1)
FULL DOCUMENT RECOGNITION (1)
FULL-COVARIANCE GAUSSIAN MODELS (1)
FULL-COVARIANCE MODELS (1)
HANDWRITING LINE MODEL (1)
HANDWRITTEN DOCUMENTS (1)
HYBRID HMM-ANN PHONEME RECOGNIZER (1)
INFORMATION EXTRACTION MODEL (1)
IRRELEVANT INFORMATION REPRESENTATION (1)
ISOLATED TEXT LINES (1)
KEYWORD SPOTTING (1)
LARGE VOCABULARY SPEECH RECOGNITION (1)
LIKELIHOOD FUNCTION (1)
LINEAR ALGEBRA (1)
LITERATURE (1)
MAXIMUM LIKELIHOOD ESTIMATION (1)
MODULATION (1)
MODULATION FREQUENCY COMPONENTS (1)
MODULATION FREQUENCY FEATURES (1)
MULTILINGUAL ACOUSTIC MODELING (1)
NATURAL LANGUAGE PROCESSING (1)
NEURAL NETS (1)
NUMERICAL MODELS (1)
OPTIMIZATION METHODS (1)
PHONEME POSTERIOR LEVEL (1)
PHONEME RECOGNITION (1)
POSTAL SERVICES (1)
PRETRANSFORMED PARAMETER SPACE (1)
REGRESSION ANALYSIS (1)
SEGMENTAL CONDITIONAL RANDOM FIELD (1)
SHORT-TERM SPECTRAL ENVELOPE (1)
SPEAKER ADAPTATION (1)
SPECTRAL ENVELOPE AND MODULATION FREQUENCY FEATURES (1)
SPEECH ESTIMATION (1)
STATISTICAL ANALYSIS (1)
STATISTICAL SHALLOW PARSING (1)
SUB-BAND TEMPORAL ENVELOPES (1)
SUBSPACE GAUSSIAN MIXTURE MODEL (1)
SUBSPACE MEAN MODEL (1)
SUBSPACE PRECISION (1)
TEXT LINE SHALLOW PARSING (1)
TIMIT DATABASE (1)
TRAINING DATA (1)
TRANSFORMS (1)
UNCONSTRAINED HANDWRITTEN DOCUMENTS (1)
UNIT MATRIX (1)
VOCABULARY (1)
more

INFONA - science communication portal

Search results for: S. Thomas

Speech recognitionwith segmental conditional random fields: A summary of the JHU CLSP 2010 Summer Workshop

Alpha-Numerical Sequences Extraction in Handwritten Documents

An Information Extraction Model for Unconstrained Handwritten Documents

A novel estimation of feature-space MLLR for full-covariance models

Multilingual acoustic modeling for speech recognition based on subspace Gaussian Mixture Models

Phoneme recognition using spectral envelope and modulation frequency features

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options