Search results for: S. Thomas

Items from 1 to 9 out of 9 results

chapter

Speech recognitionwith segmental conditional random fields: A summary of the JHU CLSP 2010 Summer Workshop

G. Zweig, P. Nguyen, D. Van Compernolle, K. Demuynck, more

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5044 - 5047

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper summarizes the 2010 CLSP Summer Workshop on speech recognition at Johns Hopkins University. The key theme of the workshop was to improve on state-of-the-art speech recognition systems by using Segmental Conditional Random Fields (SCRFs) to integrate multiple types of information. This approach uses a state-of-the-art baseline as a springboard from which to add a suite of novel features...

chapter

Alpha-Numerical Sequences Extraction in Handwritten Documents

S Thomas, Clément Chatelain, L Heutte, T Paquet

2010 12th International Conference on Frontiers in Handwriting Recognition > 232 - 237

2010 12th International Conference on Frontiers in Handwriting Recognition (ICFHR 2010)

In this paper, we introduce an alpha-numerical sequences extraction system (keywords, numerical fields or alpha-numerical sequences) in unconstrained handwritten documents. Contrary to most of the approaches presented in the literature, our system relies on a global handwriting line model describing two kinds of information : i) the relevant information and ii) the irrelevant information represented...

chapter

An Information Extraction Model for Unconstrained Handwritten Documents

S Thomas, C Chatelain, L Heutte, T Paquet

2010 20th International Conference on Pattern Recognition > 3412 - 3415

2010 20th International Conference on Pattern Recognition (ICPR 2010)

In this paper, a new information extraction system by statistical shallow parsing in unconstrained handwritten documents is introduced. Unlike classical approaches found in the literature as keyword spotting or full document recognition, our approach relies on a strong and powerful global handwriting model. A entire text line is considered as an indivisible entity and is modeled with Hidden Markov...

chapter

Robust spectro-temporal features based on autoregressive models of Hilbert envelopes

S Ganapathy, S Thomas, H Hermansky

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 4286 - 4289

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

In this paper, we present a robust spectro-temporal feature extraction technique using autoregressive models (AR) of sub-band Hilbert envelopes. AR models of Hilbert envelopes are derived using frequency domain linear prediction (FDLP). From the sub-band Hilbert envelopes, spectral features are derived by integrating these envelopes in short-term frames and the temporal features are formed by converting...

chapter

Temporal envelope subtraction for robust speech recognition using modulation spectrum

S. Ganapathy, S. Thomas, H. Hermansky

2009 IEEE Workshop on Automatic Speech Recognition&Understanding > 164 - 169

2009 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU 2009)

In this paper, we present a new noise compensation technique for modulation frequency features derived from syllable length segments of subband temporal envelopes. The subband temporal envelopes are estimated using frequency domain linear prediction (FDLP). We propose a technique for noise compensation in FDLP where an estimate of the noise envelope is subtracted from the noisy speech envelope. The...

chapter

Applications of signal analysis using autoregressive models for amplitude modulation

S. Ganapathy, S. Thomas, P. Motlicek, H. Hermansky

2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics > 341 - 344

2009 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA)

Frequency domain linear prediction (FDLP) represents an efficient technique for representing the long-term amplitude modulations (AM) of speech/audio signals using autoregressive models. For the proposed analysis technique, relatively long temporal segments (1000 ms) of the input signal are decomposed into a set of sub-bands. FDLP is applied on each sub-band to model the temporal envelopes. The residual...

chapter

Phoneme recognition using spectral envelope and modulation frequency features

S. Thomas, S. Ganapathy, H. Hermansky

2009 IEEE International Conference on Acoustics, Speech and Signal Processing > 4453 - 4456

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

We present a new feature extraction technique for phoneme recognition that uses short-term spectral envelope and modulation frequency features. These features are derived from sub-band temporal envelopes of speech estimated using frequency domain linear prediction (FDLP). While spectral envelope features are obtained by the short-term integration of the sub-band envelopes, the modulation frequency...

chapter

Preliminary results of lava flow mapping using remote sensing in Piton de la Fournaise, La Réunion island

S. Zarah, V. Nicolas, U. Minoru, S. Thomas, more

2008 Second Workshop on Use of Remote Sensing Techniques for Monitoring Volcanoes and Seismogenic Areas > 1 - 4

2008 Second Workshop on Use of Remote Sensing Techniques for Monitoring Volcanoes and Seismogenic Areas (USEReST)

The use of remote sensing is more and more incontrovertible in volcanic monitoring, especially in INSAR and thermal studies. A comprehensive database of high-resolution multispectral and multitemporal optical satellite imagery exists for Piton de la Fournaise, the active volcano on La Reunion Island. This database, however, remains relatively underexploited in volcanological studies of Piton de la...

article

Recognition of Reverberant Speech Using Frequency Domain Linear Prediction

S. Thomas, S. Ganapathy, H. Hermansky

IEEE Signal Processing Letters > 2008 > 15 > 681 - 684

Performance of a typical automatic speech recognition (ASR) system severely degrades when it encounters speech from reverberant environments. Part of the reason for this degradation is the feature extraction techniques that use analysis windows which are much shorter than typical room impulse responses. We present a feature extraction technique based on modeling temporal envelopes of the speech signal...

Filter options

Keywords:
FEATURE EXTRACTION

Publication date

Set your own date range

Publication type

book (8)
article (1)

Keywords

SPEECH RECOGNITION (6)
FREQUENCY DOMAIN LINEAR PREDICTION (5)
SPEECH (5)
FREQUENCY MODULATION (4)
HIDDEN MARKOV MODELS (4)
PHONEME RECOGNITION (3)
AUTOREGRESSIVE MODELS (2)
DATA MINING (2)
DATABASES (2)
DISCRETE COSINE TRANSFORMS (2)
FREQUENCY DOMAIN LINEAR PREDICTION (FDLP) (2)
HANDWRITING RECOGNITION (2)
HILBERT TRANSFORMS (2)
INFORMATION EXTRACTION (2)
INFORMATION RETRIEVAL (2)
MODULATION FREQUENCY FEATURES (2)
NOISE (2)
SHALLOW PARSING MODEL (2)
ACCURACY (1)
ACOUSTICS (1)
AERIAL PHOTOGRAPHY (1)
ALL-POLE APPROXIMATION (1)
ALPHA NUMERICAL SEQUENCES EXTRACTION (1)
AM-FM DECOMPOSITION (1)
AM-FM DECOMPOSITION TECHNIQUE (1)
AMPLITUDE MODULATION (1)
ANALYSIS WINDOWS (1)
APPROXIMATION THEORY (1)
ASTER (1)
AUDIO CODING (1)
AUTOMATIC MAPPING (1)
AUTOMATIC SPEECH RECOGNITION (1)
AUTOREGRESSIVE PROCESSES (1)
CARTOGRAPHY (1)
CODECS (1)
COMPUTATIONAL MODELING (1)
CONNECTED DIGIT RECOGNITION TASK (1)
COSINE TRANSFORM (1)
CRF (1)
DETECTORS (1)
DOCUMENT HANDLING (1)
FEATURE EXTRACTION TECHNIQUES (1)
FREQUENCY DOMAIN ANALYSIS (1)
FREQUENCY MODULATIONS (1)
FREQUENCY-DOMAIN ANALYSIS (1)
FULL DOCUMENT RECOGNITION (1)
GEOPHYSICS COMPUTING (1)
HANDWRITING LINE MODEL (1)
HANDWRITTEN DOCUMENTS (1)
HIGH-RESOLUTION MULTISPECTRAL-MULTITEMPORAL OPTICAL SATELLITE IMAGERY (1)
HILBERT ENVELOPE (1)
HILBERT ENVELOPES (1)
HYBRID HMM-ANN PHONEME RECOGNIZER (1)
IMAGE SEGMENTATION (1)
INFORMATION EXTRACTION MODEL (1)
INSAR (1)
IRRELEVANT INFORMATION REPRESENTATION (1)
ISOLATED TEXT LINES (1)
KEYWORD SPOTTING (1)
LA REUNION ISLAND (1)
LAVA FLOW CONTOURS EXTRACTION (1)
LAVA FLOW MAPPING (1)
LITERATURE (1)
MATHEMATICAL MODEL (1)
MODULATION (1)
MODULATION FREQUENCY COMPONENTS (1)
MODULATION SPECTRUM (1)
NEURAL NETS (1)
NOISE COMPENSATION TECHNIQUE (1)
NOISE MEASUREMENT (1)
NUMERICAL MODELS (1)
OBJECT-SPECTRAL BASED TECHNIQUES (1)
OPTICAL IMAGING (1)
OPTICAL REFLECTION (1)
OPTICAL SENSORS (1)
PHONEME POSTERIOR LEVEL (1)
PHONEME RECOGNITION TASK (1)
PHOTO INTERPRETATION (1)
PITON DE LA FOURNAISE (1)
PIXEL (1)
POSTAL SERVICES (1)
PREDICTION THEORY (1)
PRINCIPAL COMPONENTS ANALYSIS (1)
RADIOMETRIC CHARACTERISTICS (1)
REMOTE SENSING (1)
REVERBERANT ENVIRONMENTS (1)
REVERBERANT SPEECH (1)
REVERBERANT SPEECH RECOGNITION (1)
REVERBERATION (1)
ROBUST FEATURE EXTRACTION (1)
ROBUST FEATURES FOR SPEECH RECOGNITION (1)
ROBUST SPECTRO-TEMPORAL FEATURES (1)
ROBUST SPEECH RECOGNITION (1)
ROBUSTNESS (1)
ROOM IMPULSE RESPONSE (1)
SEGMENTAL CONDITIONAL RANDOM FIELD (1)
SHORT-TERM SPECTRAL ENVELOPE (1)
SIGNAL ANALYSIS (1)
SPECTRAL ENVELOPE AND MODULATION FREQUENCY FEATURES (1)
more

INFONA - science communication portal

Search results for: S. Thomas

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options