Advanced search

Advanced search in people

From:

To:

Items from 1 to 4 out of 4 results

chapter

Auditory Features Revisited for Robust Speech Recognition

F Kelly, N Harte

2010 20th International Conference on Pattern Recognition > 4456 - 4459

2010 20th International Conference on Pattern Recognition (ICPR 2010)

Auditory based front-ends for speech recognition have been compared before, but this paper focuses on two of the most promising algorithms for noise robustness in automatic speech recognition (ASR). The feature sets are Zero-Crossings with Peak Amplitudes (ZCPA) and the recently introduced Power-Law Nonlinearity and Power-Bias Subtraction (PNCC). Standard Mel-Frequency Cepstral Coefficients (MFCC)...

chapter

Acoustic front-end optimization for bird species recognition

Martin Graciarena, Michelle Delplanche, Elizabeth Shriberg, Andreas Stolcke, more

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 293 - 296

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

The goal of this work was to explore the optimization of the feature extraction module (front-end) parameters to improve bird species recognition. We explored optimizing the spectral and temporal parameters of a Mel cepstrum feature-based front-end, starting from common parameter values used in speech processing experiments. These features were modeled using a Gaussian mixture model (GMM) system....

chapter

An auditory-based feature for robust speech recognition

Yang Shao, Zhaozhang Jin, DeLiang Wang, S. Srinivasan

2009 IEEE International Conference on Acoustics, Speech and Signal Processing > 4625 - 4628

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

A conventional automatic speech recognizer does not perform well in the presence of noise, while human listeners are able to segregate and recognize speech in noisy conditions. We study a novel feature based on an auditory periphery model for robust speech recognition. Specifically, gammatone frequency cepstral coefficients are derived by applying a cepstral analysis on gammatone filterbank responses...

chapter

Emotion Classification of Infant Voice Based on Features Derived from Teager Energy Operator

Hui Gao, Shanguang Chen, Guangchuan Su

2008 Congress on Image and Signal Processing > 5 > 333 - 337

International Congress on Image and Signal Processing (CISP 2008)

To study effective speech features which can represent different emotion styles in infant voice, nonlinear features based on Teager Energy Operator are investigated. Neutral state and 4 emotional states (i.e. happiness, impatience, anger and fear) are classified from the infant voice database. MFCC extraction and HMM-based emotion classification are used as baseline system to evaluate the emotional...

Filter options

Content availability:
Available
Keywords:
FEATURE EXTRACTION
HIDDEN MARKOV MODELS
CEPSTRAL ANALYSIS
FILTERING THEORY

Publication date

Set your own date range

Keywords

MEL FREQUENCY CEPSTRAL COEFFICIENT (3)
ROBUSTNESS (3)
FILTER BANK (2)
NOISE (2)
SPEECH PROCESSING (2)
ACCURACY (1)
ACOUSTIC FEATURE (1)
ACOUSTIC FRONT-END (1)
ACOUSTIC FRONT-END OPTIMIZATION (1)
ACOUSTIC SIGNAL PROCESSING (1)
ACOUSTICS (1)
ADAPTATION MODEL (1)
ANALYTICAL MODELS (1)
ART (1)
ATMOSPHERIC MODELING (1)
AUDIO SIGNAL PROCESSING (1)
AUDITORY BASED FRONT-ENDS (1)
AUDITORY FEATURE (1)
AUDITORY FEATURES (1)
AUDITORY PERIPHERY MODEL (1)
AUDITORY SYSTEM (1)
AUDITORY-BASED FEATURE (1)
AUTOMATIC SPEECH RECOGNITION (1)
BAND PASS FILTERS (1)
BANDWIDTH (1)
BIOMECHANICS (1)
BIRD SPECIES RECOGNITION (1)
BIRDS (1)
CLASSIFICATION (1)
CLASSIFICATION ALGORITHMS (1)
COGNITION (1)
COMPUTATIONAL AUDITORY SCENE ANALYSIS (1)
COMPUTATIONAL AUDITORY SCENE ANALYSIS SYSTEM (1)
CONFERENCES (1)
DATA MINING (1)
DATABASES (1)
DISCRETE COSINE TRANSFORMS (1)
EMOTION (1)
EMOTION RECOGNITION (1)
ENERGY MEASUREMENT (1)
ENERGY RESOLUTION (1)
EQUATIONS (1)
FILTER BANK CONFIGURATION (1)
FILTER BANK DISTRIBUTION (1)
FILTERING ALGORITHMS (1)
FINITE IMPULSE RESPONSE FILTER (1)
FLUID FLOW (1)
FLUIDS (1)
FREQUENCY CONVERSION (1)
FREQUENCY DOMAIN ANALYSIS (1)
FREQUENCY MODULATION (1)
FRONT-END PARAMETER (1)
GAMMATONE FILTERBANK RESPONSE (1)
GAMMATONE FREQUENCY CEPSTRAL COEFFICIENT (1)
GAMMATONE FREQUENCY CEPSTRAL COEFFICIENTS (1)
GAUSSIAN MIXTURE MODEL (1)
GAUSSIAN PROCESSES (1)
GMM SYSTEM (1)
HIDDEN MARKOV MODEL-BASED RECOGNISER (1)
HUMAN COMPUTER INTERACTION (1)
LINEAR FREQUENCY SCALE (1)
MATERIALS (1)
MATHEMATICAL MODEL (1)
MAXIMUM LIKELIHOOD DETECTION (1)
MEL CEPSTRUM FEATURE (1)
MEL FREQUENCY SCALE (1)
MEL-FREQUENCY CEPSTRAL COEFFICIENTS (1)
MONITORING (1)
NOISE ROBUSTNESS (1)
NONLINEAR FILTERS (1)
NUMERICAL SIMULATION (1)
OPTIMIZATION (1)
POWER-BIAS SUBTRACTION (1)
POWER-LAW NONLINEARITY (1)
PRESSES (1)
PRODUCTION (1)
PSYCHOLOGY (1)
RESONANT FREQUENCY (1)
ROBUST SPEECH RECOGNITION (1)
SIGNAL PROCESSING (1)
SIGNAL RESOLUTION (1)
SPECTRAL BANDWIDTH (1)
SPECTRAL PARAMETER (1)
STRESS (1)
SUPPORT VECTOR MACHINE CLASSIFICATION (1)
TEAGER ENERGY OPERATOR (1)
TEMPORAL PARAMETER (1)
TIMIT DATABASE (1)
TIN (1)
TRAINING (1)
TRANSFORMS (1)
WAVELET ANALYSIS (1)
WAVELET PACKETS (1)
WAVELET TRANSFORMS (1)
more

INFONA - science communication portal

Advanced search

Advanced search in people

Auditory Features Revisited for Robust Speech Recognition

Acoustic front-end optimization for bird species recognition

An auditory-based feature for robust speech recognition

Emotion Classification of Infant Voice Based on Features Derived from Teager Energy Operator

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options