Search results for: M. Hasegawa

Items from 1 to 4 out of 4 results

chapter

Emotion recognition from speech VIA boosted Gaussian mixture models

Hao Tang, S.M. Chu, M. Hasegawa-Johnson, T.S. Huang

2009 IEEE International Conference on Multimedia and Expo > 294 - 297

2009 IEEE International Conference on Multimedia and Expo (ICME)

Gaussian mixture models (GMMs) and the minimum error rate classifier (i.e. Bayesian optimal classifier) are popular and effective tools for speech emotion recognition. Typically, GMMs are used to model the class-conditional distributions of acoustic features and their parameters are estimated by the expectation maximization (EM) algorithm based on a training data set. Then, classification is performed...

chapter

Feature analysis and selection for acoustic event detection

Xiaodan Zhuang, Xi Zhou, T.S. Huang, M. Hasegawa-Johnson

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 17 - 20

ICASSP 2008. IEEE International Conference on Acoustic, Speech and Signal Processes

Speech perceptual features, such as Mel-frequency Cepstral Coefficients (MFCC), have been widely used in acoustic event detection. However, the different spectral structures between speech and acoustic events degrade the performance of the speech feature sets. We propose quantifying the discriminative capability of each feature component according to the approximated Bayesian accuracy and deriving...

chapter

Articulatory Feature-Based Methods for Acoustic and Audio-Visual Speech Recognition: Summary from the 2006 JHU Summer workshop

K. Livescu, O. Cetin, M. Hasegawa-Johnson, S. King, more

2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '7 > 4 > IV-621 - IV-624

2007 IEEE International Conference on Acoustics, Speech, and Signal Processing

We report on investigations, conducted at the 2006 Johns Hopkins Workshop, into the use of articulatory features (AFs) for observation and pronunciation models in speech recognition. In the area of observation modeling, we use the outputs of AF classifiers both directly, in an extension of hybrid HMM/neural network models, and as part of the observation vector, an extension of the "tandem"...

chapter

Generalized Optimal Multi-Microphone Speech Enhancement Using Sequential Minimum Variance Distortionless Response(MVDR) Beamforming and Postfiltering

Lae-Hoon Kim, M. Hasegawa-Johnson, Koeng-Mo Sung

2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings > 3 > III

2006 IEEE International Conference on Acoustics, Speech, and Signal Processing

A theoretical basis for optimal multichannel speech enhancements presented, sufficient, flexible to be used with any assumed statistical model and optimality criterion. Any Bayesian optimal one-channel estimator for speech enhancement can be generalized to the multichannel case as a sequentially constructed minimum variance distortionless response (MVDR) beamformer followed by an optimal one-channel...

Filter options

Keywords:
BAYES METHODS

Publication date

Set your own date range

Keywords

HIDDEN MARKOV MODELS (3)
FEATURE EXTRACTION (2)
SPEECH PROCESSING (2)
SPEECH RECOGNITION (2)
2006 JHU SUMMER WORKSHOP (1)
ACOUSTIC (1)
ACOUSTIC EVENT DETECTION (1)
ACOUSTIC FEATURE (1)
ACOUSTIC SIGNAL DETECTION (1)
ACOUSTIC SIGNAL PROCESSING (1)
ACOUSTICS (1)
APPROXIMATED BAYESIAN ACCURACY (1)
ARRAY SIGNAL PROCESSING (1)
ARTICULATORY FEATURE-BASED METHODS (1)
AUDIO-VISUAL RECOGNITION (1)
AUDIO-VISUAL SPEECH (1)
AUDIO-VISUAL SPEECH RECOGNITION (1)
AVICAR CORPUS (1)
BAYESIAN ACCURACY (1)
BAYESIAN OPTIMAL CLASSIFIER (1)
BAYESIAN OPTIMAL ONE-CHANNEL ESTIMATOR (1)
BOOSTED GAUSSIAN MIXTURE MODEL (1)
BOOSTING (1)
CEPSTRAL ANALYSIS (1)
CLASS-CONDITIONAL DISTRIBUTION (1)
CUAVE AUDIO-VISUAL DIGITS CORPUS (1)
DYNAMIC BAYESIAN NETWORKS (1)
EM ALGORITHM (1)
EM-GMM ALGORITHM (1)
EMOTION RECOGNITION (1)
ERROR STATISTICS (1)
ESTIMATION (1)
EXPECTATION MAXIMIZATION ALGORITHM (1)
EXPECTATION-MAXIMISATION ALGORITHM (1)
FEATURE ANALYSIS (1)
FEATURE SELECTION (1)
FEATURE-LEVEL MANUAL TRANSCRIPTIONS (1)
FILTERING THEORY (1)
GAUSSIAN MIXTURE MODEL (1)
GAUSSIAN PROCESSES (1)
GENERALIZED OPTIMAL MULTI-MICROPHONE SPEECH ENHANCEMENT (1)
HMM (1)
LEARNING (ARTIFICIAL INTELLIGENCE) (1)
LEAST MEAN SQUARES METHODS (1)
LOG-SPECTRAL AMPLITUDE OPTIMALITY CRITERION (1)
MINIMUM ERROR RATE CLASSIFIER (1)
MINIMUM MEAN-SQUARE ERROR OPTIMALITY CRITERION (1)
NEURAL NETS (1)
NEURAL NETWORK MODELS (1)
OPTIMAL MULTICHANNEL SPEECH ENHANCEMENTS (1)
OPTIMAL ONE-CHANNEL POSTFILTER (1)
PRONUNCIATION MODELS (1)
REALISTIC INTER-MICROPHONE NOISE COHERENCE (1)
SEQUENTIAL MINIMUM VARIANCE DISTORTIONLESS RESPONSE BEAMFORMING (1)
SIGNAL CLASSIFICATION (1)
SMALL-VOCABULARY SWITCHBOARD (1)
SOFT SYNCHRONY CONSTRAINTS (1)
SPECTRAL STRUCTURE (1)
SPEECH (1)
SPEECH EMOTION RECOGNITION (1)
SPEECH ENHANCEMENT (1)
STATISTICAL ANALYSIS (1)
STATISTICAL MODEL (1)
TRAINING DATA SET (1)
WORD ERROR RATE (1)
more

INFONA - science communication portal

Search results for: M. Hasegawa

Emotion recognition from speech VIA boosted Gaussian mixture models

Feature analysis and selection for acoustic event detection

Articulatory Feature-Based Methods for Acoustic and Audio-Visual Speech Recognition: Summary from the 2006 JHU Summer workshop

Generalized Optimal Multi-Microphone Speech Enhancement Using Sequential Minimum Variance Distortionless Response(MVDR) Beamforming and Postfiltering

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options