Search results for: M. Hasegawa

Items from 1 to 7 out of 7 results

chapter

Toward robust learning of the Gaussian mixture state emission densities for hidden Markov models

Hao Tang, M Hasegawa-Johnson, T S Huang

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 5242 - 5245

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

One important class of state emission densities of the hiddenMarkov model (HMM) is the Gaussian mixture densities. The classical Baum-Welch algorithm often fails to reliably learn the Gaussian mixture densities when there is insufficient training data, due to the large number of free parameters present in the model. In this paper, we propose a novel strategy for robustly and accurately learning the...

chapter

Kernel metric learning for phonetic classification

Jui-Ting Huang, Xi Zhou, M. Hasegawa-Johnson, T. Huang

2009 IEEE Workshop on Automatic Speech Recognition&Understanding > 141 - 145

2009 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU 2009)

While a sound spoken is described by a handful of frame-level spectral vectors, not all frames have equal contribution for either human perception or machine classification. In this paper, we introduce a novel framework to automatically emphasize important speech frames relevant to phonetic information. We jointly learn the importance of speech frames by a distance metric across the phone classes,...

chapter

Emotion recognition from speech VIA boosted Gaussian mixture models

Hao Tang, S.M. Chu, M. Hasegawa-Johnson, T.S. Huang

2009 IEEE International Conference on Multimedia and Expo > 294 - 297

2009 IEEE International Conference on Multimedia and Expo (ICME)

Gaussian mixture models (GMMs) and the minimum error rate classifier (i.e. Bayesian optimal classifier) are popular and effective tools for speech emotion recognition. Typically, GMMs are used to model the class-conditional distributions of acoustic features and their parameters are estimated by the expectation maximization (EM) algorithm based on a training data set. Then, classification is performed...

chapter

Face age estimation using patch-based hidden Markov model supervectors

Xiaodan Zhuang, Xi Zhou, M. Hasegawa-Johnson, T. Huang

2008 19th International Conference on Pattern Recognition > 1 - 4

ICPR 2008 19th International Conference on Pattern Recognition

Recent studies in patch-based Gaussian Mixture Model (GMM) approaches for face age estimation present promising results. We propose using a hidden Markov model (HMM) supervector to represent face image patches, to improve from the previous GMM supervector approach by capturing the spatial structure of human faces and loosening the assumption of identical face patch distribution within a face image...

chapter

Feature analysis and selection for acoustic event detection

Xiaodan Zhuang, Xi Zhou, T.S. Huang, M. Hasegawa-Johnson

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 17 - 20

ICASSP 2008. IEEE International Conference on Acoustic, Speech and Signal Processes

Speech perceptual features, such as Mel-frequency Cepstral Coefficients (MFCC), have been widely used in acoustic event detection. However, the different spectral structures between speech and acoustic events degrade the performance of the speech feature sets. We propose quantifying the discriminative capability of each feature component according to the approximated Bayesian accuracy and deriving...

chapter

Articulatory Feature-Based Methods for Acoustic and Audio-Visual Speech Recognition: Summary from the 2006 JHU Summer workshop

K. Livescu, O. Cetin, M. Hasegawa-Johnson, S. King, more

2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '7 > 4 > IV-621 - IV-624

2007 IEEE International Conference on Acoustics, Speech, and Signal Processing

We report on investigations, conducted at the 2006 Johns Hopkins Workshop, into the use of articulatory features (AFs) for observation and pronunciation models in speech recognition. In the area of observation modeling, we use the outputs of AF classifiers both directly, in an extension of hybrid HMM/neural network models, and as part of the observation vector, an extension of the "tandem"...

chapter

Hmm-Based and Svm-Based Recognition of the Speech of Talkers With Spastic Dysarthria

M. Hasegawa-Johnson, J. Gunderson, A. Penman, T. Huang

2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings > 3 > III

2006 IEEE International Conference on Acoustics, Speech, and Signal Processing

This paper studies the speech of three talkers with spastic dysarthria caused by cerebral palsy. All three subjects share the symptom of low intelligibility, but causes differ. First, all subjects tend to reduce or delete word-initial consonants; one subject deletes all consonants. Second, one subject exhibits a painstaking stutter. Two algorithms were used to develop automatic isolated digit recognition...

Filter options

Keywords:
HIDDEN MARKOV MODELS

Publication date

Set your own date range

Keywords

SPEECH RECOGNITION (5)
BAYES METHODS (3)
SPEECH (3)
BOOSTING (2)
EMOTION RECOGNITION (2)
EQUATIONS (2)
ESTIMATION (2)
FEATURE EXTRACTION (2)
GAUSSIAN PROCESSES (2)
HMM (2)
LEARNING (ARTIFICIAL INTELLIGENCE) (2)
SPEECH EMOTION RECOGNITION (2)
SPEECH PROCESSING (2)
TRAINING (2)
2006 JHU SUMMER WORKSHOP (1)
ACOUSTIC (1)
ACOUSTIC EVENT DETECTION (1)
ACOUSTIC FEATURE (1)
ACOUSTIC SIGNAL DETECTION (1)
ACOUSTIC SIGNAL PROCESSING (1)
ACOUSTICS (1)
ADAPTATION MODEL (1)
APPROXIMATED BAYESIAN ACCURACY (1)
ARTICULATORY FEATURE-BASED METHODS (1)
AUDIO-VISUAL RECOGNITION (1)
AUDIO-VISUAL SPEECH RECOGNITION (1)
AUTOMATIC ISOLATED DIGIT RECOGNITION SYSTEMS (1)
BAUM-WELCH ALGORITHM (1)
BAYESIAN ACCURACY (1)
BAYESIAN OPTIMAL CLASSIFIER (1)
BOOSTED GAUSSIAN MIXTURE MODEL (1)
BOOSTING BAUM-WELCH ALGORITHM (1)
CEPSTRAL ANALYSIS (1)
CLASS-CONDITIONAL DISTRIBUTION (1)
CUAVE AUDIO-VISUAL DIGITS CORPUS (1)
DISTANCE MEASUREMENT (1)
DYNAMIC BAYESIAN NETWORKS (1)
EM ALGORITHM (1)
EM-GMM ALGORITHM (1)
ENSEMBLE LEARNING (1)
EUCLIDEAN DISTANCE (1)
EXPECTATION MAXIMIZATION ALGORITHM (1)
EXPECTATION-MAXIMISATION ALGORITHM (1)
FACE (1)
FACE AGE ESTIMATION (1)
FACE RECOGNITION (1)
FEATURE ANALYSIS (1)
FEATURE SELECTION (1)
FEATURE-LEVEL MANUAL TRANSCRIPTIONS (1)
GAUSSIAN DISTRIBUTION (1)
GAUSSIAN MIXTURE DENSITY (1)
GAUSSIAN MIXTURE MODEL (1)
GAUSSIAN MIXTURE STATE EMISSION DENSITY (1)
GRADIENT DESCENT SEARCH (1)
GRADIENT METHODS (1)
HIDDEN MARKOV MODEL (1)
HUMANS (1)
IDENTICAL FACE PATCH DISTRIBUTION (1)
IMAGE REPRESENTATION (1)
IMPLICIT UNSUPERVISED ALIGNMENT (1)
JOINTS (1)
KERNEL (1)
KERNEL METRIC LEARNING (1)
KULLBACK-LEIBLER DIVERGENCE APPROXIMATION (1)
MARKOV PROCESSES (1)
MATHEMATICAL MODEL (1)
MINIMUM ERROR RATE CLASSIFIER (1)
NEURAL NETS (1)
NEURAL NETWORK MODELS (1)
PAINSTAKING STUTTER (1)
PATCH-BASED GAUSSIAN MIXTURE MODEL (1)
PATCH-BASED HIDDEN MARKOV MODEL SUPERVECTOR (1)
PHONETIC CLASSIFICATION (1)
PROBABILITY DENSITY ESTIMATION (1)
PRONUNCIATION MODELS (1)
SIGNAL CLASSIFICATION (1)
SMALL-VOCABULARY SWITCHBOARD (1)
SOFT SYNCHRONY CONSTRAINTS (1)
SPASTIC DYSARTHRIA (1)
SPATIAL HUMAN FACE STRUCTURE (1)
SPECTRAL STRUCTURE (1)
SPEECH FRAMES EMPHASIS (1)
SPEECH RECOGNITION FRAMEWORK (1)
STATISTICAL ANALYSIS (1)
STATISTICAL MODELS (1)
SUPPORT VECTOR MACHINES (1)
SVM-BASED RECOGNITION (1)
TRAINING DATA SET (1)
more

INFONA - science communication portal

Search results for: M. Hasegawa

Toward robust learning of the Gaussian mixture state emission densities for hidden Markov models

Kernel metric learning for phonetic classification

Emotion recognition from speech VIA boosted Gaussian mixture models

Face age estimation using patch-based hidden Markov model supervectors

Feature analysis and selection for acoustic event detection

Articulatory Feature-Based Methods for Acoustic and Audio-Visual Speech Recognition: Summary from the 2006 JHU Summer workshop

Hmm-Based and Svm-Based Recognition of the Speech of Talkers With Spastic Dysarthria

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options