Search results

Items from 1 to 5 out of 5 results

chapter

Probabilistic non-intrusive quality assessment of speech for bounded-scale preference scores

Petko N Petkov, W Bastiaan Kleijn

2010 Second International Workshop on Quality of Multimedia Experience (QoMEX) > 188 - 193

2010 Second International Workshop on Quality of Multimedia Experience (QoMEX 2010)

We propose a probabilistic, non-intrusive method for quality assessment of speech that takes into consideration the bounded character of the preference scores. The quality ratings are modeled as iid Beta random variables, whose mean and precision are parametrized directly in terms of the signal features. Maximum likelihood estimation is used to learn the model parameters in view of a training database...
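
As a rough illustration of the idea in this abstract, a Beta likelihood whose mean and precision depend on signal features can be sketched as below. This is only a sketch, not the authors' implementation: the logistic link for the mean, the log link for the precision, the function name `beta_log_likelihood`, and the weight vectors `w_mean` and `w_prec` are all assumptions, and the scores are assumed to have been rescaled to the open interval (0, 1).

```python
import math

def beta_log_likelihood(scores, features, w_mean, w_prec):
    """Log-likelihood of scores in (0, 1) under a Beta model whose mean and
    precision depend on the features (both link functions are assumptions)."""
    total = 0.0
    for y, x in zip(scores, features):
        # Assumed logistic link keeps the mean inside (0, 1).
        mu = 1.0 / (1.0 + math.exp(-sum(w * xi for w, xi in zip(w_mean, x))))
        # Assumed log link keeps the precision positive.
        phi = math.exp(sum(w * xi for w, xi in zip(w_prec, x)))
        # Mean/precision form mapped to the usual Beta shape parameters.
        a, b = mu * phi, (1.0 - mu) * phi
        total += (math.lgamma(a + b) - math.lgamma(a) - math.lgamma(b)
                  + (a - 1.0) * math.log(y) + (b - 1.0) * math.log(1.0 - y))
    return total
```

Maximum likelihood estimation would then amount to choosing `w_mean` and `w_prec` to maximize this quantity over a training database, for example with a general-purpose numerical optimizer.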

chapter

Fast approach to speaker identification for large population using MLLR and sufficient statistics

A.K. Sarkar, S.P. Rath, S. Umesh

2010 National Conference On Communications (NCC) > 1 - 5

2010 National Conference on Communications (NCC 2010)

In speaker identification, most of the computational processing time is spent calculating the likelihood of the unknown speaker's test utterance with respect to the speaker models in the database. When the number of speakers in the database is on the order of 10,000 or more, the computational complexity becomes very high. In this paper, we propose a Maximum Likelihood Linear Regression (MLLR)...

chapter

Evaluating vowel pronunciation quality: Formant space matching versus ASR confidence scoring

A. Patil, C. Gupta, P. Rao

2010 National Conference On Communications (NCC) > 1 - 5

2010 National Conference on Communications (NCC 2010)

Quantitative evaluation of the quality of a speaker's pronunciation of the vowels of a language can contribute to the important task of speaker accent detection. Our aim is to qualitatively and quantitatively distinguish between native and non-native speakers of a language on the basis of a comparative study of two analysis methods. One deals with relative positions of their vowels in formant (F1-F2)...

chapter

Speech emotion recognition via a max-margin framework incorporating a loss function based on the Watson and Tellegen's emotion model

Sungrack Yun, C.D. Yoo

2009 IEEE International Conference on Acoustics, Speech and Signal Processing > 4169 - 4172

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

This paper considers a method for speech emotion recognition by a max-margin framework incorporating a loss function based on a well-known model called the Watson and Tellegen's emotion model. Each emotion is modeled by a single-state hidden Markov model (HMM) that is trained by maximizing the minimum separation margin between emotions, and the margin is scaled by a loss function. The framework is...

chapter

A New Method for Discriminative Model Combination in Speech Recognition

Wu Yahui, Liu Gang, Guo Jun

2008 International Conference on Computational Intelligence and Security > 1 > 200 - 203

2008 International Conference on Computational Intelligence and Security

A new method based on discriminative model combination for acoustic model training is proposed. The MPE trained model and the MMIE trained model are used for model combination. The combination criterion is based on the ratio of the inter-variance to the intra-variance of each model. Besides, we also propose a "cluster" method for the model to choose its confusion models in order to get the...

Filter options

Keywords:
DATABASES
COMPUTATIONAL MODELING
SPEECH
MAXIMUM LIKELIHOOD ESTIMATION

Keywords

HIDDEN MARKOV MODELS (3)
SPEECH RECOGNITION (3)
COMPUTATIONAL COMPLEXITY (2)
ACCENT DETECTION (1)
ACOUSTIC MODEL TRAINING (1)
ACOUSTICS (1)
ADAPTATION MODEL (1)
ASR CONFIDENCE SCORING (1)
AUTOMATIC SPEECH RECOGNITION (1)
BAND-BASED FEATURES (1)
BETA REGRESSION (1)
BIOLOGICAL SYSTEM MODELING (1)
BOUNDED-SCALE PREFERENCE SCORES (1)
CLUSTER METHOD (1)
COMPLEXITY THEORY (1)
COMPUTATIONAL PROCESSING TIME (1)
CORRELATION (1)
CORRELATION METHODS (1)
DATA MODELS (1)
DISCRIMINATIVE MODEL (1)
EMOTION RECOGNITION (1)
EMOTIONAL SPEECH (1)
FORMANT SPACE MATCHING (1)
FORMANTS (1)
HIDDEN MARKOV MODEL (1)
IID BETA RANDOM VARIABLES (1)
LOG LIKELIHOOD SCORE (1)
MATRIX ALGEBRA (1)
MAX-MARGIN FRAMEWORK (1)
MAXIMUM LIKELIHOOD (1)
MAXIMUM LIKELIHOOD LINEAR REGRESSION (1)
MAXIMUM MUTUAL INFORMATION ESTIMATION (1)
MEAN SQUARE ERROR METHODS (1)
MLLR MATRIX (1)
MMIE (1)
MODEL COMBINATION (1)
MPE (1)
NIST (1)
NON-INTRUSIVE QUALITY ASSESSMENT (1)
PRONUNCIATION (1)
REGRESSION ANALYSIS (1)
ROOT MEAN SQUARE ERROR (1)
SEMIDEFINITE PROGRAMMING (1)
SPEAKER ACCENT DETECTION (1)
SPEAKER IDENTIFICATION (1)
SPEAKER RECOGNITION (1)
SPEECH EMOTION RECOGNITION (1)
SPEECH PROBABILISTIC NON-INTRUSIVE QUALITY ASSESSMENT (1)
SPEECH PROCESSING (1)
SPEECH TRAINING (1)
SUFFICIENT STATISTICS (1)
TELLEGEN EMOTION MODEL (1)
TRAINING DATABASE (1)
VOWEL PRONUNCIATION QUALITY (1)
WATSON AND TELLEGEN'S EMOTION MODEL (1)

INFONA - science communication portal
