Search results

Items from 1 to 4 out of 4 results

chapter

Speech recognition in unseen and noisy channel conditions

Vikramjit Mitra, Horacio Franco, Chris Bartels, Julien van Hout, more

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5215 - 5219

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Speech recognition in varying background conditions is a challenging problem. Acoustic condition mismatch between training and evaluation data can significantly reduce recognition performance. For mismatched conditions, data-adaptation techniques are typically found to be useful, as they expose the acoustic model to the new data condition(s). Supervised adaptation techniques usually provide substantial...

chapter

Feature selection and model optimization for semi-supervised speaker spotting

Srikanth Raj Chetupalli, Anand Gopalakrishnan, Thippur V. Sreenivas

2016 24th European Signal Processing Conference (EUSIPCO) > 135 - 139

2016 24th European Signal Processing Conference (EUSIPCO)

We explore, experimentally, feature selection and optimization of stochastic model parameters for the problem of speaker spotting. Based on an initially identified segment of speech of a speaker, an iterative model refinement method is developed along with a latent variable mixture model so that segments of the same speaker are identified in a long speech record. It is found that a GMM with moderate...

chapter

Study of fusion strategies and exploiting the combination of MFCC and PNCC features for robust biometric speaker identification

M.T.S. Al-Kaltakchi, W. L. Woo, S.S. Dlay, J. A. Chambers

2016 4th International Conference on Biometrics and Forensics (IWBF) > 1 - 6

2016 4th International Conference on Biometrics and Forensics (IWBF)

In this paper, a new combination of features and normalization methods is investigated for robust biometric speaker identification. Mel Frequency Cepstral Coefficients (MFCC) are efficient for speaker identification in clean speech while Power Normalized Cepstral Coefficients (PNCC) features are robust for noisy environments. Therefore, combining both features together is better than taking each one...

article

Regularized Auto-Associative Neural Networks for Speaker Verification

Sri Garimella, Sri Harish Mallidi, Hynek Hermansky

IEEE Signal Processing Letters > 2012 > 19 > 12 > 841 - 844

Auto-Associative Neural Network (AANN) is a fully connected feed-forward neural network, trained to reconstruct its input at its output through a hidden compression layer. AANNs are used to model speakers in speaker verification, where a speaker-specific AANN model is obtained by adapting (or retraining) the Universal Background Model (UBM) AANN, an AANN trained on multiple held out speakers, using...

Filter options

Data set:
ieee
Keywords:
TRAINING
ADAPTATION MODELS
DATA MODELS
MEL FREQUENCY CEPSTRAL COEFFICIENT

Publication date

Set your own date range

Publication type

book (3)
article (1)

Keywords

SPEECH (3)
FEATURE EXTRACTION (2)
SPEAKER VERIFICATION (2)
ADAPTATION (1)
AUTO-ASSOCIATIVE NEURAL NETWORK (1)
AUTO-ENCODERS (1)
AUTOMATIC SPEECH RECOGNITION (1)
BOTTLENECK FEATURES (1)
CHANNEL- AND NOISE-ROBUST SPEECH RECOGNITION (1)
GAUSSIAN MIXTURE MODEL (1)
GAUSSIAN MIXTURE MODEL (GMM) (1)
HIDDEN MARKOV MODELS (1)
MAXIMUM A POSTERIOR PROBABILITY ADAPTATION (1)
MEL-FREQUENCY CEPSTRAL COEFFICIENTS (MFCCS) (1)
NEURAL NETWORKS (1)
OPTIMIZATION (1)
PRINCIPAL COMPONENT ANALYSIS (1)
REGULARIZATION (1)
ROBUST BIOMETRIC SPEAKER IDENTIFICATION AND ROBUST RECOGNITION (1)
ROBUSTNESS (1)
SCORE FUSION (1)
SPEAKER DIARIZATION (1)
SPEAKER SPOTTING (1)
UNIVERSAL BACKGROUND MODEL (1)
UNSUPERVISED ADAPTATION (1)
VECTORS (1)
more

INFONA - science communication portal

Search results

Speech recognition in unseen and noisy channel conditions

Feature selection and model optimization for semi-supervised speaker spotting

Study of fusion strategies and exploiting the combination of MFCC and PNCC features for robust biometric speaker identification

Regularized Auto-Associative Neural Networks for Speaker Verification

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options