Search results for: Luciana Ferrer

Items from 1 to 6 out of 6 results

article

Study of Senone-Based Deep Neural Network Approaches for Spoken Language Recognition

Luciana Ferrer, Yun Lei, Mitchell McLaren, Nicolas Scheffer

IEEE/ACM Transactions on Audio, Speech, and Language Processing > 2016 > 24 > 1 > 105 - 116

This paper compares different approaches for using deep neural networks (DNNs) trained to predict senone posteriors for the task of spoken language recognition (SLR). These approaches have recently been found to outperform various baseline systems on different datasets, but they have not yet been compared to each other or to a common baseline. Two of these approaches use the DNNs to generate feature...

chapter

Advances in deep neural network approaches to speaker recognition

Mitchell McLaren, Yun Lei, Luciana Ferrer

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4814 - 4818

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

The recent application of deep neural networks (DNN) to speaker identification (SID) has resulted in significant improvements over current state-of-the-art on telephone speech. In this work, we report a similar achievement in DNN-based SID performance on microphone speech. We consider two approaches to DNN-based SID: one that uses the DNN to extract features, and another that uses the DNN during feature...

chapter

iVector-based prosodic system for language identification

David Martinez, Lukas Burget, Luciana Ferrer, Nicolas Scheffer

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4861 - 4864

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

Prosody is the part of speech where rhythm, stress, and intonation are reflected. In language identification tasks, these characteristics are assumed to be language dependent, and thus the language can be identified from them. In this paper, an automatic language recognition system that extracts prosody information from speech and makes decisions about the language with a generative classifier based...

chapter

Towards noise-robust speaker recognition using probabilistic linear discriminant analysis

Yun Lei, Lukas Burget, Luciana Ferrer, Martin Graciarena, more

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4253 - 4256

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

This work addresses the problem of speaker verification where additive noise is present in the enrollment and testing utterances. We show how the current state-of-the-art framework can be effectively used to mitigate this effect. We first look at the degradation a standard speaker verification system is subjected to when presented with noisy speech waveforms. We designed and generated a corpus with...

chapter

Acoustic front-end optimization for bird species recognition

Martin Graciarena, Michelle Delplanche, Elizabeth Shriberg, Andreas Stolcke, more

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 293 - 296

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

The goal of this work was to explore the optimization of the feature extraction module (front-end) parameters to improve bird species recognition. We explored optimizing the spectral and temporal parameters of a Mel cepstrum feature-based front-end, starting from common parameter values used in speech processing experiments. These features were modeled using a Gaussian mixture model (GMM) system....

chapter

A comparison of approaches for modeling prosodic features in speaker recognition

Luciana Ferrer, Nicolas Scheffer, Elizabeth Shriberg

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 4414 - 4417

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

Prosodic information has been successfully used for speaker recognition for more than a decade. The best-performing prosodic system to date has been one based on features extracted over syllables obtained automatically from speech recognition output. The features are then transformed using a Fisher kernel, and speaker models are trained using support vector machines (SVMs). Recently, a simpler version...

Filter options

Keywords:
SPEECH

Publication date

Set your own date range

Publication type

book (5)
article (1)

Keywords

FEATURE EXTRACTION (5)
TRAINING (4)
NIST (3)
SPEAKER RECOGNITION (3)
ADAPTATION MODEL (2)
JOINT FACTOR ANALYSIS (2)
NEURAL NETWORKS (2)
NOISE (2)
POLYNOMIALS (2)
PROSODY (2)
ACOUSTIC FRONT-END (1)
ACOUSTIC FRONT-END OPTIMIZATION (1)
ACOUSTIC SIGNAL PROCESSING (1)
ACOUSTICS (1)
BANDWIDTH (1)
BIRD SPECIES RECOGNITION (1)
BIRDS (1)
BOTTLENECK FEATURES (1)
CALIBRATION (1)
CEPSTRAL ANALYSIS (1)
CHANNEL MISMATCH (1)
DEEP NEURAL NETWORKS (1)
DEEP NEURAL NETWORKS (DNNS) (1)
DEGRADATION (1)
DETECTION COST FUNCTION (1)
FILTER BANK (1)
FILTER BANK DISTRIBUTION (1)
FILTERING THEORY (1)
FISHER KERNEL (1)
FRONT-END PARAMETER (1)
GAUSSIAN MIXTURE MODEL (1)
GAUSSIAN PROCESSES (1)
GMM SYSTEM (1)
HIDDEN MARKOV MODELS (1)
I-VECTOR (1)
IVECTORS (1)
JFA (1)
LANGUAGE IDENTIFICATION (1)
LINEAR FREQUENCY SCALE (1)
MATHEMATICAL MODEL (1)
MEL CEPSTRUM FEATURE (1)
MEL FREQUENCY SCALE (1)
MICROPHONES (1)
NOISE MEASUREMENT (1)
NORMALIZATION (1)
OPTIMIZATION (1)
PLDA (1)
PROSODIC FEATURE EXTRACTION MODELLING (1)
RATS (1)
ROBUSTNESS (1)
SENONES (1)
SIGNAL TO NOISE RATIO (1)
SPECTRAL BANDWIDTH (1)
SPECTRAL PARAMETER (1)
SPEECH PROCESSING (1)
SPEECH RECOGNITION (1)
SPOKEN LANGUAGE RECOGNITION (SLR) (1)
STANDARDS (1)
SUPPORT VECTOR MACHINES (1)
SVM (1)
TEMPORAL PARAMETER (1)
ZOOLOGY (1)
more

INFONA - science communication portal

Search results for: Luciana Ferrer

Study of Senone-Based Deep Neural Network Approaches for Spoken Language Recognition

Advances in deep neural network approaches to speaker recognition

iVector-based prosodic system for language identification

Towards noise-robust speaker recognition using probabilistic linear discriminant analysis

Acoustic front-end optimization for bird species recognition

A comparison of approaches for modeling prosodic features in speaker recognition

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options