Wyniki wyszukiwania dla: Luciana Ferrer

Pozycje od 1 do 6 spośród 6 wyników

artykuł

Study of Senone-Based Deep Neural Network Approaches for Spoken Language Recognition

Luciana Ferrer, Yun Lei, Mitchell McLaren, Nicolas Scheffer

IEEE/ACM Transactions on Audio, Speech, and Language Processing > 2016 > 24 > 1 > 105 - 116

This paper compares different approaches for using deep neural networks (DNNs) trained to predict senone posteriors for the task of spoken language recognition (SLR). These approaches have recently been found to outperform various baseline systems on different datasets, but they have not yet been compared to each other or to a common baseline. Two of these approaches use the DNNs to generate feature...

rozdział

Advances in deep neural network approaches to speaker recognition

Mitchell McLaren, Yun Lei, Luciana Ferrer

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4814 - 4818

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

The recent application of deep neural networks (DNN) to speaker identification (SID) has resulted in significant improvements over current state-of-the-art on telephone speech. In this work, we report a similar achievement in DNN-based SID performance on microphone speech. We consider two approaches to DNN-based SID: one that uses the DNN to extract features, and another that uses the DNN during feature...

rozdział

iVector-based prosodic system for language identification

David Martinez, Lukas Burget, Luciana Ferrer, Nicolas Scheffer

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4861 - 4864

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

Prosody is the part of speech where rhythm, stress, and intonation are reflected. In language identification tasks, these characteristics are assumed to be language dependent, and thus the language can be identified from them. In this paper, an automatic language recognition system that extracts prosody information from speech and makes decisions about the language with a generative classifier based...

rozdział

Recent progress in prosodic speaker verification

Marcel Kockmann, Luciana Ferrer, Lukas Burget, Elizabeth Shriberg, więcej

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4556 - 4559

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We describe recent progress in the field of prosodic modeling for speaker verification. In a previous paper, we proposed a technique for modeling syllable-based prosodic features that uses a multinomial subspace model for feature extraction and within-class covariance normalization or linear discriminant analysis for session variability compensation. In this paper, we show that performance can be...

rozdział

Acoustic front-end optimization for bird species recognition

Martin Graciarena, Michelle Delplanche, Elizabeth Shriberg, Andreas Stolcke, więcej

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 293 - 296

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

The goal of this work was to explore the optimization of the feature extraction module (front-end) parameters to improve bird species recognition. We explored optimizing the spectral and temporal parameters of a Mel cepstrum feature-based front-end, starting from common parameter values used in speech processing experiments. These features were modeled using a Gaussian mixture model (GMM) system....

rozdział

A comparison of approaches for modeling prosodic features in speaker recognition

Luciana Ferrer, Nicolas Scheffer, Elizabeth Shriberg

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 4414 - 4417

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

Prosodic information has been successfully used for speaker recognition for more than a decade. The best-performing prosodic system to date has been one based on features extracted over syllables obtained automatically from speech recognition output. The features are then transformed using a Fisher kernel, and speaker models are trained using support vector machines (SVMs). Recently, a simpler version...

Opcje filtrowania

Słowa kluczowe:
FEATURE EXTRACTION

Data publikacji

Ustaw własny zakres dat

Typ publikacji

książka (5)
artykuł (1)

Słowa kluczowe

SPEECH (5)
NIST (3)
SPEAKER RECOGNITION (3)
TRAINING (3)
ACOUSTICS (2)
ADAPTATION MODEL (2)
JOINT FACTOR ANALYSIS (2)
NEURAL NETWORKS (2)
POLYNOMIALS (2)
PROSODY (2)
SUPPORT VECTOR MACHINES (2)
ACOUSTIC FRONT-END (1)
ACOUSTIC FRONT-END OPTIMIZATION (1)
ACOUSTIC SIGNAL PROCESSING (1)
ANALYTICAL MODELS (1)
BANDWIDTH (1)
BIRD SPECIES RECOGNITION (1)
BIRDS (1)
BOTTLENECK FEATURES (1)
CALIBRATION (1)
CEPSTRAL ANALYSIS (1)
CHANNEL MISMATCH (1)
DEEP NEURAL NETWORKS (1)
DEEP NEURAL NETWORKS (DNNS) (1)
DETECTION COST FUNCTION (1)
FILTER BANK (1)
FILTER BANK DISTRIBUTION (1)
FILTERING THEORY (1)
FISHER KERNEL (1)
FRONT-END PARAMETER (1)
GAUSSIAN MIXTURE MODEL (1)
GAUSSIAN PROCESSES (1)
GMM SYSTEM (1)
HIDDEN MARKOV MODELS (1)
IVECTOR (1)
IVECTORS (1)
JFA (1)
LANGUAGE IDENTIFICATION (1)
LINEAR FREQUENCY SCALE (1)
MATHEMATICAL MODEL (1)
MEL CEPSTRUM FEATURE (1)
MEL FREQUENCY SCALE (1)
MICROPHONES (1)
MSM (1)
NOISE (1)
NORMALIZATION (1)
OPTIMIZATION (1)
PLDA (1)
PROBABILISTIC LOGIC (1)
PROSODIC FEATURE EXTRACTION MODELLING (1)
PROSODIC SPEAKER VERIFICATION (1)
RATS (1)
SENONES (1)
SNERFS (1)
SPECTRAL BANDWIDTH (1)
SPECTRAL PARAMETER (1)
SPEECH PROCESSING (1)
SPEECH RECOGNITION (1)
SPOKEN LANGUAGE RECOGNITION (SLR) (1)
STANDARDS (1)
SVM (1)
TEMPORAL PARAMETER (1)
ZOOLOGY (1)
więcej

INFONA - portal komunikacji naukowej

Wyniki wyszukiwania dla: Luciana Ferrer

Study of Senone-Based Deep Neural Network Approaches for Spoken Language Recognition

Advances in deep neural network approaches to speaker recognition

iVector-based prosodic system for language identification

Recent progress in prosodic speaker verification

Acoustic front-end optimization for bird species recognition

A comparison of approaches for modeling prosodic features in speaker recognition

Dodaj adresata

Anulowanie wysłania wiadomości

Czy na pewno chcesz anulować wysłanie wiadomości?

Wyślij wiadomość

Opcje filtrowania

Data publikacji

Ustawianie zakresu dat

Podaj zakres dat dla filtrowania wyświetlonych wyników. Możesz podać datę początkową, końcową lub obie daty. Daty możesz wpisać ręcznie lub wybrać za pomocą kalendarza.

Typ publikacji

Słowa kluczowe

Zgłaszanie błędu / nadużycia

Nieudane wysłanie zgłoszenia

Ułatwienia dostępu