Search results for: O. Glembek

Items from 1 to 9 out of 9 results

chapter

Developing a speaker identification system for the DARPA RATS project

Oldrich Plchot, Spyros Matsoukas, Pavel Matejka, Najim Dehak, more

2013 IEEE International Conference on Acoustics, Speech and Signal Processing > 6768 - 6772

ICASSP 2013 - 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper describes the speaker identification (SID) system developed by the Patrol team for the first phase of the DARPA RATS (Robust Automatic Transcription of Speech) program, which seeks to advance state of the art detection capabilities on audio from highly degraded communication channels. We present results using multiple SID systems differing mainly in the algorithm used for voice activity...

chapter

Approaches to automatic lexicon learning with limited training examples

N Goel, S Thomas, M Agarwal, P Akyazi, more

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 5094 - 5097

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

Preparation of a lexicon for speech recognition systems can be a significant effort in languages where the written form is not exactly phonetic. On the other hand, in languages where the written form is quite phonetic, some common words are often mispronounced. In this paper, we use a combination of lexicon learning techniques to explore whether a lexicon can be learned when only a small lexicon is...

chapter

Multilingual acoustic modeling for speech recognition based on subspace Gaussian Mixture Models

L Burget, P Schwarz, M Agarwal, P Akyazi, more

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 4334 - 4337

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

Although research has previously been done on multilingual speech recognition, it has been found to be very difficult to improve over separately trained systems. The usual approach has been to use some kind of “universal phone set” that covers multiple languages. We report experiments on a different approach to multilingual speech recognition, in which the phone sets are entirely distinct but the...

chapter

Comparison of scoring methods used in speaker recognition with Joint Factor Analysis

O. Glembek, L. Burget, N. Dehak, N. Brummer, more

2009 IEEE International Conference on Acoustics, Speech and Signal Processing > 4057 - 4060

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

The aim of this paper is to compare different log-likelihood scoring methods, that different sites used in the latest state-of-the-art joint factor analysis (JFA) speaker recognition systems. The algorithms use various assumptions and have been derived from various approximations of the objective functions of JFA. We compare the techniques in terms of speed and performance. We show, that approximations...

chapter

Support vector machines and Joint Factor Analysis for speaker verification

N. Dehak, P. Kenny, R. Dehak, O. Glembek, more

2009 IEEE International Conference on Acoustics, Speech and Signal Processing > 4237 - 4240

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

This article presents several techniques to combine between support vector machines (SVM) and joint factor analysis (JFA) model for speaker verification. In this combination, the SVMs are applied to different sources of information produced by the JFA. These informations are the Gaussian mixture model supervectors and speakers and common factors. We found that using SVM in JFA factors gave the best...

chapter

Neural network based language models for highly inflective languages

T. Mikolov, J. Kopecky, L. Burget, O. Glembek, more

2009 IEEE International Conference on Acoustics, Speech and Signal Processing > 4725 - 4728

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

Speech recognition of inflectional and morphologically rich languages like Czech is currently quite a challenging task, because simple n-gram techniques are unable to capture important regularities in the data. Several possible solutions were proposed, namely class based models, factored models, decision trees and neural networks. This paper describes improvements obtained in recognition of spoken...

chapter

Morphological random forests for language modeling of inflectional languages

I. Oparin, O. Glembek, L. Burget, J. Cernocky

2008 IEEE Spoken Language Technology Workshop > 189 - 192

2008 IEEE Workshop on Spoken Language Technology. SLT 2008

In this paper, we are concerned with using decision trees (DT) and random forests (RF) in language modeling for Czech LVCSR. We show that the RF approach can be successfully implemented for language modeling of an inflectional language. Performance of word-based and morphological DTs and RFs was evaluated on lecture recognition task. We show that while DTs perform worse than conventional trigram language...

chapter

Search in speech, language identification and speaker recognition in Speech@FIT

J. Cernocky, L. Burget, P. Schwarz, P. Matejka, more

2007 17th International Conference Radioelektronika > 1 - 6

Radioelektronika, 2007. 17th International Conference

This paper describes "search in speech" techniques developed in the Speech@FIT research group at FIT BUT in the last couple of years. It concentrates on spoken term detection (STD) and presents our system for NIST STD 2006 evaluations in detail. It also briefly mentions our systems for speaker and language recognition.

chapter

STBU System for the NIST 2006 Speaker Recognition Evaluation

P. Matejka, L. Burget, P. Schwarz, O. Glembek, more

2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '7 > 4 > IV-221 - IV-224

2007 IEEE International Conference on Acoustics, Speech, and Signal Processing

This paper describes STBU 2006 speaker recognition system, which performed well in the NIST 2006 speaker recognition evaluation. STBU is consortium of 4 partners: Spescom DataVoice (South Africa), TNO (Netherlands), BUT (Czech Republic) and University of Stellenbosch (South Africa). The primary system is a combination of three main kinds of systems: (1) GMM, with short-time MFCC or PLP features, (2)...

Filter options

Publication date

Set your own date range

Keywords

NATURAL LANGUAGE PROCESSING (5)
SPEECH RECOGNITION (5)
SPEAKER RECOGNITION (4)
TRAINING DATA (4)
DATA MODELS (3)
TRAINING (3)
ACCURACY (2)
ACOUSTICS (2)
DECISION TREES (2)
GAUSSIAN PROCESSES (2)
GMM (2)
JOINT FACTOR ANALYSIS (2)
JOINTS (2)
LANGUAGE MODELING (2)
NIST (2)
SMOOTHING METHODS (2)
SPEECH (2)
ACOUSTIC SIGNAL PROCESSING (1)
ADAPTATION MODEL (1)
ANALYTICAL MODELS (1)
APPROXIMATION METHODS (1)
APPROXIMATION THEORY (1)
ARTIFICIAL NEURAL NETWORKS (1)
AUTOMATIC LEXICON LEARNING TECHNIQUE (1)
BOOTSTRAPPING (1)
CHANNEL ESTIMATION (1)
CLASS BASED MODELS (1)
CLASS COVARIANCE NORMALIZATION METHOD (1)
COMPUTATIONAL MODELING (1)
COVARIANCE ANALYSIS (1)
COVARIANCE MATRIX (1)
CZECH LVCSR (1)
DICTIONARIES (1)
EIGENCHANNEL (1)
ENTROPY (1)
FACTORED MODELS (1)
FAST SCORING (1)
GAUSSIAN MIXTURE MODEL SUPERVECTOR (1)
HIDDEN MARKOV MODELS (1)
HISTORY (1)
INFLECTIONAL LANGUAGES (1)
INFLECTIVE LANGUAGES (1)
INTERPOLATION (1)
KERNEL (1)
LANGUAGE IDENTIFICATION (1)
LANGUAGE MODELS (1)
LARGE VOCABULARY SPEECH RECOGNITION (1)
LEARNING (ARTIFICIAL INTELLIGENCE) (1)
LECTURE RECOGNITION TASK (1)
LEXICON LEARNING (1)
LOG-LIKELIHOOD RATIO (1)
LOG-LIKELIHOOD SCORING METHODS (1)
LVCSR (1)
MODIFIED KNESER-NEY SMOOTHING (1)
MORPHOLOGICAL RANDOM FORESTS (1)
MULTILINGUAL ACOUSTIC MODELING (1)
NAP (1)
NEURAL NETS (1)
NEURAL NETWORK (1)
NEURAL NETWORKS (1)
NOISY SPEECH PROCESSING (1)
OBJECTIVE FUNCTIONS (1)
PHONETIC LANGUAGE (1)
RADIO FREQUENCY (1)
REAL TIME SYSTEMS (1)
SPEAKER FACTORS SPACE (1)
SPEAKER IDENTIFICATION (1)
SPEAKER RECOGNITION SYSTEMS (1)
SPEAKER VERIFICATION (1)
SPEECH RECOGNITION SYSTEMS (1)
SPEECH@FIT (1)
SPOKEN TERM DETECTION (1)
SUBSPACE GAUSSIAN MIXTURE MODEL (1)
SUPPORT VECTOR MACHINE (1)
SUPPORT VECTOR MACHINES (1)
SVM (1)
TRIGRAM LANGUAGE MODELS (1)
VOCABULARY (1)
WITHIN CLASS COVARIANCE NORMALIZATION (1)
more

INFONA - science communication portal

Search results for: O. Glembek

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options