Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on

chapter

Modelling the alternative hypothesis for text-dependent speaker verification

Anthony Larcher, Kong Aik Lee, Bin Ma, Haizhou Li

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 734 - 738

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper describes text-dependent speaker verification as a task involving four classes of trials depending on whether the target speaker or an impostor pronounces the expected pass-phrase or not. These four classes are used to reformulate the log-likelihood ratio traditionally used in text-independent speaker verification. Three formulations of the alternative hypothesis are considered, leading...

chapter

Imposture classification for text-dependent speaker verification

Anthony Larcher, Kong Aik Lee, Bin Ma, Haizhou Li

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 739 - 743

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This work focuses on text-dependent speaker verification, where a user is required to chose and pronounce a customized pass-phrase to get authenticated. In this context, there are three types of impostures: an impostor pronouncing the correct pass-phrase, an impostor pronouncing a wrong pass-phrase and the most difficult one: an impostor playing back a recording of the target speaker pronouncing a...

chapter

Evasion and obfuscation in automatic speaker verification

Federico Alegre, Giovanni Soldi, Nicholas Evans

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 749 - 753

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

The potential for biometric systems to be manipulated through some form of subversion is well acknowledged. One such approach known as spoofing relates to the provocation of false accepts in authentication applications. Another approach referred to as obfuscation relates to the provocation of missed detections in surveillance applications. While the automatic speaker verification research community...

chapter

Speaker verification using kernel-based binary classifiers with binary operation derived features

Hung-Shin Lee, Yu Tso, Yun-Fan Chang, Hsin-Min Wang, more

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1660 - 1664

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper, we study the use of two kinds of kernel-based discriminative models, namely support vector machine (SVM) and deep neural network (DNN), for speaker verification. We treat the verification task as a binary classification problem, in which a pair of two utterances, each represented by an i-vector, is assumed to belong to either the “within-speaker” group or the “between-speaker” group...

chapter

Improving PLDA speaker verification with limited development data

Ahilan Kanagasundaram, David Dean, Sridha Sridharan

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1665 - 1669

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper analyses the probabilistic linear discriminant analysis (PLDA) speaker verification approach with limited development data. This paper investigates the use of the median as the central tendency of a speaker's i-vector representation, and the effectiveness of weighted discriminative techniques on the performance of state-of-the-art length-normalised Gaussian PLDA (GPLDA) speaker verification...

chapter

Constrained discriminative PLDA training for speaker verification

Johan Rohdin, Sangeeta Biswas, Koichi Shinoda

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1670 - 1674

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Many studies have proven the effectiveness of discriminative training for speaker verification based on probabilistic linear discriminative analysis (PLDA) with i-vectors as features. Most of them directly optimize the log-likelihood ratio score function of the PLDA model instead of explicitly train the PLDA model. But this optimization process removes some of the constraints that normally are imposed...

chapter

Bayesian vocal tract model estimates of nasal stops for speaker verification

Ewald Enzinger, Christian H. Kasess

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1685 - 1689

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper we report on speaker verification experiments using branched vocal tract model estimates of alveolar nasal (/n/) stops. While the discriminatory potential of nasal acoustics has long been established, their acoustic properties have so far mostly been characterized using spectral features. Here, we used a Bayesian estimation technique to obtain reflection coefficients of a branched-tube...

chapter

Speaker verification based processing for robust ASR in co-channel speech scenarios

Seyed Omid Sadjadi, Larry P. Heck

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1774 - 1778

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Co-channel speech, which occurs in monaural audio recordings of two or more overlapping talkers, poses a great challenge for automatic speech applications. Automatic speech recognition (ASR) performance, in particular, has been shown to degrade significantly in the presence of a competing talker. In this paper, assuming a known target talker scenario, we present two different masking strategies based...

chapter

Frequency offset correction in single sideband speech for speaker verification

Hua Xing, Philipos C. Loizou, John H.L. Hansen

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4022 - 4026

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Communication system mismatch represents a major influence for loss in speaker recognition performance. While microphone and handset differences have been considered in the NIST SRE, nonlinear communication system differences, such as modulation/demodulation (Mod/DeMod) carrier drift, have yet to be considered. In this study, an algorithm for estimating and correcting Mod/DeMod frequency offsets distortion...

chapter

Deep neural networks for small footprint text-dependent speaker verification

Ehsan Variani, Xin Lei, Erik McDermott, Ignacio Lopez Moreno, more

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4052 - 4056

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper we investigate the use of deep neural networks (DNNs) for a small footprint text-dependent speaker verification task. At development stage, a DNN is trained to classify speakers at the framelevel. During speaker enrollment, the trained DNN is used to extract speaker specific features from the last hidden layer. The average of these speaker features, or d-vector, is taken as the speaker...

INFONA - science communication portal

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Modelling the alternative hypothesis for text-dependent speaker verification

Imposture classification for text-dependent speaker verification

Evasion and obfuscation in automatic speaker verification

Speaker verification using kernel-based binary classifiers with binary operation derived features

Improving PLDA speaker verification with limited development data

Constrained discriminative PLDA training for speaker verification

Bayesian vocal tract model estimates of nasal stops for speaker verification

Speaker verification based processing for robust ASR in co-channel speech scenarios

Frequency offset correction in single sideband speech for speaker verification

Deep neural networks for small footprint text-dependent speaker verification

Filter options

Publication date

Keywords

INFONA - science communication portal

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)