Search results for: Deliang Wang

Items from 1 to 5 out of 5 results

chapter

Robust speaker recognition based on DNN/i-vectors and speech separation

Jorge Chang, DeLiang Wang

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5415 - 5419

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Recent research shows that the i-vector framework for speaker recognition can significantly benefit from phonetic information. A common approach is to use a deep neural network (DNN) trained for automatic speech recognition to generate a universal background model (UBM). Studies in this area have been done in relatively clean conditions. However, strong background noise is known to severely reduce...

article

CASA-Based Robust Speaker Identification

Xiaojia Zhao, Yang Shao, DeLiang Wang

IEEE Transactions on Audio, Speech, and Language Processing > 2012 > 20 > 5 > 1608 - 1616

Conventional speaker recognition systems perform poorly under noisy conditions. Inspired by auditory perception, computational auditory scene analysis (CASA) typically segregates speech by producing a binary time–frequency mask. We investigate CASA for robust speaker identification. We first introduce a novel speaker feature, gammatone frequency cepstral coefficient (GFCC), based on an auditory periphery...

chapter

Robust speaker identification using a CASA front-end

Xiaojia Zhao, Yang Shao, DeLiang Wang

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5468 - 5471

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Speaker recognition remains a challenging task under noisy conditions. Inspired by auditory perception, computational auditory scene analysis (CASA) typically segregates speech by producing a binary time-frequency mask. We first show that a recently introduced speaker feature, Gammatone Frequency Cepstral Coefficient, performs substantially better than conventional speaker features under noisy conditions...

chapter

Robust speaker identification using auditory features and computational auditory scene analysis

Yang Shao, DeLiang Wang

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 1589 - 1592

ICASSP 2008. IEEE International Conference on Acoustics, Speech and Signal Processing

The performance of speaker recognition systems drops significantly under noisy conditions. To improve robustness, we have recently proposed novel auditory features and a robust speaker recognition system using a front-end based on computational auditory scene analysis. In this paper, we further study the auditory features by exploring different feature dimensions and incorporating dynamic features...

chapter

Robust Speaker Recognition Using Binary Time-Frequency Masks

Yang Shao, DeLiang Wang

2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings > 1 > I

2006 IEEE International Conference on Acoustics, Speech, and Signal Processing

Conventional speaker recognition systems perform poorly under noisy conditions. In this paper, we evaluate binary time-frequency masks for robust speaker recognition. An ideal binary mask is a priori defined as a binary matrix where 1 indicates that the target is stronger than the interference within the corresponding time-frequency unit and 0 indicates otherwise. We perform speaker identification...
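The ideal binary mask definition in the abstract above can be illustrated with a minimal sketch. The function name `ideal_binary_mask` and the energy values are hypothetical and chosen only for illustration; they are not taken from the paper. The sketch assumes the per-unit energies of the target and the interference are already available as two matrices of equal shape, and it maps ties to 0, following the "stronger than" wording literally.

```python
def ideal_binary_mask(target_energy, interference_energy):
    """Return a binary matrix with 1 where the target energy exceeds the
    interference energy in a time-frequency unit, and 0 otherwise.

    Both arguments are lists of rows (one row per frequency channel, one
    column per time frame). This layout is an assumption for illustration.
    """
    if len(target_energy) != len(interference_energy):
        raise ValueError("inputs must have the same number of rows")
    mask = []
    for t_row, n_row in zip(target_energy, interference_energy):
        if len(t_row) != len(n_row):
            raise ValueError("rows must have the same length")
        # Strict comparison: equal energy counts as "not stronger" (assumption).
        mask.append([1 if t > n else 0 for t, n in zip(t_row, n_row)])
    return mask


# Made-up energies for a 2-channel, 3-frame example.
target = [[4.0, 0.5, 2.0],
          [0.1, 3.0, 1.0]]
interference = [[1.0, 2.0, 2.0],
                [0.4, 0.2, 0.9]]

mask = ideal_binary_mask(target, interference)
print(mask)  # [[1, 0, 0], [0, 1, 1]]
```

Building such a mask requires separate access to the target and interference signals, which is why the abstract describes it as defined a priori.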

Filter options

Keywords:
SPEAKER RECOGNITION

INFONA - science communication portal
