Wyniki wyszukiwania dla: Deliang Wang

Pozycje od 1 do 5 spośród 5 wyników

rozdział

Robust speaker recognition based on DNN/i-vectors and speech separation

Jorge Chang, DeLiang Wang

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5415 - 5419

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Recent research shows that the i-vector framework for speaker recognition can significantly benefit from phonetic information. A common approach is to use a deep neural network (DNN) trained for automatic speech recognition to generate a universal background model (UBM). Studies in this area have been done in relatively clean conditions. However, strong background noise is known to severely reduce...

artykuł

CASA-Based Robust Speaker Identification

Xiaojia Zhao, Yang Shao, DeLiang Wang

IEEE Transactions on Audio, Speech, and Language Processing > 2012 > 20 > 5 > 1608 - 1616

Conventional speaker recognition systems perform poorly under noisy conditions. Inspired by auditory perception, computational auditory scene analysis (CASA) typically segregates speech by producing a binary time–frequency mask. We investigate CASA for robust speaker identification. We first introduce a novel speaker feature, gammatone frequency cepstral coefficient (GFCC), based on an auditory periphery...

rozdział

Robust speaker identification using a CASA front-end

Xiaojia Zhao, Yang Shao, DeLiang Wang

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5468 - 5471

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Speaker recognition remains a challenging task under noisy conditions. Inspired by auditory perception, computational auditory scene analysis (CASA) typically segregates speech by producing a binary time-frequency mask. We first show that a recently introduced speaker feature, Gammatone Frequency Cepstral Coefficient, performs substantially better than conventional speaker features under noisy conditions...

rozdział

Robust speaker identification using auditory features and computational auditory scene analysis

Yang Shao, DeLiang Wang

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 1589 - 1592

ICASSP 2008. IEEE International Conference on Acoustic, Speech and Signal Processes

The performance of speaker recognition systems drop significantly under noisy conditions. To improve robustness, we have recently proposed novel auditory features and a robust speaker recognition system using a front-end based on computational auditory scene analysis. In this paper, we further study the auditory features by exploring different feature dimensions and incorporating dynamic features...

rozdział

Robust Speaker Recognition Using Binary Time-Frequency Masks

Yang Shao, DeLiang Wang

2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings > 1 > I

2006 IEEE International Conference on Acoustics, Speech, and Signal Processing

Conventional speaker recognition systems perform poorly under noisy conditions. In this paper, we evaluate binary time-frequency masks for robust speaker recognition. An ideal binary mask is a priori defined as a binary matrix where 1 indicates that the target is stronger than the interference within the corresponding time-frequency unit and 0 indicates otherwise. We perform speaker identification...

Opcje filtrowania

Słowa kluczowe:
SPEAKER RECOGNITION

Data publikacji

Ustaw własny zakres dat

Typ publikacji

książka (4)
artykuł (1)

Słowa kluczowe

FEATURE EXTRACTION (3)
NOISE MEASUREMENT (3)
ROBUST SPEAKER RECOGNITION (3)
SPEECH (3)
CEPSTRAL ANALYSIS (2)
GAMMATONE FREQUENCY CEPSTRAL COEFFICIENT (2)
IDEAL BINARY MASK (2)
ROBUST SPEAKER IDENTIFICATION (2)
ROBUSTNESS (2)
SIGNAL TO NOISE RATIO (2)
AUDITORY FEATURE (1)
AUDITORY FEATURES (1)
BINARY MATRIX (1)
BINARY TIME-FREQUENCY MASKS (1)
CASA (1)
COMPUTATIONAL AUDITORY SCENE ANALYSIS (1)
COMPUTATIONAL AUDITORY SCENE ANALYSIS (CASA) (1)
DATA RECOGNIZER (1)
DEEP NEURAL NETWORKS (1)
FILTER BANKS (1)
GAMMATONE FEATURE (1)
GAMMATONE FREQUENCY CEPSTRAL COEFFICIENT (GFCC) (1)
GFCC (1)
I-VECTOR (1)
MATRIX ALGEBRA (1)
SIGNAL-TO-NOISE CONDITION (1)
SPEAKER IDENTIFICATION (1)
SPECTROGRAM (1)
SPEECH RECOGNITION (1)
SPEECH SEGREGATION SYSTEM (1)
SPEECH SEPARATION (1)
TIME-FREQUENCY ANALYSIS (1)
TIME-FREQUENCY MASKING (1)
więcej

INFONA - portal komunikacji naukowej

Wyniki wyszukiwania dla: Deliang Wang

Robust speaker recognition based on DNN/i-vectors and speech separation

CASA-Based Robust Speaker Identification

Robust speaker identification using a CASA front-end

Robust speaker identification using auditory features and computational auditory scene analysis

Robust Speaker Recognition Using Binary Time-Frequency Masks

Dodaj adresata

Anulowanie wysłania wiadomości

Czy na pewno chcesz anulować wysłanie wiadomości?

Wyślij wiadomość

Opcje filtrowania

Data publikacji

Ustawianie zakresu dat

Podaj zakres dat dla filtrowania wyświetlonych wyników. Możesz podać datę początkową, końcową lub obie daty. Daty możesz wpisać ręcznie lub wybrać za pomocą kalendarza.

Typ publikacji

Słowa kluczowe

Zgłaszanie błędu / nadużycia

Nieudane wysłanie zgłoszenia

Ułatwienia dostępu