Wyniki wyszukiwania dla: Yi Jiang

Pozycje od 1 do 8 spośród 8 wyników

rozdział

A DNN parameter mask for the binaural reverberant speech segregation

Yi Jiang, Wei Li, Yuanyuan Zu, Runsheng Liu, więcej

2016 9th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI) > 959 - 963

2016 9th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)

The reverberant speech segregation is a basic problem in speech enhancement and automatic speech recognition. Based on the deep neural networks (DNN), a novel binaural speech segregation method is proposed. The binaural feature is extracted and used as the cue to train a DNN with a ideal parameter mask. The trained DNN is used to distinguish the target speech and noise, and output the estimated parameter...

rozdział

A Binaural Deep Neural Networks Parameter Mask for the Robust Automatic Speech Recognition System

Yi Jiang, Runsheng Liu

2016 International Conference on Network and Information Systems for Computers (ICNISC) > 352 - 356

2016 International Conference on Network and Information Systems for Computers (ICNISC)

Within the framework of computational auditory scene analysis (CASA), a parameter masks estimator based on deep neural networks (DNN) is proposed for automatic speech recognition (ASR) in noisy environments. This paper addresses the robustness in binaural machine speech recognition by speech energy estimation using DNN. An ideal parameter mask (IPM) is introduced as the goal of the DNN estimator,...

rozdział

Auditory features for the close talk speech enhancement with parameter masks

Yi Jiang, Yuanyuan Zu, Runsheng Liu

2015 8th International Congress on Image and Signal Processing (CISP) > 1194 - 1198

2015 8th International Congress on Image and Signal Processing (CISP)

The speech segregation and enhancement is a hard task in speech communication. In order to get the clean target speech, a close talk system is used to collect the speech with a nearby microphone. A deep neural networks (DNN) estimator is used in a frequency channel for speech energy calculation with parameter masks. The adjusted binaural auditory features are used as the main input for DNN speech...

rozdział

A realtime analysis/synthesis Gammatone filterbank

Youwei Yang, Yi Jiang, Runsheng Liu, Dongmei Li

2015 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC) > 1 - 6

2015 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC)

Gammatone filterbanks are widely used in computational auditory models for modeling the peripheral filtering function of the cochlea. However, the high computational complexity and time consumption limits its usage in portable acoustic applications. To address this issue, a realtime and efficient digital implementation of Gammatone filterbank is proposed. The decomposed signal can be resynthesized...

rozdział

Binaural deep neural network for robust speech enhancement

Yi Jiang, Runsheng Liu

2014 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC) > 692 - 695

2014 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC)

Robust speech enhancement is a challenge task, especially in noisy environments. The deep neural network has shown good performance on binaural speech enhancement with various speakers at a same distance. As binaural cues are based on the locations of sound sources, this paper analyze the performance of binaural deep neural network with different distances. The theoretical derivation and experiment...

artykuł

Binaural Classification for Reverberant Speech Segregation Using Deep Neural Networks

Yi Jiang, DeLiang Wang, RunSheng Liu, ZhenMing Feng

IEEE/ACM Transactions on Audio, Speech, and Language Processing > 2014 > 22 > 12 > 2112 - 2121

Speech signal degradation in real environments mainly results from room reverberation and concurrent noise. While human listening is robust in complex auditory scenes, current speech segregation algorithms do not perform well in noisy and reverberant environments. We treat the binaural segregation problem as binary classification, and employ deep neural networks (DNNs) for the classification task...

rozdział

Auditory features based on Gammatone filters for robust speech recognition

Jun Qi, Dong Wang, Yi Jiang, Runsheng Liu

2013 IEEE International Symposium on Circuits and Systems (ISCAS2013) > 305 - 308

2013 IEEE International Symposium on Circuits and Systems (ISCAS)

A major challenge for automatic speech recognition (ASR) relates to significant performance reduction in noisy environments. Recent research has shown that auditory features based on Gammatone filters are promising to improve robustness of ASR systems against noise, though the research is far from extensive and generalizability of the new features is unknown. This paper presents our implementation...

rozdział

An algorithm combined with spectral subtraction and binary masking for monaural speech segregation

Yi Jiang, Hong Zhou

2011 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC) > 1 - 4

2011 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC)

Monaural speech segregation from complex concurrent noise is an extremely challenging problem; binary mask is a method to solve this problem, however, the performance of binary mask is limited by remaining the noise in the result. In this paper, an algorithm integrated Spectral Subtraction and binary masking for speech separation and enhancement was proposed. It follows the framework of computational...

Opcje filtrowania

Słowa kluczowe:
SPEECH

Data publikacji

Ustaw własny zakres dat

Typ publikacji

książka (7)
artykuł (1)

Słowa kluczowe

SIGNAL TO NOISE RATIO (6)
COMPUTATIONAL AUDITORY SCENE ANALYSIS (CASA) (4)
FEATURE EXTRACTION (4)
EAR (3)
ROBUSTNESS (3)
AZIMUTH (2)
DEEP NEURAL NETWORKS (DNNS) (2)
FILTER BANKS (2)
INTERFERENCE (2)
PARAMETER MASKS (2)
SPEECH ENHANCEMENT (2)
SPEECH SEGREGATION (2)
TRAINING (2)
ALGORITHM DESIGN AND ANALYSIS (1)
AUDITORY FEATURE (1)
AUDITORY MODELS (1)
AUTOMATIC SPEECH RECOGNITION (1)
BANDWIDTH (1)
BINARY CLASSIFICATION (1)
BINARY MASKING (1)
BINAURAL (1)
BINAURAL FEATURES (1)
CLOSE TALK (1)
COMPUTATIONAL MODELING (1)
DEEP NEURAL NETWORK (1)
DEEP NEURAL NETWORKS (1)
DEEP NEURAL NETWORKS (DNN) (1)
DELAYS (1)
FREQUENCY-DOMAIN ANALYSIS (1)
GAMMATONE FILTERBANK (1)
GAMMATONE FILTERS (1)
IMAGE ANALYSIS (1)
MEL FREQUENCY CEPSTRAL COEFFICIENT (1)
MICROPHONES (1)
NEURAL NETWORKS (1)
NOISE (1)
PARAMETER MASK ALGORITHM (1)
PSYCHOACOUSTIC MODELS (1)
REVERBERANT SPEECH SEGREGATION (1)
ROBUST SPEECH RECOGNITION (1)
ROOM REVERBERATION (1)
SHORT-TIME OBJECTIVE INTELLIGIBILITY (1)
SPECTRAL SUBTRACTION (1)
SPEECH PROCESSING (1)
SPEECH RECOGNITION (1)
TIME FREQUENCY ANALYSIS (1)
TIME-DOMAIN ANALYSIS (1)
TIME-FREQUENCY ANALYSIS (1)
TIME-FREQUENCY UNITS (1)
więcej

INFONA - portal komunikacji naukowej

Wyniki wyszukiwania dla: Yi Jiang

Dodaj adresata

Anulowanie wysłania wiadomości

Czy na pewno chcesz anulować wysłanie wiadomości?

Wyślij wiadomość

Opcje filtrowania

Data publikacji

Ustawianie zakresu dat

Podaj zakres dat dla filtrowania wyświetlonych wyników. Możesz podać datę początkową, końcową lub obie daty. Daty możesz wpisać ręcznie lub wybrać za pomocą kalendarza.

Typ publikacji

Słowa kluczowe

Zgłaszanie błędu / nadużycia

Nieudane wysłanie zgłoszenia

Ułatwienia dostępu