Search results for: Runsheng Liu

Items from 1 to 5 out of 5 results

chapter

A DNN parameter mask for the binaural reverberant speech segregation

Yi Jiang, Wei Li, Yuanyuan Zu, Runsheng Liu, more

2016 9th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI) > 959 - 963

2016 9th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)

The reverberant speech segregation is a basic problem in speech enhancement and automatic speech recognition. Based on the deep neural networks (DNN), a novel binaural speech segregation method is proposed. The binaural feature is extracted and used as the cue to train a DNN with a ideal parameter mask. The trained DNN is used to distinguish the target speech and noise, and output the estimated parameter...

chapter

A Binaural Deep Neural Networks Parameter Mask for the Robust Automatic Speech Recognition System

Yi Jiang, Runsheng Liu

2016 International Conference on Network and Information Systems for Computers (ICNISC) > 352 - 356

2016 International Conference on Network and Information Systems for Computers (ICNISC)

Within the framework of computational auditory scene analysis (CASA), a parameter masks estimator based on deep neural networks (DNN) is proposed for automatic speech recognition (ASR) in noisy environments. This paper addresses the robustness in binaural machine speech recognition by speech energy estimation using DNN. An ideal parameter mask (IPM) is introduced as the goal of the DNN estimator,...

chapter

A realtime analysis/synthesis Gammatone filterbank

Youwei Yang, Yi Jiang, Runsheng Liu, Dongmei Li

2015 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC) > 1 - 6

2015 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC)

Gammatone filterbanks are widely used in computational auditory models for modeling the peripheral filtering function of the cochlea. However, the high computational complexity and time consumption limits its usage in portable acoustic applications. To address this issue, a realtime and efficient digital implementation of Gammatone filterbank is proposed. The decomposed signal can be resynthesized...

chapter

Binaural deep neural network for robust speech enhancement

Yi Jiang, Runsheng Liu

2014 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC) > 692 - 695

2014 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC)

Robust speech enhancement is a challenge task, especially in noisy environments. The deep neural network has shown good performance on binaural speech enhancement with various speakers at a same distance. As binaural cues are based on the locations of sound sources, this paper analyze the performance of binaural deep neural network with different distances. The theoretical derivation and experiment...

article

Binaural Classification for Reverberant Speech Segregation Using Deep Neural Networks

Yi Jiang, DeLiang Wang, RunSheng Liu, ZhenMing Feng

IEEE/ACM Transactions on Audio, Speech, and Language Processing > 2014 > 22 > 12 > 2112 - 2121

Speech signal degradation in real environments mainly results from room reverberation and concurrent noise. While human listening is robust in complex auditory scenes, current speech segregation algorithms do not perform well in noisy and reverberant environments. We treat the binaural segregation problem as binary classification, and employ deep neural networks (DNNs) for the classification task...

Filter options

Keywords:
SIGNAL TO NOISE RATIO

Publication date

Set your own date range

Publication type

book (4)
article (1)

Keywords

COMPUTATIONAL AUDITORY SCENE ANALYSIS (CASA) (3)
AZIMUTH (2)
DEEP NEURAL NETWORKS (DNNS) (2)
EAR (2)
FEATURE EXTRACTION (2)
INTERFERENCE (2)
ROBUSTNESS (2)
TRAINING (2)
AUDITORY MODELS (1)
AUTOMATIC SPEECH RECOGNITION (1)
BANDWIDTH (1)
BINARY CLASSIFICATION (1)
BINAURAL (1)
BINAURAL FEATURES (1)
COMPUTATIONAL MODELING (1)
DEEP NEURAL NETWORK (1)
DEEP NEURAL NETWORKS (1)
DELAYS (1)
FILTER BANKS (1)
GAMMATONE FILTERBANK (1)
NEURAL NETWORKS (1)
PARAMETER MASK ALGORITHM (1)
PARAMETER MASKS (1)
PSYCHOACOUSTIC MODELS (1)
REVERBERANT SPEECH SEGREGATION (1)
ROOM REVERBERATION (1)
SHORT-TIME OBJECTIVE INTELLIGIBILITY (1)
SPEECH ENHANCEMENT (1)
SPEECH SEGREGATION (1)
TIME-FREQUENCY ANALYSIS (1)
more

INFONA - science communication portal

Search results for: Runsheng Liu

A DNN parameter mask for the binaural reverberant speech segregation

A Binaural Deep Neural Networks Parameter Mask for the Robust Automatic Speech Recognition System

A realtime analysis/synthesis Gammatone filterbank

Binaural deep neural network for robust speech enhancement

Binaural Classification for Reverberant Speech Segregation Using Deep Neural Networks

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options