Search results for: Jun Du

Items from 1 to 11 out of 11 results

chapter

Gaussian density guided deep neural network for single-channel speech enhancement

Li Chai, Jun Du, Yan-nan Wang

2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP) > 1 - 6

Recently, the minimum mean squared error (MMSE) has been a benchmark optimization criterion for deep neural network (DNN) based speech enhancement. In this study, a probabilistic learning framework to estimate the DNN parameters for single-channel speech enhancement is proposed. First, the statistical analysis shows that the prediction error vector at the DNN output closely follows a unimodal density...

chapter

On generating mixing noise signals with basis functions for simulating noisy speech and learning DNN-based speech enhancement models

Shi-Xue Wen, Jun Du, Chin-Hui Lee

2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP) > 1 - 6

We first examine the generalization issue with the noise samples used in training nonlinear mapping functions between noisy and clean speech features for deep neural network (DNN) based speech enhancement. Then an empirical proof is established to explain why the DNN-based approach has a good noise generalization capability provided that a large collection of noise types is included in generating...

chapter

Joint noise and mask aware training for DNN-based speech enhancement with sub-band features

Qing Wang, Jun Du, Li-Rong Dai, Chin-Hui Lee

2017 Hands-free Speech Communications and Microphone Arrays (HSCMA) > 101 - 105

We present a joint noise and mask aware training strategy for deep neural network (DNN) based speech enhancement with sub-band features. First, an analysis of the previously proposed dynamic noise aware training approach, tested on wide-band (16 kHz) speech data, shows that the full-band dynamic noise features cannot always improve the enhancement performance due to inaccurate noise estimation....

chapter

Multiple-target deep learning for LSTM-RNN based speech enhancement

Lei Sun, Jun Du, Li-Rong Dai, Chin-Hui Lee

2017 Hands-free Speech Communications and Microphone Arrays (HSCMA) > 136 - 140

In this study, we explore long short-term memory recurrent neural networks (LSTM-RNNs) for speech enhancement. First, a regression LSTM-RNN approach for a direct mapping from the noisy to clean speech features is presented and verified to be more effective than deep neural network (DNN) based regression techniques in modeling long-term acoustic context. Then, a comprehensive comparison between the...

chapter

Boosting DNN-based speech enhancement via explicit transformations

Qing Wang, Jun Du, Li-Rong Dai

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 4

In this study, we investigate the learning behaviors of DNNs under explicit feature transformations. As a demonstration, linear and logarithm transformations, corresponding to the amplitude spectra and log-power spectra, are compared with the same minimum mean squared error (MMSE) objective function for optimizing DNN parameters. Based on the experimental analysis of the DNN learning behaviors, we...

chapter

A regression approach to binaural speech segregation via deep neural network

Nana Fan, Jun Du, Li-Rong Dai

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP) > 1 - 5

This paper proposes a novel regression approach to binaural speech segregation based on deep neural network (DNN). In contrast to the conventional ideal binary mask (IBM) method using DNN with the interaural time difference (ITD) and interaural level difference (ILD) as the auditory features, the log-power spectra (LPS) features of target speech are directly predicted via a regression DNN model by...

chapter

A unified speaker-dependent speech separation and enhancement system based on deep neural networks

Tian Gao, Jun Du, Li Xu, Cong Liu, more

2015 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP) > 687 - 691

Speech enhancement and speech separation are important front-ends of many speech processing systems. In real tasks, background noise is often mixed with interfering human voices. In this paper, we explore a framework to unify speech enhancement and speech separation for a speaker-dependent scenario based on deep neural networks (DNNs). Using a supervised method, a DNN is adopted to directly...

chapter

Joint training of front-end and back-end deep neural networks for robust speech recognition

Tian Gao, Jun Du, Li-Rong Dai, Chin-Hui Lee

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4375 - 4379

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Based on the recently proposed speech pre-processing front-end with deep neural networks (DNNs), we first investigate different feature mappings directly from noisy speech via DNN for robust speech recognition. Next, we propose to jointly train a single DNN for both feature mapping and acoustic modeling. Finally, we show that the word error rate (WER) of the jointly trained system could be significantly...

chapter

Synthesized stereo-based stochastic mapping with data selection for robust speech recognition

Jun Du, Qiang Huo

2012 8th International Symposium on Chinese Spoken Language Processing > 122 - 125

2012 8th International Symposium on Chinese Spoken Language Processing (ISCSLP 2012)

In this paper, we present a synthesized stereo-based stochastic mapping approach for robust speech recognition. We extend the traditional stereo-based stochastic mapping (SSM) in two main aspects. First, the constraint of stereo-data, which is not practical in real applications, is relaxed by using HMM-based speech synthesis. Then we make feature mapping more focused on those incorrectly recognized...

chapter

Active Learning with Human-Like Noisy Oracle

Jun Du, C X Ling

2010 IEEE International Conference on Data Mining > 797 - 802

2010 10th IEEE International Conference on Data Mining (ICDM 2010)

When active learning is applied to real-world applications, human experts usually act as oracles to provide labels. However, humans make mistakes, so noise might be introduced during the learning process. Most previous studies simplify the problem by assuming uniformly distributed noise over the sample space. Such an assumption, however, might fail to precisely reflect the human experts' behaviour in...

chapter

HMM-based pseudo-clean speech synthesis for SPLICE algorithm

Jun Du, Yu Hu, Li-Rong Dai, Ren-Hua Wang

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 4570 - 4573

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

In this paper, we present a novel approach to relax the constraint of stereo-data, which is needed in a series of algorithms for noise-robust speech recognition. As a demonstration with the SPLICE algorithm, we generate pseudo-clean features to replace the ideal clean features from one of the stereo channels by using HMM-based speech synthesis. Experimental results on the Aurora2 database show that the...
