Search results for: Zhe Wang

Items from 1 to 7 out of 7 results

chapter

Single-channel speech separation based on robust sparse Bayesian learning

Zhe Wang, Guoan Bi, Xiumei Li

2017 13th IEEE International Conference on Control & Automation (ICCA) > 113 - 117

This paper describes a novel algorithm to improve the performance of sparsity-based single-channel speech separation (SCSS) based on compressed sensing, which is an emerging technique for efficient data reconstruction. The conventional approach assumes the mixing conditions and source signals are stationary. For practical applications of audio source separation, however, we face the challenges...

chapter

Blind separation method of overlapped speech mixtures in STFT domain with noise and residual crosstalk suppression

Zhe Wang, Guoan Bi, Xiumei Li

2016 12th IEEE International Conference on Control and Automation (ICCA) > 876 - 880

Noise and residual crosstalk are two important issues that have to be addressed in practical applications of underdetermined blind source separation (UBSS) for speech mixtures. This paper proposes a noise-robust UBSS algorithm to deal with highly overlapped speech sources, with a residual crosstalk suppression scheme in the short-time Fourier transform (STFT) domain. The proposed algorithm is firstly...

chapter

A time-frequency preprocessing method for blind source separation of speech signal with temporal structure

Zhe Wang, Guoan Bi

2015 10th International Conference on Information, Communications and Signal Processing (ICICS) > 1 - 6

Determination of the number of sources is a practical issue that has to be addressed in applications of underdetermined blind source separation (UBSS). This paper proposes a noise-robust UBSS algorithm for highly overlapped speech sources in the short-time Fourier transform (STFT) domain. The basic principle of the proposed algorithm firstly estimates the unknown number of sources in time-frequency...

chapter

Linear prediction based comfort noise generation in the EVS codec

Zhe Wang, Lei Miao, Jon Gibbs, Tomas Toftgard, et al.

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5903 - 5907

A discontinuous transmission (DTX) system, which is widely adopted in speech codecs, is an important function for speech communication systems that can reduce the transmission bandwidth by at least half. Within a DTX system, comfort noise generation (CNG) plays a key role in the overall quality. Critical performance parameters with respect to the CNG including the transition quality from active...

chapter

Overview of the EVS codec architecture

Martin Dietz, Markus Multrus, Vaclav Eksler, Vladimir Malenovsky, et al.

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5698 - 5702

The recently standardized 3GPP codec for Enhanced Voice Services (EVS) offers new features and improvements for low-delay real-time communication systems. Based on a novel, switched low-delay speech/audio codec, the EVS codec contains various tools for better compression efficiency and higher quality for clean/noisy speech, mixed content and music, including support for wideband, super-wideband and...

chapter

Automatic multi-speaker speech recognition system based on time-frequency blind source separation under ubiquitous environment

Zhe Wang, Haijian Zhang, Guoan Bi, Xiumei Li

2014 9th IEEE Conference on Industrial Electronics and Applications (ICIEA) > 101 - 106

In this paper, an automatic speech recognition (ASR) system under ubiquitous environment is proposed, which is successfully implemented in a personalized voice command system under vehicle and living room environment. The proposed ASR system describes a novel scheme of separating speech sources from multi-speakers, detecting speech presence/absence by tracking the higher portion of speech power spectrum...

chapter

A Voice Activity Detector Based on Noise Spectrum Adaptation and Discrimination Information for Automatic Speech Recognition System

Zhe Wang, Guoan Bi

2014 5th International Conference on Intelligent Systems, Modelling and Simulation (ISMS) > 301 - 305

In this paper, an adaptive voice activity detector (VAD) is proposed, which is successfully implemented in an MFCC-based speech recognition system. The proposed VAD describes a novel scheme of detecting speech presence/absence by tracking the higher portion of the speech power spectrum and judging the discrimination information. The VAD adjusts its judgment threshold adaptively. An automatic speech recognition...


INFONA - science communication portal
