2013 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP)

Items from 1 to 8 out of 8 results

chapter

A variable step-size-based ICA method for a fast and robust acoustic echo cancellation system without requiring double-talk detector

Marko Kanadi, Muhammad Tahir Akhtar, Wataru Mitsuhashi

2013 IEEE China Summit and International Conference on Signal and Information Processing > 118 - 121

2013 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP)

In this paper, we propose a novel method for improving performance of the acoustic echo canceller (AEC) employed in the hands-free communication. The main objective is to realize an improved performance without requiring a double talk detector (DTD). The basic idea is to employ a gradient-based independent component analysis (ICA) method with a generalized Cauchy distribution-based flexible score...

chapter

Improve low-resource non-native mispronunciation detection with native speech by articulatory-based tandem feature

Hua Yuan, Ji Xu, Junhong Zhao, Jia Liu

2013 IEEE China Summit and International Conference on Signal and Information Processing > 127 - 131

2013 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP)

In this paper, we propose a method to improve detecting the mispronunciation type of the non-native learners. In order to cope with the low-resource condition of non-native speech and the difference of native and non-native speech, the following efforts are made: 1) train acoustic model with the low-resource non-native data; 2) introduce the articulatory-based tandem feature; 3) pool auxiliary native...

chapter

Vocal source features for bilingual speaker identification

Jianglin Wang, Michael T. Johnson

2013 IEEE China Summit and International Conference on Signal and Information Processing > 170 - 173

2013 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP)

This paper introduces the use of two new features for speaker identification, Residual Phase Cepstrum Coefficients (RPCC) and Glottal Flow Cepstrum Coefficients (GLFCC), to capture speaker-specific characteristics from their vocal excitation patterns. Results on a cross-lingual speaker identification task taken from the NIST 2004 SRE demonstrate that these RPCC and GLFCC features are significantly...

chapter

Show-through removal for scanned images using non-linear NMF with adaptive smoothing

Qingju Liu, Wenwu Wang

2013 IEEE China Summit and International Conference on Signal and Information Processing > 650 - 654

2013 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP)

Scans of double-sided documents often suffer from show-through distortions, where contents of the reverse side (verso) may appear in the front-side page (recto). Several algorithms employed for show-through removal from the scanned images, are based on linear mixing models, including blind source separation (BSS), non-negative matrix factorization (NMF), and adaptive filtering. However, a recent study...

chapter

Emotional speaker verification with linear adaptation

Fanhu Bie, Dong Wang, Thomas Fang Zheng, Ruxin Chen

2013 IEEE China Summit and International Conference on Signal and Information Processing > 91 - 94

2013 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP)

Speaker verification suffers from significant performance degradation on emotional speech. We present an adaptation approach based on maximum likelihood linear regression (MLLR) and its feature-space variant, CMLLR. Our preliminary experiments demonstrate that this approach leads to considerable performance improvement, particularly with CMLLR (about 10% relative EER reduction in average). We also...

chapter

An adaptive β-order MMSE estimator for speech enhancement using super-Gaussian speech model

Shan An, Chang-chun Bao, Bing-yin Xia

2013 IEEE China Summit and International Conference on Signal and Information Processing > 327 - 331

2013 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP)

This paper proposes an adaptive β-order Minimum-Mean-Square-Error (MMSE) estimator for speech enhancement using super-Gaussian speech model (β-SG-MMSE). The spectral amplitude of clean speech is estimated by MMSE estimator under the assumption that the DFT coefficients of clean speech are modeled by super-Gaussian distribution and the DFT coefficients of noise signal are modeled by Gaussian distribution...

chapter

A speech enhancement algorithm based on β-order GARCH model

Xian-bo Meng, Chang-chun Bao, Bing-yin Xia

2013 IEEE China Summit and International Conference on Signal and Information Processing > 342 - 346

2013 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP)

This paper presents a novel speech enhancement algorithm based on β-order GARCH (Generalized Auto-regressive Conditional Heteroscedasticity) model. The speech signal is modeled as β-order GARCH process, and the a priori SNR is estimated effectively. The noisy signal is divided into several critical bands, and then the value of order β is updated adaptively according to the signal-to-noise ratios in...

chapter

Sequential UBM adaptation for speaker verification

Jun Wang, Dong Wang, Xiaojun Wu, Thomas Fang Zheng

2013 IEEE China Summit and International Conference on Signal and Information Processing > 356 - 359

2013 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP)

GMM-UBM-based speaker verification heavily relies on a well trained UBM. In practice, it is not often easy to obtain an UBM that fully matches acoustic channels in operation. To solve this problem, we propose a novel sequential MAP adaptation approach: by being sequentially updated with data from new enrollments, the UBM learns and converges to the working channel. Our experiments are conducted on...

Filter options

Keywords:
ADAPTATION MODELS

Publication date

Set your own date range

Keywords

SPEECH (7)
ACOUSTICS (3)
FEATURE EXTRACTION (2)
HIDDEN MARKOV MODELS (2)
NOISE MEASUREMENT (2)
SIGNAL TO NOISE RATIO (2)
SPEAKER RECOGNITION (2)
SPEAKER VERIFICATION (2)
SPEECH ENHANCEMENT (2)
TRAINING (2)
VECTORS (2)
β-ORDER GARCH MODEL (1)
β-ORDER MMSE (1)
A PRIORI SNR ESTIMATION (1)
ACCURACY (1)
ACOUSTIC ECHO CANCELLATION (1)
ADAPTIVE FILTER (1)
ALGORITHM DESIGN AND ANALYSIS (1)
ARTICULATORY FEATURE (1)
BSS (1)
CHANNEL ESTIMATION (1)
DATABASES (1)
DISCRETE FOURIER TRANSFORMS (1)
EMOTIONAL SPEECH (1)
EQUATIONS (1)
ESTIMATION (1)
FILTERING (1)
GLOTTAL SOURCE EXCITATION (1)
GRAY-SCALE (1)
IAIF AND GMM (1)
INDEPENDENT COMPONENT ANALYSIS (1)
LOW-RESOURCE MISPRONUNCIATION DETECTION (1)
MAP (1)
MATHEMATICAL MODEL (1)
MEL FREQUENCY CEPSTRAL COEFFICIENT (1)
MLLR (1)
MODEL PARAMETER ESTIMATION (1)
MULTI-LAYER PERCEPTION (MLP) (1)
NMF (1)
NON-LINEAR NMF (1)
OPTIMIZATION (1)
PROJECTED GRADIENT (1)
ROBUSTNESS (1)
SHOW-THROUGH (1)
SIGNAL PROCESSING ALGORITHMS (1)
SIMULATION (1)
SMOOTHING METHODS (1)
SPEAKER IDENTIFICATION (1)
SPEECH PRESENCE PROBABILITY (1)
SUPER-GAUSSIAN MODEL (1)
TANDEM FEATURE (1)
TRANSFORMS (1)
UBM (1)
VARIABLE STEP-SIZE (1)
WHITE NOISE (1)
more

INFONA - science communication portal

2013 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2013 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP)