Search results for: Eng Siong Chng

Items from 1 to 4 out of 4 results

chapter

Spoofing speech detection using temporal convolutional neural network

Xiaohai Tian, Xiong Xiao, Eng Siong Chng, Haizhou Li

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 6

Spoofing speech detection aims to differentiate spoofing speech from natural speech. Frame-based features are used in most previous work. Although multiple frames or dynamic features are used to form a super-vector to represent the temporal information, the time span covered by these features is not sufficient. Most of the systems failed to detect the non-vocoder or unit selection based...

chapter

Detecting synthetic speech using long term magnitude and phase information

Xiaohai Tian, Steven Du, Xiong Xiao, Haihua Xu, more

2015 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP) > 611 - 615

Synthetic speech consists of speech signals generated by text-to-speech (TTS) and voice conversion (VC) techniques. It poses a threat to speaker verification (SV) systems, as an attacker may use TTS or VC to synthesize a speaker's voice to cheat the SV system. To address this challenge, we study the detection of synthetic speech using long-term magnitude and phase information of speech. As most of...

article

A Study on the Generalization Capability of Acoustic Models for Robust Speech Recognition

Xiong Xiao, Jinyu Li, Eng Siong Chng, Haizhou Li, more

IEEE Transactions on Audio, Speech, and Language Processing > 2010 > 18 > 6 > 1158 - 1169

In this paper, we explore the generalization capability of acoustic models for improving speech recognition robustness against noise distortions. While generalization in statistical learning theory originally refers to the model's ability to generalize well on unseen testing data drawn from the same distribution as that of the training data, we show that good generalization capability is also desirable...

chapter

Integrating Acoustic, Prosodic and Phonotactic Features for Spoken Language Identification

Rong Tong, Bin Ma, Donglai Zhu, Haizhou Li, more

2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings > 1 > I

2006 IEEE International Conference on Acoustics, Speech, and Signal Processing

The fundamental issue in automatic language identification is to explore effective discriminative cues for languages. This paper studies the fusion of five features at different levels of abstraction for language identification, including spectrum, duration, pitch, n-gram phonotactic, and bag-of-sounds features. We build a system and report test results on NIST 1996 and 2003 LRE datasets. The...

INFONA - science communication portal
