2008 6th International Symposium on Chinese Spoken Language Processing

chapter

Speech Database Compacted for an Embedded Mandarin TTS System

Qing Guo, Bin Wang, N. Katae

2008 6th International Symposium on Chinese Spoken Language Processing > 1 - 4

In recent years, unit-selection-based concatenative speech synthesis using a large speech database has become popular because it can produce high-quality synthesized speech. However, such a large speech database is impractical for many applications, such as those ported to embedded devices, because of the storage requirement and the computational complexity involved in searching it. In...

chapter

Double Gauss Based Unsupervised Score Normalization in Speaker Verification

Wu Guo, Li-Rong Dai, Ren-Hua Wang

2008 6th International Symposium on Chinese Spoken Language Processing > 1 - 4

In text-independent speaker verification, unsupervised mode can improve system performance. In traditional systems, the speaker model is updated when a test speech has a score higher than a particular threshold; we call this unsupervised model training. In this paper, an unsupervised score normalization is proposed. A target speaker score Gauss and an impostor score Gauss are set up as a prior; the...

chapter

Evaluation and Analysis of Minimum Phone Error Training and its Modified Versions for Large Vocabulary Mandarin Speech Recognition

Yung-Jen Cheng, Che-Kuang Lin, Lin-Shan Lee

2008 6th International Symposium on Chinese Spoken Language Processing > 1 - 4

This paper reports a detailed study on minimum phone error (MPE), minimum phone frame error (MPFE), and a physical-state level version of minimum Bayes risk (sMBR) training, as well as several modified versions of them, for transcription of large vocabulary Mandarin broadcast news. We found that the results are quite different from those observed previously for English and Arabic broadcast news tasks [1],...

chapter

CityBrowser II: A Multimodal Restaurant Guide in Mandarin

Jingjing Liu, Yushi Xu, S. Seneff, V. Zue

2008 6th International Symposium on Chinese Spoken Language Processing > 1 - 4

In this paper we present a conversational dialogue system, CityBrowser II, which allows users to ask for restaurant information in Mandarin. Developed in the Galaxy infrastructure with a common, language-independent semantic representation, CityBrowser integrates portability and scalability. By inheriting the infrastructure and main language understanding/generation components from its...

chapter

Using Pseudo-Key for Language Recognition System Design

Hanwu Sun, Bin Ma, Haizhou Li

2008 6th International Symposium on Chinese Spoken Language Processing > 1 - 4

In this paper, we present a novel pseudo-key analysis approach for the fusion system of language recognition. State-of-the-art language recognition systems for the NIST language recognition evaluation (LRE) commonly consist of multiple language classifiers. To keep the fusion system from being spoiled by a single abnormal classifier, pseudo keys are designed to check the integrity of each of the individual...

chapter

Discriminative Feedback Adaptation for GMM-UBM Speaker Verification

Yi-Hsiang Chao, Wei-Ho Tsai, Hsin-Min Wang

2008 6th International Symposium on Chinese Spoken Language Processing > 1 - 4

The GMM-UBM system is the current state-of-the-art approach for text-independent speaker verification. The advantage of the approach is that both target speaker model and impostor model (UBM) have generalization ability to handle "unseen" acoustic patterns. However, since GMM-UBM uses a common anti-model, namely UBM, for all target speakers, it tends to be weak in rejecting impostors' voices...

chapter

An Efficient Feature Selection Method for Speaker Recognition

Hanwu Sun, Bin Ma, Haizhou Li

2008 6th International Symposium on Chinese Spoken Language Processing > 1 - 4

In this paper, a new feature selection method for speaker recognition is proposed that keeps the high-quality speech frames for speaker modelling and removes noisy and corrupted speech frames. To obtain robust voice activity detection in a variety of acoustic conditions, the spectral subtraction algorithm is adopted to estimate the frame power. An energy-based frame selection algorithm is then...

chapter

ISCSLP 2008 Cover

2008 6th International Symposium on Chinese Spoken Language Processing > 1

chapter

ISCSLP 2008 Organizing Committees

2008 6th International Symposium on Chinese Spoken Language Processing > 1

chapter

ISCSLP 2008 Preface

2008 6th International Symposium on Chinese Spoken Language Processing > 1

chapter

ISCSLP 2008 Message from the General Chair

2008 6th International Symposium on Chinese Spoken Language Processing > 1

chapter

ISCSLP 2008 Technical Program Committee

2008 6th International Symposium on Chinese Spoken Language Processing > 1 - 3

chapter

Subword Latent Semantic Analysis for Texttiling-Based Automatic Story Segmentation of Chinese Broadcast News

Yulian Yang, Lei Xie

2008 6th International Symposium on Chinese Spoken Language Processing > 1 - 4

This paper proposes to perform latent semantic analysis (LSA) on character/syllable n-gram sequences of automatic speech recognition (ASR) transcripts, namely subword LSA, as an extension of our previous work on subword text tiling for automatic story segmentation of Chinese broadcast news. LSA represents the 'meaning' of a lexical term by a feature vector conveying the term's relations with other...

chapter

Eigenchannel Compensation and Symmetric Score for Robust Text-Independent Speaker Verification

Yuan Dong, Jian Zhao, Liang Lu, Jiqing Lui, et al.

2008 6th International Symposium on Chinese Spoken Language Processing > 1 - 4

The negative effect of the session variability has become more and more severe for the performance of the speaker verification system. This paper discusses the eigenchannel compensation and investigates the symmetric scoring method to diminish the session variability and further enhance the performance. Experiments were conducted on the core tests of the 2006 and 2008 speaker recognition evaluation...

chapter

A Synchronous Method for Automatic Scoring of Language Learning

Bin Dong, Yonghong Yan

2008 6th International Symposium on Chinese Spoken Language Processing > 1 - 5

In this paper, a synchronous method based on a state graph is proposed to calculate the evaluation feature for automatic scoring in computer-assisted language learning (CALL). The posterior probabilities of states are selected as the main feature. The scores of hypothesized phonemes and words are estimated using the information of the corresponding states. Traditional systems use two passes and two different...

chapter

Noise Reduction Based Random Matrix Theory

X. Lu, S. Matsuda, T. Shimizu, S. Nakamura

2008 6th International Symposium on Chinese Spoken Language Processing > 1 - 4

In the speech enhancement literature, the signal subspace based method has gained a lot of attention because of its simplicity in analytical formulations. The original idea in this method is based on the assumption that the clean speech signal occupies a certain low-dimensional space, while the noise signal, which is an additive white noise, spreads over the whole observation space. In this method, accurate estimation...

chapter

The Pitch Analysis of Imperative Sentences in Standard Chinese

Jia Sun, Jilun Lu, Aijun Li, Yuan Jia

2008 6th International Symposium on Chinese Spoken Language Processing > 1 - 4

The present study investigates the intonational pattern of imperative sentences in Standard Chinese, especially those with an intensive mood, such as ordering and forbidding. Grouping the sentences by length and focusing on the fundamental frequency, this paper tries to provide a description of the pitch patterns of Chinese strong imperatives. Compared to the declarative sentence, the pitch contour of...

chapter

Word Alignment Based on Multi-Grain Model

Yanqing He, Yu Zhou, Chengqing Zong

2008 6th International Symposium on Chinese Spoken Language Processing > 1 - 4

Word alignment plays a critical role in statistical machine translation (SMT) and cross-language information retrieval. Until now, most existing methods compute the word alignment over the whole length of the sentence, and the alignment quality is unsatisfactory. In this paper, we propose a novel approach to word alignment based on a multi-grain model (WAMG). We split a parallel sentence pair into blocks...

chapter

An Improvement for Training Efficiency of Semi-Tied Covariance

Si-Bao Chen, Yu Hu, Bin Luo, Ren-Hua Wang

2008 6th International Symposium on Chinese Spoken Language Processing > 1 - 4

Semi-tied covariance (STC) is applied widely in speech recognition due to its feature de-correlation ability. Solving the transform matrices of STC is a nonlinear optimization problem. Gales proposed an efficient method by iteratively updating a row of transform matrices. However, it needs to solve cofactors of elements of a matrix row in two layers of loops. Directly solving them is very time-consuming...

chapter

Order Adaptation of the Fractional Fourier Transform Using the Intraframe Pitch Change Rate for Speech Recognition

Hui Yin, C. Nadeu, V. Hohmann, Xiang Xie, et al.

2008 6th International Symposium on Chinese Spoken Language Processing > 1 - 4

We propose an acoustic feature for speech recognition based on the combination of MFCC and fractional Fourier transform (FrFT). The transform orders for FrFT are adaptively set according to the intraframe pitch change rate. This method is motivated by the fact that the speech is not stationary even in a short period of time, and the idea is shown using an AM-FM speech model and some spectrograms of...

