Search results for: Yong Qin

Items from 1 to 7 out of 7 results

chapter

Improved Mandarin Keyword Spotting Using Confusion Garbage Model

Shilei Zhang, Zhiwei Shuang, Qin Shi, Yong Qin

2010 20th International Conference on Pattern Recognition > 3700 - 3703

2010 20th International Conference on Pattern Recognition (ICPR 2010)

This paper presents an improved acoustic keyword spotting (KWS) algorithm using a novel confusion garbage model in Mandarin conversational speech. Observing the KWS corpus, we found there are many words with similar pronunciation with predefined keywords, although they have different Chinese characters and different meanings, which easily result in high false alarm rate. In this paper, an improved...

chapter

Automatic Pronunciation Transliteration for Chinese-English Mixed Language Keyword Spotting

Shilei Zhang, Zhiwei Shuang, Yong Qin

2010 20th International Conference on Pattern Recognition > 1610 - 1613

2010 20th International Conference on Pattern Recognition (ICPR 2010)

This paper presents automatic pronunciation transliteration method with acoustic and contextual analysis for Chinese-English mixed language keyword spotting (KWS) system. More often, we need to develop robust Chinese-English mixed language spoken language technology without Chinese accented English acoustic data. In this paper, we exploit pronunciation conversion method based on syllable-based characteristic...

chapter

Comparison of Syllable/Phone HMM Based Mandarin TTS

Quansheng Duan, Shiyin Kang, Zhiyong Wu, Lianhong Cai, more

2010 20th International Conference on Pattern Recognition > 4496 - 4499

2010 20th International Conference on Pattern Recognition (ICPR 2010)

The performance of HMM-based text to speech (TTS) system is affected by the basic modeling units and the size of training data. This paper compares two HMM based Mandarin TTS systems using syllable and phone as basic units respectively with 1000, 3000 and 5000 sentences' training data. Two female speakers' corpora are used as training data for evaluation. For both corpora, the system using syllable...

chapter

The 2009 IBM GALE Mandarin broadcast transcription system

Stephen M Chu, Daniel Povey, Hong-Kwang Kuo, Lidia Mangu, more

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 4374 - 4377

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

This paper gives an up-to-date description of the IBM Mandarin broadcast transcription system developed under the DARPA GALE program. Technical advances over our previous system include a novel acoustic modeling approach using subspace Gaussian mixture models, a speaking rate adaptation method using frame rate normalization, and an effective recipe for lattice combination. We present results on three...

chapter

Chinese prosodic phrasing with the source-channel model

Honghui Dong, Yong Qin, Limin Jia

2009 Chinese Control and Decision Conference > 6168 - 6171

2009 Chinese Control and Decision Conference (CCDC 2009)

The prosodic phrasing is a classic problem in nature language process, which is not only useful for text-to-speech(TTS), but for speech recognition, statistic machine learning etc.. This paper introduces and discusses the source-channel model for Chinese prosodic phrasing. Based on the basic idea, the hidden Markov model (HMM) and the improved source-channel model are both used to describe the phrasing...

chapter

Utterance verification using improved confidence measures based on alignment confusion rate in Chinese digits recognition

Shilei Zhang, Danning Jiang, Yong Qin

2009 IEEE International Conference on Acoustics, Speech and Signal Processing > 1309 - 1312

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

In this paper, we explore an approach to improved confidence measures based on a novel alignment confusion rate (ACR) which integrates alignment information from two different modeling unit sets in Chinese digits recognition system. Both initial-final (IF) phone set and head-body-tail (HBT) models have proven to obtain good recognition performance for connected digit strings. These two different modeling...

chapter

Main vowel domain tone modeling with lexical and prosodic analysis for Mandarin ASR

Shilei Zhang, Qin Shi, S.M. Chu, Yong Qin

2009 IEEE International Conference on Acoustics, Speech and Signal Processing > 4561 - 4564

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

The tone is a distinctive discriminative feature in Mandarin Chinese. Often functional, yet seldom thorough are most large-scale Mandarin speech recognition systems in treating tone modeling. In particular, many lack the necessary sophistication to deal with the myriad variations arising from the combination of acoustic and lexical contexts. This paper reports an attempt to account for these variabilities...

Filter options

Keywords:
SPEECH

Publication date

Set your own date range

Keywords

HIDDEN MARKOV MODELS (6)
SPEECH RECOGNITION (6)
NATURAL LANGUAGE PROCESSING (5)
ACOUSTICS (4)
LATTICES (3)
TRAINING (3)
COMPUTATIONAL MODELING (2)
CONFIDENCE MEASURE (2)
CONTEXT (2)
HIDDEN MARKOV MODEL (2)
HMM (2)
KEYWORD SPOTTING (2)
SPEECH PROCESSING (2)
VOCABULARY (2)
ACCURACY (1)
ACOUSTIC ANALYSIS (1)
ACOUSTIC CONTEXT (1)
ACOUSTIC KEYWORD SPOTTING (1)
ACOUSTIC KWS METHOD (1)
ACOUSTIC MODELING APPROACH (1)
ACOUSTIC SIGNAL PROCESSING (1)
ADAPTATION MODEL (1)
ALIGNMENT CONFUSION RATE (1)
ANALYTICAL MODELS (1)
AUTOMATIC PRONUNCIATION TRANSLITERATION (1)
CAR-KIT MICROPHONE (1)
CFRN (1)
CHARACTER ERROR RATE (1)
CHINESE CHARACTERS (1)
CHINESE DIGITS RECOGNITION (1)
CHINESE PROSODIC PHRASING (1)
CHINESE-ENGLISH MIXED LANGUAGE KEYWORD SPOTTING (1)
CONFUSION GARBAGE MODEL (1)
CONNECTED-DIGITS SET (1)
CONSORTIUM DEFINED TEST SETS (1)
CONTEXT MODELING (1)
CONTEXT-DEPENDENT MODEL (1)
CONTEXTUAL ANALYSIS (1)
CONVERSATIONAL TELEPHONE DATASET (1)
DARPA GALE PROGRAM (1)
DECISION TREE (1)
DECISION TREES (1)
DECODING (1)
DISCRIMINATIVE TRAINING (1)
ENTROPY (1)
FALSE ALARM RATE (1)
FEATURE EXTRACTION (1)
FOOT (1)
FOOT-PATTERN MODEL (1)
FRAME RATE NORMALIZATION (1)
GAUSSIAN PROCESSES (1)
HEAD-BODY-TAIL MODELS (1)
HETEROJUNCTION BIPOLAR TRANSISTORS (1)
HMM-BASED CM METHOD (1)
HMM-BASED CONFIDENCE MEASURE METHOD (1)
HMM-BASED MANDARIN TEXT TO SPEECH SYSTEM (1)
IBM GALE MANDARIN BROADCAST TRANSCRIPTION SYSTEM (1)
IBM MANDARIN BROADCAST TRANSCRIPTION SYSTEM (1)
INITIAL-FINAL PHONE SET (1)
IP NETWORKS (1)
LANGUAGE TRANSLATION (1)
LARGE-VOCABULARY BROADCAST TRANSCRIPTION (1)
LATTICE RESCORING (1)
LEARNING (ARTIFICIAL INTELLIGENCE) (1)
LEXICAL ANALYSIS (1)
LEXICAL CONTEXT (1)
LINGUISTICS (1)
MAIN VOWEL (1)
MAIN VOWEL DOMAIN TONE MODELING (1)
MANDARIN (1)
MANDARIN ASR (1)
MANDARIN BROADCAST SPEECH TRANSCRIPTION (1)
MANDARIN CHINESE (1)
MANDARIN CONVERSATIONAL SPEECH (1)
MANDARIN KEYWORD SPOTTING (1)
MANDARIN SPEECH RECOGNITION SYSTEM (1)
MARKOV PROCESSES (1)
MAXIMUM ENTROPY METHODS (1)
MAXIMUM ENTROPY MODEL (1)
MEASUREMENT (1)
MICROPHONES (1)
MIXED LANGUAGE (1)
NATURE LANGUAGE PROCESS AREA (1)
POSTERIOR PROBABILITY (1)
PROBABILITY (1)
PRONUNCIATION CONVERSION (1)
PROSODIC ANALYSIS (1)
PROSODIC PHRASING (1)
RHYTHM (1)
RHYTHM MODEL (1)
ROBUST TONE TRACKING (1)
SIMILAR PRONUNCIATION (1)
SIMILAR PRONUNCIATION WORDS (1)
SOURCE-CHANNEL MODEL (1)
SPEAKING RATE ADAPTATION (1)
SPEECH SYNTHESIS (1)
STATE-OF-THE-ART SPEECH RECOGNITION TECHNOLOGY (1)
STATISTIC MACHINE LEARNING (1)
SUBSPACE GAUSSIAN MIXTURE MODELS (1)
more

INFONA - science communication portal

Search results for: Yong Qin

Improved Mandarin Keyword Spotting Using Confusion Garbage Model

Automatic Pronunciation Transliteration for Chinese-English Mixed Language Keyword Spotting

Comparison of Syllable/Phone HMM Based Mandarin TTS

The 2009 IBM GALE Mandarin broadcast transcription system

Chinese prosodic phrasing with the source-channel model

Utterance verification using improved confidence measures based on alignment confusion rate in Chinese digits recognition

Main vowel domain tone modeling with lexical and prosodic analysis for Mandarin ASR

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options