Search results for: Yong Qin

Items from 1 to 5 out of 5 results

chapter

Automatic Pronunciation Transliteration for Chinese-English Mixed Language Keyword Spotting

Shilei Zhang, Zhiwei Shuang, Yong Qin

2010 20th International Conference on Pattern Recognition > 1610 - 1613

2010 20th International Conference on Pattern Recognition (ICPR 2010)

This paper presents an automatic pronunciation transliteration method with acoustic and contextual analysis for a Chinese-English mixed language keyword spotting (KWS) system. Often, we need to develop robust Chinese-English mixed language spoken language technology without Chinese-accented English acoustic data. In this paper, we exploit a pronunciation conversion method based on syllable-based characteristic...

chapter

Chinese prosodic phrasing with the source-channel model

Honghui Dong, Yong Qin, Limin Jia

2009 Chinese Control and Decision Conference > 6168 - 6171

2009 Chinese Control and Decision Conference (CCDC 2009)

Prosodic phrasing is a classic problem in natural language processing, which is useful not only for text-to-speech (TTS) but also for speech recognition, statistical machine learning, etc. This paper introduces and discusses the source-channel model for Chinese prosodic phrasing. Based on the basic idea, the hidden Markov model (HMM) and the improved source-channel model are both used to describe the phrasing...

chapter

Recent advances in the IBM GALE Mandarin transcription system

S.M. Chu, Hong-kwang Kuo, L. Mangu, Yi Liu, et al.

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 4329 - 4332

ICASSP 2008. IEEE International Conference on Acoustics, Speech and Signal Processing

This paper describes the system and algorithmic developments in the automatic transcription of Mandarin broadcast speech made at IBM in the second year of the DARPA GALE program. Technical advances over our previous system include improved acoustic models using embedded tone modeling, and a new topic-adaptive language model (LM) rescoring technique based on dynamically generated LMs. We present results...

chapter

Voice conversion by combining frequency warping with unit selection

Zhiwei Shuang, Fanping Meng, Yong Qin

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 4661 - 4664

ICASSP 2008. IEEE International Conference on Acoustics, Speech and Signal Processing

In this paper, we propose a novel voice conversion method that combines frequency warping and unit selection to improve the similarity to the target speaker. We use frequency warping to get the warped source spectrum, which is then used as the estimated target for later unit selection of the target speaker's spectrum. Such an estimated target can preserve the natural transitions of human speech. Then, part of...

chapter

The IBM Mandarin Broadcast Speech Transcription System

Stephen M. Chu, Hong-kwang Kuo, Yi Y Liu, Yong Qin, et al.

2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07 > 2 > II-345 - II-348

2007 IEEE International Conference on Acoustics, Speech, and Signal Processing

This paper describes the technical and system building advances in the automatic transcription of Mandarin broadcast speech made at IBM in the first year of the DARPA GALE program. In particular, we discuss the application of minimum phone error (MPE) discriminative training and a new topic-adaptive language modeling technique. We present results on both the RT04 evaluation data and two larger community-defined...

Filter options

Keywords:
SPEECH PROCESSING


INFONA - science communication portal
