Wyniki wyszukiwania dla: Yong Qin

Pozycje od 1 do 7 spośród 7 wyników

rozdział

Improved Mandarin Keyword Spotting Using Confusion Garbage Model

Shilei Zhang, Zhiwei Shuang, Qin Shi, Yong Qin

2010 20th International Conference on Pattern Recognition > 3700 - 3703

2010 20th International Conference on Pattern Recognition (ICPR 2010)

This paper presents an improved acoustic keyword spotting (KWS) algorithm using a novel confusion garbage model in Mandarin conversational speech. Observing the KWS corpus, we found there are many words with similar pronunciation with predefined keywords, although they have different Chinese characters and different meanings, which easily result in high false alarm rate. In this paper, an improved...

rozdział

Automatic Pronunciation Transliteration for Chinese-English Mixed Language Keyword Spotting

Shilei Zhang, Zhiwei Shuang, Yong Qin

2010 20th International Conference on Pattern Recognition > 1610 - 1613

2010 20th International Conference on Pattern Recognition (ICPR 2010)

This paper presents automatic pronunciation transliteration method with acoustic and contextual analysis for Chinese-English mixed language keyword spotting (KWS) system. More often, we need to develop robust Chinese-English mixed language spoken language technology without Chinese accented English acoustic data. In this paper, we exploit pronunciation conversion method based on syllable-based characteristic...

rozdział

Comparison of Syllable/Phone HMM Based Mandarin TTS

Quansheng Duan, Shiyin Kang, Zhiyong Wu, Lianhong Cai, więcej

2010 20th International Conference on Pattern Recognition > 4496 - 4499

2010 20th International Conference on Pattern Recognition (ICPR 2010)

The performance of HMM-based text to speech (TTS) system is affected by the basic modeling units and the size of training data. This paper compares two HMM based Mandarin TTS systems using syllable and phone as basic units respectively with 1000, 3000 and 5000 sentences' training data. Two female speakers' corpora are used as training data for evaluation. For both corpora, the system using syllable...

rozdział

The 2009 IBM GALE Mandarin broadcast transcription system

Stephen M Chu, Daniel Povey, Hong-Kwang Kuo, Lidia Mangu, więcej

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 4374 - 4377

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

This paper gives an up-to-date description of the IBM Mandarin broadcast transcription system developed under the DARPA GALE program. Technical advances over our previous system include a novel acoustic modeling approach using subspace Gaussian mixture models, a speaking rate adaptation method using frame rate normalization, and an effective recipe for lattice combination. We present results on three...

rozdział

Chinese prosodic phrasing with the source-channel model

Honghui Dong, Yong Qin, Limin Jia

2009 Chinese Control and Decision Conference > 6168 - 6171

2009 Chinese Control and Decision Conference (CCDC 2009)

The prosodic phrasing is a classic problem in nature language process, which is not only useful for text-to-speech(TTS), but for speech recognition, statistic machine learning etc.. This paper introduces and discusses the source-channel model for Chinese prosodic phrasing. Based on the basic idea, the hidden Markov model (HMM) and the improved source-channel model are both used to describe the phrasing...

rozdział

Utterance verification using improved confidence measures based on alignment confusion rate in Chinese digits recognition

Shilei Zhang, Danning Jiang, Yong Qin

2009 IEEE International Conference on Acoustics, Speech and Signal Processing > 1309 - 1312

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

In this paper, we explore an approach to improved confidence measures based on a novel alignment confusion rate (ACR) which integrates alignment information from two different modeling unit sets in Chinese digits recognition system. Both initial-final (IF) phone set and head-body-tail (HBT) models have proven to obtain good recognition performance for connected digit strings. These two different modeling...

rozdział

Main vowel domain tone modeling with lexical and prosodic analysis for Mandarin ASR

Shilei Zhang, Qin Shi, S.M. Chu, Yong Qin

2009 IEEE International Conference on Acoustics, Speech and Signal Processing > 4561 - 4564

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

The tone is a distinctive discriminative feature in Mandarin Chinese. Often functional, yet seldom thorough are most large-scale Mandarin speech recognition systems in treating tone modeling. In particular, many lack the necessary sophistication to deal with the myriad variations arising from the combination of acoustic and lexical contexts. This paper reports an attempt to account for these variabilities...

Opcje filtrowania

Słowa kluczowe:
SPEECH

Data publikacji

Ustaw własny zakres dat

Słowa kluczowe

HIDDEN MARKOV MODELS (6)
SPEECH RECOGNITION (6)
NATURAL LANGUAGE PROCESSING (5)
ACOUSTICS (4)
LATTICES (3)
TRAINING (3)
COMPUTATIONAL MODELING (2)
CONFIDENCE MEASURE (2)
CONTEXT (2)
HIDDEN MARKOV MODEL (2)
HMM (2)
KEYWORD SPOTTING (2)
SPEECH PROCESSING (2)
VOCABULARY (2)
ACCURACY (1)
ACOUSTIC ANALYSIS (1)
ACOUSTIC CONTEXT (1)
ACOUSTIC KEYWORD SPOTTING (1)
ACOUSTIC KWS METHOD (1)
ACOUSTIC MODELING APPROACH (1)
ACOUSTIC SIGNAL PROCESSING (1)
ADAPTATION MODEL (1)
ALIGNMENT CONFUSION RATE (1)
ANALYTICAL MODELS (1)
AUTOMATIC PRONUNCIATION TRANSLITERATION (1)
CAR-KIT MICROPHONE (1)
CFRN (1)
CHARACTER ERROR RATE (1)
CHINESE CHARACTERS (1)
CHINESE DIGITS RECOGNITION (1)
CHINESE PROSODIC PHRASING (1)
CHINESE-ENGLISH MIXED LANGUAGE KEYWORD SPOTTING (1)
CONFUSION GARBAGE MODEL (1)
CONNECTED-DIGITS SET (1)
CONSORTIUM DEFINED TEST SETS (1)
CONTEXT MODELING (1)
CONTEXT-DEPENDENT MODEL (1)
CONTEXTUAL ANALYSIS (1)
CONVERSATIONAL TELEPHONE DATASET (1)
DARPA GALE PROGRAM (1)
DECISION TREE (1)
DECISION TREES (1)
DECODING (1)
DISCRIMINATIVE TRAINING (1)
ENTROPY (1)
FALSE ALARM RATE (1)
FEATURE EXTRACTION (1)
FOOT (1)
FOOT-PATTERN MODEL (1)
FRAME RATE NORMALIZATION (1)
GAUSSIAN PROCESSES (1)
HEAD-BODY-TAIL MODELS (1)
HETEROJUNCTION BIPOLAR TRANSISTORS (1)
HMM-BASED CM METHOD (1)
HMM-BASED CONFIDENCE MEASURE METHOD (1)
HMM-BASED MANDARIN TEXT TO SPEECH SYSTEM (1)
IBM GALE MANDARIN BROADCAST TRANSCRIPTION SYSTEM (1)
IBM MANDARIN BROADCAST TRANSCRIPTION SYSTEM (1)
INITIAL-FINAL PHONE SET (1)
IP NETWORKS (1)
LANGUAGE TRANSLATION (1)
LARGE-VOCABULARY BROADCAST TRANSCRIPTION (1)
LATTICE RESCORING (1)
LEARNING (ARTIFICIAL INTELLIGENCE) (1)
LEXICAL ANALYSIS (1)
LEXICAL CONTEXT (1)
LINGUISTICS (1)
MAIN VOWEL (1)
MAIN VOWEL DOMAIN TONE MODELING (1)
MANDARIN (1)
MANDARIN ASR (1)
MANDARIN BROADCAST SPEECH TRANSCRIPTION (1)
MANDARIN CHINESE (1)
MANDARIN CONVERSATIONAL SPEECH (1)
MANDARIN KEYWORD SPOTTING (1)
MANDARIN SPEECH RECOGNITION SYSTEM (1)
MARKOV PROCESSES (1)
MAXIMUM ENTROPY METHODS (1)
MAXIMUM ENTROPY MODEL (1)
MEASUREMENT (1)
MICROPHONES (1)
MIXED LANGUAGE (1)
NATURE LANGUAGE PROCESS AREA (1)
POSTERIOR PROBABILITY (1)
PROBABILITY (1)
PRONUNCIATION CONVERSION (1)
PROSODIC ANALYSIS (1)
PROSODIC PHRASING (1)
RHYTHM (1)
RHYTHM MODEL (1)
ROBUST TONE TRACKING (1)
SIMILAR PRONUNCIATION (1)
SIMILAR PRONUNCIATION WORDS (1)
SOURCE-CHANNEL MODEL (1)
SPEAKING RATE ADAPTATION (1)
SPEECH SYNTHESIS (1)
STATE-OF-THE-ART SPEECH RECOGNITION TECHNOLOGY (1)
STATISTIC MACHINE LEARNING (1)
SUBSPACE GAUSSIAN MIXTURE MODELS (1)
więcej

INFONA - portal komunikacji naukowej

Wyniki wyszukiwania dla: Yong Qin

Improved Mandarin Keyword Spotting Using Confusion Garbage Model

Automatic Pronunciation Transliteration for Chinese-English Mixed Language Keyword Spotting

Comparison of Syllable/Phone HMM Based Mandarin TTS

The 2009 IBM GALE Mandarin broadcast transcription system

Chinese prosodic phrasing with the source-channel model

Utterance verification using improved confidence measures based on alignment confusion rate in Chinese digits recognition

Main vowel domain tone modeling with lexical and prosodic analysis for Mandarin ASR

Dodaj adresata

Anulowanie wysłania wiadomości

Czy na pewno chcesz anulować wysłanie wiadomości?

Wyślij wiadomość

Opcje filtrowania

Data publikacji

Ustawianie zakresu dat

Podaj zakres dat dla filtrowania wyświetlonych wyników. Możesz podać datę początkową, końcową lub obie daty. Daty możesz wpisać ręcznie lub wybrać za pomocą kalendarza.

Słowa kluczowe

Zgłaszanie błędu / nadużycia

Nieudane wysłanie zgłoszenia

Ułatwienia dostępu