Search results for: Yuan Dong

Items from 1 to 5 out of 5 results

chapter

Sports audio segmentation and classification

Jun Huang, Yuan Dong, Jiqing Liu, Chengyu Dong, more

2009 IEEE International Conference on Network Infrastructure and Digital Content > 379 - 383

2009 IEEE International Conference on Network Infrastructure and Digital Content (IC-NIDC 2009)

The audio stream is an important component of a sports video. In this paper, we present a system for audio segmentation and classification, which can segment and classify the sports audio stream into speech, non-speech very well. The novel point in our research is that we apply the segmentation and clustering method which is often used in speaker diarization system for broadcast news to the analysis...

chapter

A Three-Stage Text Normalization Strategy for Mandarin Text-to-Speech Systems

Tao Zhou, Yuan Dong, Dezhi Huang, Wu Liu, more

2008 6th International Symposium on Chinese Spoken Language Processing > 1 - 4

2008 6th International Symposium on Chinese Spoken Language Processing

Text normalization is an important component in mandarin Text-to-Speech system. This paper develops a taxonomy of Non-Standard Words (NSW's) based on a Large-scale Chinese corpus and proposes a three-stage text normalization strategy: Finite State Automata (FSA) for initial classification, Maximum Entropy (ME) Classifier & Rules for further classification and General Rules for standard word conversion...

chapter

Selecting optimal non-uniform units for hierarchical unit selection

Jun Xu, Dezhi Huang, Yuan Dong, Lianhong Cai, more

2008 International Conference on Audio, Language and Image Processing > 1610 - 1614

2008 International Conference on Audio, Language and Image Processing

For concatenative speech synthesis based on non-uniform unit selection, the key to improve the synthetic quality is the careful designing of measuring criteria respect to the units adopted. With our previous hierarchical non-uniform unit selection framework (Xu et al., 2007), two measurements for selecting optimal non-uniform units during searching at different layers are proposed in this paper, including...

chapter

A Comparative Study of Diverse Knowledge Sources and Smoothing Techniques via Maximum Entropy for Polyphone Disambiguation in Mandarin TTS Systems

Xinnian Mao, Yuan Dong, Jinyu Han, Haila Wang

2007 International Conference on Natural Language Processing and Knowledge Engineering > 162 - 169

2007 IEEE International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE '07)

This paper comparatively evaluated various knowledge sources and smoothing algorithms for pronunciation disambiguation in Mandarin TTS (text-to-speech) systems under maximum entropy (maxent) framework. In particular, five kinds of knowledge sources, namely characters and their pronunciations, words, their pronunciations and part-of-speech. together with two smoothing algorithms, i.e. Gaussian prior...

chapter

Svm-Based Speaker Verification by Location in the Space of Reference Speakers

Xianyu Zhao, Yuan Dong, Hao Yang, Jian Zhao, more

2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '7 > 4 > IV-281 - IV-284

2007 IEEE International Conference on Acoustics, Speech, and Signal Processing

In this paper, we investigate SVM-based speaker verification by location in the space of reference speakers. Speaker location is represented by a vector of log-likelihoods of utterance data given reference speaker models. Channel or session variability in speaker locations due to microphone, acoustic environments etc. would impair verification performance. To reduce such variability, Within-Class...

Filter options

Keywords:
SPEECH PROCESSING

Publication date

Set your own date range

Content availability

Available (4)
None (1)

Keywords

SPEECH (3)
CONFERENCES (2)
ACOUSTIC MEASUREMENTS (1)
AUDIO SEGMENTATION AND CLASSIFICATION (1)
AUDIO STREAM (1)
AUDIO STREAMING (1)
AUTOMATA (1)
BAYESIAN INFORMATION CRITERION CLUSTERING (1)
CHARACTER-BASED FEATURES (1)
CHINESE POLYPHONES (1)
CLASSIFICATION ALGORITHMS (1)
COLON (1)
CONCATENATIVE SPEECH SYNTHESIS (1)
CONTENT ANALYSIS (1)
COST FUNCTION (1)
COVARIANCE MATRIX (1)
DISTANCE MEASUREMENT (1)
DIVERSE KNOWLEDGE SOURCES (1)
ENGINES (1)
ENTROPY (1)
FINITE STATE AUTOMATA (1)
GAUSSIAN MIXTURE MODEL (1)
GAUSSIAN PRIOR ALGORITHM (1)
GAUSSIAN PROCESSES (1)
GMM (1)
INEQUALITY ALGORITHM (1)
INTERSYLLABLE PITCH CONTROL (1)
LARGE-SCALE CHINESE CORPUS (1)
LEARNING SYSTEMS (1)
MACHINE LEARNING METHOD (1)
MANDARIN TEXT-TO-SPEECH SYSTEMS (1)
MANDARIN TTS SYSTEMS (1)
MAXENT CLASSIFIER (1)
MAXENT FRAMEWORK (1)
MAXIMUM ENTROPY (1)
MAXIMUM ENTROPY CLASSIFIER (1)
MAXIMUM ENTROPY METHODS (1)
MUSIC (1)
NON-STANDARD WORDS (1)
OPTIMAL NONUNIFORM UNIT SELECTION (1)
PHONETICS (1)
POLYPHONE DISAMBIGUATION (1)
PRONUNCIATION DISAMBIGUATION (1)
ROBUSTNESS (1)
SESSION VARIABILITY (1)
SMOOTHING ALGORITHMS (1)
SMOOTHING METHODS (1)
SMOOTHING TECHNIQUES (1)
SPEAKER DIARIZATION SYSTEM (1)
SPEAKER LOCATION (1)
SPEAKER RECOGNITION (1)
SPECTRA DISTANCE (1)
SPECTRAL ANALYSIS (1)
SPEECH RECOGNITION (1)
SPEECH SYNTHESIS (1)
SPORTS AUDIO (1)
SPORTS AUDIO SEGMENTATION (1)
SUPPORTING VECTOR MACHINES (1)
TAXONOMY (1)
TEXT ANALYSIS (1)
TEXT-TO-SPEECH SYSTEM (1)
THREE-STAGE TEXT NORMALIZATION STRATEGY (1)
TRANSFORM-BASED ERROR-DRIVEN LEARNING ALGORITHM (1)
VOICELESS CONSONANT (1)
more

INFONA - science communication portal

Search results for: Yuan Dong

Sports audio segmentation and classification

A Three-Stage Text Normalization Strategy for Mandarin Text-to-Speech Systems

Selecting optimal non-uniform units for hierarchical unit selection

A Comparative Study of Diverse Knowledge Sources and Smoothing Techniques via Maximum Entropy for Polyphone Disambiguation in Mandarin TTS Systems

Svm-Based Speaker Verification by Location in the Space of Reference Speakers

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options