Search results for: Haila Wang

Items from 1 to 5 out of 5 results

chapter

Sports audio segmentation and classification

Jun Huang, Yuan Dong, Jiqing Liu, Chengyu Dong, more

2009 IEEE International Conference on Network Infrastructure and Digital Content > 379 - 383

2009 IEEE International Conference on Network Infrastructure and Digital Content (IC-NIDC 2009)

The audio stream is an important component of a sports video. In this paper, we present a system for audio segmentation and classification, which can segment and classify the sports audio stream into speech, non-speech very well. The novel point in our research is that we apply the segmentation and clustering method which is often used in speaker diarization system for broadcast news to the analysis...

chapter

The effect of language factors for robust speaker recognition

Liang Lu, Yuan Dong, Xianyu Zhao, Jiqing Liu, more

2009 IEEE International Conference on Acoustics, Speech and Signal Processing > 4217 - 4220

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

From the results of the NIST speaker recognition evaluation in resent years, speaker recognition systems which are mainly developed based on English training data suffer the language gap problem, namely, the performance of non-English trails is much worse than that of English trails. This problem is addressed in this paper. Based on the conventional joint factor analysis model, we enrolled in the...

chapter

Eigenchannel Compensation and Symmetric Score for Robust Text-Independent Speaker Verification

Yuan Dong, Jian Zhao, Liang Lu, Jiqing Lui, more

2008 6th International Symposium on Chinese Spoken Language Processing > 1 - 4

2008 6th International Symposium on Chinese Spoken Language Processing

The negative effect of the session variability has become more and more severe for the performance of the speaker verification system. This paper discusses the eigenchannel compensation and investigates the symmetric scoring method to diminish the session variability and further enhance the performance. Experiments were conducted on the core tests of the 2006 and 2008 speaker recognition evaluation...

chapter

A Three-Stage Text Normalization Strategy for Mandarin Text-to-Speech Systems

Tao Zhou, Yuan Dong, Dezhi Huang, Wu Liu, more

2008 6th International Symposium on Chinese Spoken Language Processing > 1 - 4

2008 6th International Symposium on Chinese Spoken Language Processing

Text normalization is an important component in mandarin Text-to-Speech system. This paper develops a taxonomy of Non-Standard Words (NSW's) based on a Large-scale Chinese corpus and proposes a three-stage text normalization strategy: Finite State Automata (FSA) for initial classification, Maximum Entropy (ME) Classifier & Rules for further classification and General Rules for standard word conversion...

chapter

Selecting optimal non-uniform units for hierarchical unit selection

Jun Xu, Dezhi Huang, Yuan Dong, Lianhong Cai, more

2008 International Conference on Audio, Language and Image Processing > 1610 - 1614

2008 International Conference on Audio, Language and Image Processing

For concatenative speech synthesis based on non-uniform unit selection, the key to improve the synthetic quality is the careful designing of measuring criteria respect to the units adopted. With our previous hierarchical non-uniform unit selection framework (Xu et al., 2007), two measurements for selecting optimal non-uniform units during searching at different layers are proposed in this paper, including...

Filter options

Keywords:
SPEECH

Publication date

Set your own date range

Keywords

SPEECH PROCESSING (3)
CONFERENCES (2)
EIGENVALUES AND EIGENFUNCTIONS (2)
NIST (2)
SPEAKER RECOGNITION (2)
SPEECH RECOGNITION (2)
ACOUSTIC MEASUREMENTS (1)
ADAPTATION MODEL (1)
ANALYTICAL MODELS (1)
AUDIO SEGMENTATION AND CLASSIFICATION (1)
AUDIO STREAM (1)
AUDIO STREAMING (1)
AUTOMATA (1)
BAYESIAN INFORMATION CRITERION CLUSTERING (1)
CLASSIFICATION ALGORITHMS (1)
COLON (1)
CONCATENATIVE SPEECH SYNTHESIS (1)
CONTENT ANALYSIS (1)
COST FUNCTION (1)
COVARIANCE MATRIX (1)
DISTANCE MEASUREMENT (1)
EIGENCHANNEL COMPENSATION (1)
EIGENCHANNELS (1)
ENGINES (1)
ENTROPY (1)
EQUAL ERROR RATE (1)
FINITE STATE AUTOMATA (1)
GAUSSIAN MIXTURE MODEL (1)
GMM (1)
INTERSYLLABLE PITCH CONTROL (1)
JOINT FACTOR ANALYSIS (1)
JOINTS (1)
LANGUAGE FACTOR COMPENSATION (1)
LARGE-SCALE CHINESE CORPUS (1)
LEARNING SYSTEMS (1)
MACHINE LEARNING METHOD (1)
MANDARIN TEXT-TO-SPEECH SYSTEMS (1)
MAXIMUM ENTROPY CLASSIFIER (1)
MICROPHONES (1)
MUSIC (1)
NON-STANDARD WORDS (1)
OPTIMAL NONUNIFORM UNIT SELECTION (1)
PHONETICS (1)
ROBUST SPEAKER RECOGNITION (1)
ROBUST TEXT INDEPENDENT (1)
ROBUSTNESS (1)
SESSION VARIABILITY (1)
SILICON (1)
SPEAKER DIARIZATION SYSTEM (1)
SPEAKER VERIFICATION (1)
SPECTRA DISTANCE (1)
SPECTRAL ANALYSIS (1)
SPEECH SYNTHESIS (1)
SPORTS AUDIO (1)
SPORTS AUDIO SEGMENTATION (1)
SYMMETRIC SCORING METHOD (1)
TAXONOMY (1)
TESTING (1)
TEXT ANALYSIS (1)
THREE-STAGE TEXT NORMALIZATION STRATEGY (1)
TRAINING (1)
VOICELESS CONSONANT (1)
more

INFONA - science communication portal

Search results for: Haila Wang

Sports audio segmentation and classification

The effect of language factors for robust speaker recognition

Eigenchannel Compensation and Symmetric Score for Robust Text-Independent Speaker Verification

A Three-Stage Text Normalization Strategy for Mandarin Text-to-Speech Systems

Selecting optimal non-uniform units for hierarchical unit selection

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options