Search results for: Ru Wang

Items from 1 to 9 out of 9 results

chapter

Rich prosodic information exploration on spontaneous Mandarin speech

Cheng-Hsien Lin, Chung-Long You, Chen-Yu Chiang, Yih-Ru Wang, more

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP) > 1 - 5

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP)

In this paper, rich prosodic information of spontaneous Mandarin speech is explored. The joint prosody labeling and modeling algorithm proposed previously for read speech is extended to spontaneous-speech prosody modeling by additionally considering the modeling of disfluency speech parts. It trains a hierarchical prosodic model and performs prosody labeling from a large speech corpus automatically...

article

Modeling of Speaking Rate Influences on Mandarin Speech Prosody and Its Application to Speaking Rate-controlled TTS

Sin-Horng Chen, Chiao-Hua Hsieh, Chen-Yu Chiang, Hsi-Chun Hsiao, more

IEEE/ACM Transactions on Audio, Speech, and Language Processing > 2014 > 22 > 7 > 1158 - 1171

A new data-driven approach to building a speaking rate-dependent hierarchical prosodic model (SR-HPM), directly from a large prosody-unlabeled speech database containing utterances of various speaking rates, to describe the influences of speaking rate on Mandarin speech prosody is proposed. It is an extended version of the existing HPM model which contains 12 sub-models to describe various relationships...

chapter

A study on Hakka and mixed Hakka-Mandarin speech recognition

Tsai-Lu Tsai, Chen-Yu Chiang, Hsiu-Min Yu, Lieh-Shih Lo, more

2010 7th International Symposium on Chinese Spoken Language Processing > 199 - 204

7th International Symposium on Chinese Spoken Language Processing (ISCSLP 2010)

A first study on Hakka and mixed Hakka-Mandarin speech recognition (SR) is reported in this paper. The main focus of the study is on solving the problem of the lack of a large text corpus for training a reliable language model. In the Hakka SR, several methods to use the information of part of speech and Hakka-Chinese word translation to assist in language modeling are proposed. For mixed language...

chapter

An investigation on the Mandarin prosody of a parallel multi-speaking rate speech corpus

Chen-Yu Chiang, Cheng-Chang Tang, Hsiu-Min Yu, Yih-Ru Wang, more

2009 Oriental COCOSDA International Conference on Speech Database and Assessments > 148 - 153

2009 Oriental COCOSDA International Conference on Speech Database and Assessments

In this paper, the prosody of a parallel multispeaking rate Mandarin read speech corpus is investigated. The corpus contains four parallel speech datasets uttered by a female professional announcer with various speech rates (SRs) of 4.40 (fast), 3.82 (normal), 2.97 (median) and 2.45 (slow) syllables/second. By using the unsupervised joint prosody labeling and modeling (PLM) method proposed previously,...

chapter

A New Similarity Measure Between HMMS

Yih-Ru Wang

2008 6th International Symposium on Chinese Spoken Language Processing > 1 - 4

2008 6th International Symposium on Chinese Spoken Language Processing

In this paper, a new similarity measure between HMM models which extended the well-known Kullback-Leibler distance was proposed. The Kullback-Leibler distance was defined as the mean of log-likelihood ratio (LLR) in a hypotheses test and the Kullback-Leibler distance was frequently used as a similarity measure for HMM models. Here, the standard deviation of LLR between HMM models was deviated first...

chapter

Prosodic Modeling for Isolated Mandarin Words and its Application

Hung-Kuang Shih, Chen-Yu Chiang, Yih-Ru Wang, Sin-Horng Chen

2008 6th International Symposium on Chinese Spoken Language Processing > 1 - 4

2008 6th International Symposium on Chinese Spoken Language Processing

In this paper, a new approach to syllable-based modeling of FO contour, duration and energy for isolated Mandarin words is proposed. The syllable FO contour model considers three major affecting factors, including lexical tone, syllable position in a word and inter-syllable coarticulation effect; while both the duration and energy models additionally consider one more affecting factor of base syllable...

chapter

The signal change-point detection using the high-order statistics of log-likelihood difference functions

Yih-Ru Wang

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 4381 - 4384

ICASSP 2008. IEEE International Conference on Acoustic, Speech and Signal Processes

In this paper, a supervised neural network based signal change-point detector is proposed. The proposed detector uses some high order statistics of log-likelihood difference functions as the input features in order to improve the detection performance. These high order statistics can be easily calculated from the CCGMM coefficients of signals. Performance of the proposed signal change-point detector...

chapter

Exploration of high-level prosodic patterns for continuous mandarin speech

Chen-Yu Chiang, Hsiu-Min Yu, Yih-Ru Wang, Sin-Horng Chen

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 3977 - 3980

ICASSP 2008. IEEE International Conference on Acoustic, Speech and Signal Processes

In this paper, the high-level prosodic patterns of prosodic word (PW), prosodic phrase (PPh) and breath group/prosodic phrase group (BQ/PQ) for syllable pitch-level and duration are explored using an automatic joint prosody labeling and modeling method. Experimental results on a treebank speech corpus showed that the explored high-level prosodic patterns not only matched well with our a priori knowledge...

chapter

Latent Prosody Model of Continuous Mandarin Speech

Chen-Yu Chiang, Xiao-Dong Wang, Yuan-Fu Liao, Yih-Ru Wang, more

2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '7 > 4 > IV-625 - IV-628

2007 IEEE International Conference on Acoustics, Speech, and Signal Processing

The major difficulty of prosody modeling and automatic tone recognition of continuous Mandarin speech is the complex interaction of tones and prosody/intonation on FO contours. In this study, we propose a latent prosody model (LPM) aiming to jointly model the affections of tone and prosody state on FO. The main purposes are twofold including (1) automatic prosody state labeling and (2) improving tone...

Filter options

Keywords:
SPEECH PROCESSING

Publication date

Set your own date range

Publication type

book (8)
article (1)

Keywords

SPEECH (5)
SPEECH RECOGNITION (3)
ENERGY STATES (2)
HIDDEN MARKOV MODELS (2)
LABELING (2)
PRAGMATICS (2)
PROSODY MODELING (2)
SPEECH SYNTHESIS (2)
TRAINING (2)
ACOUSTIC MEASUREMENTS (1)
ACOUSTIC MODEL (1)
ACOUSTIC SIGNAL DETECTION (1)
ACOUSTICS (1)
ANALYTICAL MODELS (1)
ARTIFICIAL NEURAL NETWORKS (1)
AUDIO SIGNAL PROCESSING (1)
AUTOMATIC JOINT PROSODY LABELING (1)
AUTOMATIC SPEECH RECOGNITION (1)
BREAK LABEL (1)
BREATH GROUP (1)
CCGMM-BASED DIVERGENCE MEASURE (1)
CONTEXT MODELING (1)
CONTINUOUS MANDARIN SPEECH (1)
CONTOUR MODEL (1)
DATA MINING (1)
DATA MODELS (1)
DATABASES (1)
DECODING (1)
DISFLUENCY EVENT (1)
DISTANCE MEASUREMENT (1)
EQUAL ERROR RATE (1)
ESTIMATION (1)
FEATURE EXTRACTION (1)
FEMALE PROFESSIONAL ANNOUNCER (1)
HAKKA (1)
HAKKA CHINESE WORD TRANSLATION (1)
HAKKA MANDARIN ACOUSTIC MODEL (1)
HAKKA MANDARIN SPEECH RECOGNITION (1)
HIDDEN MARKOV MODEL (1)
HIGH LEVEL PROSODIC CONSTITUENTS PATTERN (1)
HIGH-LEVEL PROSODIC PATTERNS (1)
HIGH-ORDER STATISTICS (1)
HIGHER ORDER STATISTICS (1)
HMMS SIMILARITY MEASURE (1)
IEEE TRANSACTIONS (1)
INTER SYLLABLE COARTICULATION EFFECT (1)
ISOLATED MANDARIN WORDS (1)
KULLBACK-LEIBLER DISTANCE (1)
LANGUAGE MODEL (1)
LANGUAGE MODELING (1)
LANGUAGE TRANSLATION (1)
LEARNING SYSTEMS (1)
LEXICAL TONE (1)
LOG-LIKELIHOOD DIFFERENCE FUNCTIONS (1)
LOG-LIKELIHOOD RATIO (1)
MAINTENANCE ENGINEERING (1)
MANDARIN (1)
MANDARIN PROSODY (1)
MANDARIN PROSODY HIERARCHY (1)
MANDARIN PROSODY INVESTIGATION (1)
MANDARIN PROSODY MODELING (1)
MANDARIN SPEECH DATABASE (1)
NATURAL LANGUAGE PROCESSING (1)
NATURAL LANGUAGES (1)
NEURAL NETS (1)
NORMATIVE SPEAKERS (1)
PARALLEL MULTISPEAKING RATE MANDARIN READ SPEECH CORPUS (1)
PARALLEL SPEECH DATASET (1)
PAUSE DURATION (1)
PLM METHOD (1)
PROSODIC INFORMATION (1)
PROSODIC MODELING (1)
PROSODIC PHRASE GROUP (1)
PROSODIC WORD (1)
PROSODY GENERATION MECHANISM (1)
PROSODY LABELING (1)
SIGNAL CHANGE-POINT DETECTION (1)
SPEAKING RATE MODELING (1)
SPEAKING RATE-CONTROLLED TTS (1)
SPONTANEOUS MANDARIN SPEECH (1)
STANDARDS (1)
STRONTIUM (1)
SUPERVISED NEURAL NETWORK (1)
SYLLABLE BASED MODELING (1)
SYLLABLE PITCH-LEVEL (1)
TEXT-TO-SPEECH MODELLING (1)
TONE RECOGNITION (1)
TV BROADCAST NEWS (1)
UNSUPERVISED JOINT PROSODY MODELING (1)
VARYING SPEECH RATE (1)
more

INFONA - science communication portal

Search results for: Ru Wang

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options