Search results for: Ru Wang

Items from 1 to 4 out of 4 results

article

Speaker Adaptation of SR-HPM for Speaking Rate-Controlled Mandarin TTS

I-Bin Liao, Chen-Yu Chiang, Yih-Ru Wang, Sin-Horng Chen

IEEE/ACM Transactions on Audio, Speech, and Language Processing > 2016 > 24 > 11 > 2046 - 2058

In this paper, a structural maximum a posteriori (SMAP) speaker adaptation approach to adjusting the speaking rate (SR)-dependent hierarchical prosodic model (SR-HPM) of an existing SR-controlled Mandarin text-to-speech system to a new speaker's data for producing a new voice is discussed. Two main issues are addressed. One is the small SR coverage of the adaptation data and is solved by using the...

chapter

A preliminary study on applying character-level RNNLM to text-to-speech system

Yuan-Fu Liao, Kuan-Hung Chen, Yih-Ru Wang

2016 Conference of The Oriental Chapter of International Committee for Coordination and Standardization of Speech Databases and Assessment Techniques (O-COCOSDA) > 68 - 72

2016 Conference of The Oriental Chapter of International Committee for Coordination and Standardization of Speech Databases and Assessment Techniques (O-COCOSDA)

High quality linguistic features extraction is the key component to the success of speech synthesis. However, traditional linguistic feature extraction methods are all based on natural language processing (NLP) frontends that relies heavily on feature engineering and ignores backend synthesis errors in feedback. In order to approach the goal of establishing an end-to-end speech synthesis system, in...

chapter

An investigation on the Mandarin prosody of a parallel multi-speaking rate speech corpus

Chen-Yu Chiang, Cheng-Chang Tang, Hsiu-Min Yu, Yih-Ru Wang, more

2009 Oriental COCOSDA International Conference on Speech Database and Assessments > 148 - 153

2009 Oriental COCOSDA International Conference on Speech Database and Assessments

In this paper, the prosody of a parallel multispeaking rate Mandarin read speech corpus is investigated. The corpus contains four parallel speech datasets uttered by a female professional announcer with various speech rates (SRs) of 4.40 (fast), 3.82 (normal), 2.97 (median) and 2.45 (slow) syllables/second. By using the unsupervised joint prosody labeling and modeling (PLM) method proposed previously,...

chapter

Exploration of high-level prosodic patterns for continuous mandarin speech

Chen-Yu Chiang, Hsiu-Min Yu, Yih-Ru Wang, Sin-Horng Chen

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 3977 - 3980

ICASSP 2008. IEEE International Conference on Acoustic, Speech and Signal Processes

In this paper, the high-level prosodic patterns of prosodic word (PW), prosodic phrase (PPh) and breath group/prosodic phrase group (BQ/PQ) for syllable pitch-level and duration are explored using an automatic joint prosody labeling and modeling method. Experimental results on a treebank speech corpus showed that the explored high-level prosodic patterns not only matched well with our a priori knowledge...

Filter options

Keywords:
SPEECH SYNTHESIS

Publication date

Set your own date range

Publication type

book (3)
article (1)

Keywords

HIDDEN MARKOV MODELS (2)
SPEECH (2)
SPEECH PROCESSING (2)
ADAPTATION MODELS (1)
ANALYTICAL MODELS (1)
AUTOMATIC JOINT PROSODY LABELING (1)
AUTOMATIC SPEECH RECOGNITION (1)
BREAK LABEL (1)
BREATH GROUP (1)
COMPONENT (1)
CONTINUOUS MANDARIN SPEECH (1)
DATA MINING (1)
DATA MODELS (1)
FEATURE EXTRACTION (1)
FEMALE PROFESSIONAL ANNOUNCER (1)
HIERARCHICAL PROSODIC MODEL (1)
HIGH LEVEL PROSODIC CONSTITUENTS PATTERN (1)
HIGH-LEVEL PROSODIC PATTERNS (1)
LABELING (1)
LINEAR REGRESSION (1)
LINGUISTIC FEATURES (1)
MANDARIN PROSODY (1)
MANDARIN PROSODY HIERARCHY (1)
MANDARIN PROSODY INVESTIGATION (1)
NATURAL LANGUAGES (1)
PARALLEL MULTISPEAKING RATE MANDARIN READ SPEECH CORPUS (1)
PARALLEL SPEECH DATASET (1)
PAUSE DURATION (1)
PLM METHOD (1)
PRAGMATICS (1)
PROSODIC PHRASE GROUP (1)
PROSODIC WORD (1)
PROSODY GENERATION MECHANISM (1)
PROSODY MODELING (1)
RECURRENT NEURAL NETWORKS (1)
SPEAKER ADAPTATION (1)
SPEAKING RATE COVERAGE (1)
SPEAKING RATE-CONTROLLED TEXT-TO-SPEECH (1)
STRONTIUM (1)
STRUCTURAL MAXIMUM A POSTERIORI (1)
SYLLABLE PITCH-LEVEL (1)
TEXT-TO-SPEECH MODELLING (1)
UNSUPERVISED JOINT PROSODY MODELING (1)
VARYING SPEECH RATE (1)
WORD EMBEDDINGS (1)
more

INFONA - science communication portal

Search results for: Ru Wang

Speaker Adaptation of SR-HPM for Speaking Rate-Controlled Mandarin TTS

A preliminary study on applying character-level RNNLM to text-to-speech system

An investigation on the Mandarin prosody of a parallel multi-speaking rate speech corpus

Exploration of high-level prosodic patterns for continuous mandarin speech

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options