Search results for: Lei Xie

Items from 1 to 5 out of 5 results

chapter

On the training of DNN-based average voice model for speech synthesis

Shan Yang, Zhizheng Wu, Lei Xie

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 6

Adaptability and controllability are the major advantages of statistical parametric speech synthesis (SPSS) over unit-selection synthesis. Recently, deep neural networks (DNNs) have significantly improved the performance of SPSS. However, current studies are mainly focusing on the training of speaker-dependent DNNs, which generally requires a significant amount of data from a single speaker. In this...

chapter

A waveform representation framework for high-quality statistical parametric speech synthesis

Bo Fan, Siu Wa Lee, Xiaohai Tian, Lei Xie, et al.

2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 530 - 536

State-of-the-art statistical parametric speech synthesis (SPSS) generally uses a vocoder to represent speech signals and parameterize them into features for subsequent modeling. Magnitude spectrum has been a dominant feature over the years. Although perceptual studies have shown that phase spectrum is essential to the quality of synthesized speech, it is often ignored by using a minimum phase filter...

chapter

Automatic prosody prediction for Chinese speech synthesis using BLSTM-RNN and embedding features

Chuang Ding, Lei Xie, Jie Yan, Weini Zhang, et al.

2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU) > 98 - 102

Prosody affects the naturalness and intelligibility of speech. However, automatic prosody prediction from text for Chinese speech synthesis is still a great challenge and the traditional conditional random fields (CRF) based method always heavily relies on feature engineering. In this paper, we propose to use neural networks to predict prosodic boundary labels directly from Chinese characters without...

chapter

Speech and Auditory Interfaces for Ubiquitous, Immersive and Personalized Applications

Lei Xie, Wenhuai Zhao, Xiangzeng Zhou, Xiaohai Tian, et al.

2010 7th International Conference on Ubiquitous Intelligence & Computing and 7th International Conference on Autonomic & Trusted Computing (UIC/ATC 2010) > 503 - 505

In this demonstration, we introduce our recent progress on speech and auditory technologies for potential ubiquitous, immersive and personalized applications. The first demo shows an intelligent spoken question answering system, which enables users to interact with a talking avatar via natural speech dialogues. The prototype system demonstrates our latest development on automatic speech recognition,...

chapter

Lip Assistant: Visualize Speech for Hearing Impaired People in Multimedia Services

Lei Xie, Yi Wang, Zhi-Qiang Liu

2006 IEEE International Conference on Systems, Man and Cybernetics > 5 > 4331 - 4336

This paper presents a very low bit rate speech-to-video synthesizer, named lip assistant, to help hearing impaired people to better access multimedia services via lipreading. Lip assistant can automatically convert acoustic speech to lip parameters with a bit rate of 2.2 kbps, and decode them to video-realistic mouth animation on the fly. We use multi-stream HMMs (MSHMMs) and the principal component...

Filter options

Keywords:
SPEECH SYNTHESIS

INFONA - science communication portal
