Wyniki wyszukiwania dla: Lei Xie

Pozycje od 1 do 5 spośród 5 wyników

rozdział

On the training of DNN-based average voice model for speech synthesis

Shan Yang, Zhizheng Wu, Lei Xie

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 6

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

Adaptability and controllability are the major advantages of statistical parametric speech synthesis (SPSS) over unit-selection synthesis. Recently, deep neural networks (DNNs) have significantly improved the performance of SPSS. However, current studies are mainly focusing on the training of speaker-dependent DNNs, which generally requires a significant amount of data from a single speaker. In this...

rozdział

A waveform representation framework for high-quality statistical parametric speech synthesis

Bo Fan, Siu Wa Lee, Xiaohai Tian, Lei Xie, więcej

2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 530 - 536

2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

State-of-the-art statistical parametric speech synthesis (SPSS) generally uses a vocoder to represent speech signals and parameterize them into features for subsequent modeling. Magnitude spectrum has been a dominant feature over the years. Although perceptual studies have shown that phase spectrum is essential to the quality of synthesized speech, it is often ignored by using a minimum phase filter...

rozdział

Automatic prosody prediction for Chinese speech synthesis using BLSTM-RNN and embedding features

Chuang Ding, Lei Xie, Jie Yan, Weini Zhang, więcej

2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU) > 98 - 102

2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)

Prosody affects the naturalness and intelligibility of speech. However, automatic prosody prediction from text for Chinese speech synthesis is still a great challenge and the traditional conditional random fields (CRF) based method always heavily relies on feature engineering. In this paper, we propose to use neural networks to predict prosodic boundary labels directly from Chinese characters without...

rozdział

Speech and Auditory Interfaces for Ubiquitous, Immersive and Personalized Applications

Lei Xie, Wenhuai Zhao, Xiangzeng Zhou, Xiaohai Tian, więcej

2010 7th International Conference on Ubiquitous Intelligence&Computing and 7th International Conference on Autonomic&Trusted Computing > 503 - 505

2010 7th International Conference on Ubiquitous Intelligence & Computing and 7th International Conference on Autonomic & Trusted Computing (UIC/ATC 2010)

In this demonstration, we introduce our recent progress on speech and auditory technologies for potential ubiquitous, immersive and personalized applications. The first demo shows an intelligent spoken question answering system, which enables users to interact with a talking avatar via natural speech dialogues. The prototype system demonstrates our latest development on automatic speech recognition,...

rozdział

Lip Assistant: Visualize Speech for Hearing Impaired People in Multimedia Services

Lei Xie, Yi Wang, Zhi-Qiang Liu

2006 IEEE International Conference on Systems, Man and Cybernetics > 5 > 4331 - 4336

2006 IEEE International Conference on Systems, Man and Cybernetics

This paper presents a very low bit rate speech-to-video synthesizer, named lip assistant, to help hearing impaired people to better access multimedia services via lipreading. Lip assistant can automatically convert acoustic speech to lip parameters with a bit rate of 2.2 kbps, and decode them to video-realistic mouth animation on the fly. We use multi-stream HMMs (MSHMMs) and the principal component...

Opcje filtrowania

Słowa kluczowe:
SPEECH SYNTHESIS

Data publikacji

Ustaw własny zakres dat

Słowa kluczowe

HIDDEN MARKOV MODELS (4)
SPEECH (3)
ACOUSTICS (1)
ADAPTATION MODELS (1)
AUDITORY INTERFACES (1)
AUTOMATIC PROSODY PREDICTION (1)
AUTOMATIC SPEECH RECOGNITION (1)
AVATARS (1)
BLSTM (1)
COMPUTER ANIMATION (1)
DATA VISUALISATION (1)
EAR (1)
EMBEDDING FEATURES (1)
EXPECTATION MAXIMIZATION ALGORITHM (1)
EXPECTATION-MAXIMISATION ALGORITHM (1)
HANDICAPPED AIDS (1)
HEAD RELATED TRANSFER FUNCTIONS (1)
HEARING IMPAIRED PEOPLE (1)
HUMAN COMPUTER INTERACTION (1)
HUMAN-COMPUTER INTERACTION (1)
INTELLIGENT SPOKEN QUESTION ANSWERING SYSTEM (1)
KEYWORD SPOTTING (1)
LIP ASSISTANT (1)
MAGNETIC HEADS (1)
MULTIMEDIA COMPUTING (1)
MULTIMEDIA SERVICE (1)
MULTISTREAM HMM (1)
NATURAL LANGUAGE PROCESSING (1)
NATURAL SPEECH DIALOGUES (1)
NEURAL NETWORK (1)
PCA (1)
PERSONALIZED TEXT-TO-SPEECH SYNTHESIS (1)
PRAGMATICS (1)
PRINCIPAL COMPONENT ANALYSIS (1)
QUESTION ANSWERING (1)
ROBUSTNESS (1)
SPEECH INTELLIGIBILITY (1)
SPEECH INTERFACES (1)
SPEECH RECOGNITION (1)
SPEECH VISUALIZATION (1)
SPEECH-TO-VIDEO SYNTHESIZER (1)
SPOKEN DIALOGUE SYSTEM (1)
TALKING FACE (1)
TEXT ANALYSIS (1)
THREE DIMENSIONAL DISPLAYS (1)
TIME-DOMAIN ANALYSIS (1)
TRAINING (1)
TRAJECTORY (1)
UBIQUITOUS COMPUTING (1)
VIDEO SIGNAL PROCESSING (1)
VIDEO-REALISTIC MOUTH ANIMATION (1)
VIRTUAL AUDITORY (1)
VIRTUAL AUDITORY TECHNOLOGY (1)
VISUA SPEECH SYNTHESIS (1)
VISUAL SPEECH SYNTHESIS (1)
VOCODERS (1)
więcej

INFONA - portal komunikacji naukowej

Wyniki wyszukiwania dla: Lei Xie

On the training of DNN-based average voice model for speech synthesis

A waveform representation framework for high-quality statistical parametric speech synthesis

Automatic prosody prediction for Chinese speech synthesis using BLSTM-RNN and embedding features

Speech and Auditory Interfaces for Ubiquitous, Immersive and Personalized Applications

Lip Assistant: Visualize Speech for Hearing Impaired People in Multimedia Services

Dodaj adresata

Anulowanie wysłania wiadomości

Czy na pewno chcesz anulować wysłanie wiadomości?

Wyślij wiadomość

Opcje filtrowania

Data publikacji

Ustawianie zakresu dat

Podaj zakres dat dla filtrowania wyświetlonych wyników. Możesz podać datę początkową, końcową lub obie daty. Daty możesz wpisać ręcznie lub wybrać za pomocą kalendarza.

Słowa kluczowe

Zgłaszanie błędu / nadużycia

Nieudane wysłanie zgłoszenia

Ułatwienia dostępu