Search results for: Xiang Yin

Items from 1 to 4 out of 4 results

chapter

Modeling spectral envelopes using deep conditional restricted Boltzmann machines for statistical parametric speech synthesis

Xiang Yin, Zhen-Hua Ling, Ya-Jun Hu, Li-Rong Dai

2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5125 - 5129

ICASSP 2016 - 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper proposes a spectral modeling method using a deep conditional restricted Boltzmann machine (DCRBM) for statistical parametric speech synthesis. In this method, a DCRBM, which combines a deep neural network (DNN) with a conditional restricted Boltzmann machine (CRBM), is utilized to describe the conditional distribution of spectral envelopes given linguistic features. Compared with DNN and...

article

Modeling F0 trajectories in hierarchically structured deep neural networks

Xiang Yin, Ming Lei, Yao Qian, Frank K. Soong, et al.

Speech Communication > 2016 > 76 > C > 82-92

This paper investigates F0 modeling of speech in deep neural networks (DNN) for statistical parametric speech synthesis (SPSS). Recently, DNN has been applied to the acoustic modeling of SPSS and has shown good performance in characterizing complex dependencies between contextual features and acoustic observations. However, the additive nature and long-term suprasegmental property of F0 features have...

chapter

Integrating global variance of log power spectrum derived from LSPs into MGE training for HMM-based parametric speech synthesis

Yu-Sheng Sun, Zhen-Hua Ling, Xiang Yin, Li-Rong Dai

The 9th International Symposium on Chinese Spoken Language Processing > 201 - 205

2014 9th International Symposium on Chinese Spoken Language Processing (ISCSLP)

This paper presents a method to improve hidden Markov model (HMM) based parametric speech synthesis by integrating global variance (GV) of log power spectrum (LPS) derived from line spectral pairs (LSPs) into minimum generation error (MGE) model training. In order to alleviate the over-smoothing effect of the generated spectral structures, an LPS-GV based parameter generation method has been proposed...

chapter

Spectral modeling using neural autoregressive distribution estimators for statistical parametric speech synthesis

Xiang Yin, Zhen-Hua Ling, Li-Rong Dai

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 3824 - 3828

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper describes a new approach which utilizes neural autoregressive distribution estimators (NADE) for the spectral modeling in statistical parametric speech synthesis. In order to alleviate the over-smoothing effect on the generated spectral structures, a restricted Boltzmann machine (RBM) modeling method has been proposed in our previous work, where the RBM is adopted to represent the joint...
