ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Items from 1 to 7 out of 7 results

chapter

Improving voice quality of HMM-based speech synthesis using voice conversion method

Yishan Jiao, Xiang Xie, Xingyu Na, Ming Tu

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 7914 - 7918

The HMM-based speech synthesis system (HTS) often generates buzzy and muffled speech. Such degradation of voice quality makes synthetic speech sound robotic rather than natural. From this point, we suppose that synthetic speech lies in a speaker space different from that of the original. We propose to use a voice conversion method to transform synthetic speech toward the original so as to improve its...

chapter

Dialogue context sensitive HMM-based speech synthesis

Pirros Tsiakoulis, Catherine Breslin, Milica Gasic, Matthew Henderson, et al.

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 2554 - 2558

The focus of this work is speech synthesis tailored to the needs of spoken dialogue systems. More specifically, the framework of HMM-based speech synthesis is utilized to train an emphatic voice that also considers dialogue context for decision tree state clustering. To achieve this, we designed and recorded a speech corpus comprising system prompts from human-computer interaction, as well as additional...

chapter

Excitation modeling for HMM-based speech synthesis: Breaking down the impact of periodic and aperiodic components

Thomas Drugman, Tuomo Raitio

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 260 - 264

HMM-based speech synthesis generally suffers from typical buzziness due to over-simplified excitation modeling of voiced speech. In order to alleviate this effect, several studies have proposed various new excitation models. No consensus has however been reached on what is the perceptual importance of the accurate modeling of the periodic and aperiodic components of voiced speech, and to what extent...

chapter

HMM-Based singing voice synthesis and its application to Japanese and English

Kazuhiro Nakamura, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 265 - 269

The present paper describes Japanese and English singing voice synthesis systems based on hidden Markov models (HMMs). In this approach, the spectrum, excitation, and vibrato of the singing voice are simultaneously modeled by context-dependent HMMs, and waveforms are generated by the HMMs themselves. Japanese singing voice synthesis systems have already been developed and used to create variable musical...

chapter

A postfilter to modify the modulation spectrum in HMM-based speech synthesis

Shinnosuke Takamichi, Tomoki Toda, Graham Neubig, Sakriani Sakti, et al.

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 290 - 294

In this paper, we propose a postfilter to compensate for the modulation spectrum in HMM-based speech synthesis. In order to alleviate the over-smoothing effect, which is a main cause of quality degradation in HMM-based speech synthesis, it is necessary to consider features that can capture over-smoothing. Global Variance (GV) is one well-known example of such a feature, and the effectiveness of parameter generation...

chapter

Multiple-average-voice-based speech synthesis

Pierre Lanchantin, Mark J.F. Gales, Simon King, Junichi Yamagishi

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 285 - 289

This paper describes a novel approach for the speaker adaptation of statistical parametric speech synthesis systems based on the interpolation of a set of average voice models (AVM). Recent results have shown that the quality/naturalness of adapted voices depends on the distance from the average voice model used for speaker adaptation. This suggests the use of several AVMs trained on carefully chosen...

chapter

Synthesized stereo mapping via deep neural networks for noisy speech recognition

Jun Du, Li-Rong Dai, Qiang Huo

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1764 - 1768

In our previous work, we extended the traditional stereo-based stochastic mapping by relaxing the constraint of stereo data, which is not practical in real applications, using HMM-based speech synthesis to construct the “clean” channel data for noisy speech recognition. In this paper, we propose to use deep neural networks (DNNs) for stereo mapping compared with the joint Gaussian mixture model (GMM)...
