Search results for: Slava Shechtman

Items from 1 to 4 out of 4 results

chapter

Coherent modification of pitch and energy for expressive prosody implantation

Alexander Sorin, Slava Shechtman, Vincent Pollet

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4914 - 4918

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In expressive TTS and voice transformation systems, implantation of expressive prosody derived from external out-of-domain sources often leads to extreme pitch modification that compromises the naturalness of the synthesized speech.

chapter

Transient modeling for overlap-add sinusoidal model of speech

Slava Shechtman

2013 IEEE International Conference on Acoustics, Speech and Signal Processing > 8189 - 8192

ICASSP 2013 - 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Speech sinusoidal modeling has been successfully applied to a broad range of speech analysis, synthesis and modification tasks. In most cases it reproduces high-quality speech; however, for speech transients (e.g., plosives, glottal stops) it suffers from reduced fidelity due to the lack of intra-frame modeling of irregularities. Various extensions have been proposed for the stationary sinusoidal model to cope...

chapter

Footprint reduction of Concatenative Text-To-Speech synthesizers using polynomial temporal decomposition

Tamar Shoham, David Malah, Slava Shechtman

2010 4th International Symposium on Communications, Control and Signal Processing (ISCCSP) > 1 - 5

4th International Symposium on Communications, Control and Signal Processing (ISCCSP 2010)

High-quality, low-footprint Concatenative Text-To-Speech (CTTS) synthesizers pose a persistent challenge in the field of speech processing. The spectral parameters representing the short speech segments used in the concatenation process account for a large portion of the required memory. In this paper we propose to use a vectorial form of Polynomial Temporal Decomposition combined with jointly optimal...

article

Statistical Text-to-Speech Synthesis Based on Segment-Wise Representation With a Norm Constraint

Stas Tiomkin, David Malah, Slava Shechtman

IEEE Transactions on Audio, Speech, and Language Processing > 2010 > 18 > 5 > 1077 - 1082

In statistical HMM-based text-to-speech systems (STTS), speech feature dynamics are modeled by first- and second-order feature frame differences, which typically do not satisfactorily represent the frame-to-frame feature dynamics present in natural speech. The reduced dynamics result in over-smoothing of speech features, which is often perceived as muffled synthesized speech. In this correspondence, we propose...


INFONA - science communication portal
