ICASSP 2008. IEEE International Conference on Acoustic, Speech and Signal Processes

Items from 41 to 44 out of 44 results

chapter

Text normalization in mandarin text-to-speech system

Yuxiang Jia, Dezhi Huang, Wu Liu, Shiwen Yu, more

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 4693 - 4696

ICASSP 2008. IEEE International Conference on Acoustic, Speech and Signal Processes

Text normalization is an important component in text-to-speech system and the difficulty in text normalization is to disambiguate the non-standard words (NSWs). This paper develops a taxonomy of NSWs on the basis of a large scale Chinese corpus, and proposes a two-stage NSWs disambiguation strategy, finite state automata (FSA) for initial classification and maximum entropy (ME) classifiers for subclass...

chapter

Improving phoneme and accent estimation by leveraging a dictionary for a stochastic TTS front-end

T. Nagano, R. Tachibana, N. Itoh, M. Nishimura

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 4689 - 4692

ICASSP 2008. IEEE International Conference on Acoustic, Speech and Signal Processes

Determining the correct phonemes and pitch accents is important for creating natural Japanese speech. We implemented a TTS front-end system based on an n-gram model. However, the vocabulary of the word n-gram model is limited to the list of the words found in the training corpus, and collecting a very large training corpus is not an easy task. In this paper, we propose using an additional class n-gram...

chapter

A decoder for large vocabulary continuous short message dictation on embedded devices

J. Olsen, Yang Cao, Guohong Ding, Xinxing Yang

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 4337 - 4340

ICASSP 2008. IEEE International Conference on Acoustic, Speech and Signal Processes

We present our recent progress towards implementing large vocabulary continuous SMS dictation in embedded devices. The dictation engine we describe here is based on the popular finite state transducer paradigm and is capable of handling large vocabularies and high order n-gram language models in a small memory footprint - even relative to what is available in current high end devices such as the Nokia...

chapter

Phonetic pronunciations for arabic speech-to-text systems

F. Diehl, M.J.F. Gales, M. Tomalin, P.C. Woodland

2008 IEEE International Conference on Acoustics, Speech and Signal Processing > 1573 - 1576

ICASSP 2008. IEEE International Conference on Acoustic, Speech and Signal Processes

In this paper two aspects of generating and using phonetic arabic dictionaries are described. First, the use of single pronunciation acoustic models in the context of arabic large vocabulary automatic speech recognition (ASR) is investigated. These have been found to be useful for English ASR systems, when combined with standard multiple pronunciation systems. The second area examined is automatically...

Keywords:
SPEECH SYNTHESIS

Publication date

Set your own date range

Keywords

SPEECH PROCESSING (16)
SPEECH RECOGNITION (11)
HIDDEN MARKOV MODELS (8)
NATURAL LANGUAGE PROCESSING (7)
SPEECH ANALYSIS (5)
UNIT SELECTION (5)
CONCATENATIVE SPEECH SYNTHESIS (4)
SPEECH CODING (4)
DECISION TREES (3)
GAUSSIAN PROCESSES (3)
HIDDEN MARKOV MODEL (3)
HMM (3)
PROBABILITY (3)
SPEAKER RECOGNITION (3)
STATISTICAL ANALYSIS (3)
VOICE CONVERSION (3)
AGGLOMERATIVE CLUSTERING (2)
AUDIO DATABASES (2)
CEPSTRAL ANALYSIS (2)
DISCRETE COSINE TRANSFORMS (2)
DISCRIMINATIVE TRAINING (2)
ENERGY ENVELOPE (2)
FINITE STATE MACHINES (2)
GAUSSIAN MIXTURE MODEL (2)
LEARNING (ARTIFICIAL INTELLIGENCE) (2)
LEAST MEAN SQUARES METHODS (2)
MANDARIN TEXT-TO-SPEECH SYSTEM (2)
MAXIMUM LIKELIHOOD ESTIMATION (2)
MINIMUM GENERATION ERROR (2)
MODEL ADAPTATION (2)
NATURAL SPEECH (2)
PREDICTION THEORY (2)
REGRESSION ANALYSIS (2)
SIGNAL REPRESENTATION (2)
SPEAKER ADAPTATION (2)
SPECTRAL ANALYSIS (2)
SPEECH DATABASE (2)
SPEECH ENHANCEMENT (2)
STOCHASTIC PROCESSES (2)
TEXT-TO-SPEECH SYNTHESIS (2)
3D MOTION DATA (1)
ACCENT (1)
ACCENT ESTIMATION (1)
ACOUSTIC PARAMETERS (1)
ADAPTIVE ESTIMATION (1)
ADAPTIVE SIGNAL PROCESSING (1)
ADAPTIVE SPECTRAL SMOOTHING (1)
ADMISSIBLE STOPPING (1)
AGGLOMERATIVE AND SEQUENTIAL INFORMATION BOTTLENECK (1)
ANALYSIS-BY-SYNTHESIS FEATURES (1)
ANALYSIS-BY-SYNTHESIS SPEECH CODERS (1)
APERIODIC ENERGIES (1)
APERIODICITY ESTIMATION (1)
APPROXIMATION THEORY (1)
ARABIC (1)
ARABIC LARGE VOCABULARY (1)
ARABIC SPEECH-TO-TEXT SYSTEMS (1)
ARTICULATORY RECOGNITION (1)
ARTICULATORY SYNTHESIS (1)
AUGMENTED SYSTEM (1)
AUTOMATIC CONTEXT SENSITIVE PHONE SET MAPPING METHOD (1)
AUTOMATIC EVALUATION (1)
AUTOMATIC FREQUENCY ESTIMATION (1)
AUTOMATIC JOINT PROSODY LABELING (1)
AUTOMATIC PARAMETER SELECTION (1)
AUTOMATIC PHONETICS RECONSTRUCTION (1)
AUTOMATIC SPEECH RECOGNITION (1)
AUTOMATIC TRANSCRIPTION (1)
AVERAGE VOICE (1)
AVERAGE VOICE MODEL (1)
AVERAGE VOICE MODELS (1)
BASELINE SYSTEM (1)
BI-DIRECTIONAL CROSS-LINGUAL MAPPINGS (1)
BILINGUAL (1)
BILINGUAL CODE SPEECH SYNTHESIS (1)
BILINGUAL MANDARIN-ENGLISH TTS (1)
BILINGUAL SPEECH DATABASE (1)
BLIZZARD CHALLENGE (1)
BLIZZARD CHALLENGE 2007 (1)
BREATH GROUP (1)
CASCADED MODELINGS (1)
CEPSTRAL LIFTERING (1)
CEPSTRAL MODEL ORDER SELECTION CRITERION (1)
CHINESE CHARACTER TRANSCRIPTION (1)
COMPUTATIONAL LINGUISTICS (1)
CONCATENATIVE TEXT-TO-SPEECH SYNTHESIS (1)
CONFIDENCE MEASURE (1)
CONSISTENT SAMPLING (1)
CONSISTENT SAMPLING THEORY (1)
CONTEXT CLUSTERING (1)
CONTEXT SENSITIVE MAPPING (1)
CONTEXT WINDOW LENGTH (1)
CONTEXTUAL GRAPHEME (1)
CONTEXTUAL GRAPHEMES (1)
CONTINUOUS MANDARIN SPEECH (1)
CONTOURS INTERPOLATION (1)
CORRECT PRONUNCIATION PREDICTION (1)
COST DEGRADATION CRITERION (1)
COVARIANCE ANALYSIS (1)
more

INFONA - science communication portal

ICASSP 2008. IEEE International Conference on Acoustic, Speech and Signal Processes

Text normalization in mandarin text-to-speech system

Improving phoneme and accent estimation by leveraging a dictionary for a stochastic TTS front-end

A decoder for large vocabulary continuous short message dictation on embedded devices

Phonetic pronunciations for arabic speech-to-text systems

Filter options

Publication date

Keywords

INFONA - science communication portal

ICASSP 2008. IEEE International Conference on Acoustic, Speech and Signal Processes $("#expandableTitles").expandable();

Text normalization in mandarin text-to-speech system

Improving phoneme and accent estimation by leveraging a dictionary for a stochastic TTS front-end

A decoder for large vocabulary continuous short message dictation on embedded devices

Phonetic pronunciations for arabic speech-to-text systems

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

ICASSP 2008. IEEE International Conference on Acoustic, Speech and Signal Processes