Items from 1 to 7 out of 7 results

chapter

Duration prediction using multiple Gaussian process experts for GPR-based speech synthesis

Decha Moungsri, Tomoki Koriyama, Takao Kobayashi

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5495 - 5499

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper proposes an alternative multi-level approach to duration prediction for improving prosody generation in statistical parametric speech synthesis using multiple Gaussian process experts. We use two duration models at different levels, specifically, syllable and phone. First, we individually train syllable- and phone-level duration models. Then, the predictive distributions of syllable and...

chapter

Linguistically motivated tied-state triphones for Polish speech recognition

Piotr Zelasko, Bartosz Ziolko, Tomasz Jadczyk, Tomasz Pedzimaz

2015 IEEE 2nd International Conference on Cybernetics (CYBCONF) > 251 - 254

2015 IEEE 2nd International Conference on Cybernetics (CYBCONF)

The paper presents one possible approach to building a triphone model for automatic speech recognition of Polish. Even though classifiers are well developed and described, such a task is not trivial because of the lack of sufficient training data and the importance of the calculation time spent training the model. To overcome this problem, some states are typically tied using data-driven criteria...

chapter

Corpus-independent history compression for stochastic turn-taking models

Kornel Laskowski, Elizabeth Shriberg

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4937 - 4940

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

Stochastic turn-taking models use a truncated representation of past speech activity to specify how likely a speaker is to talk at the next instant. An unanswered question in such modeling is how far back to extend the conditioning context. We study this question using Switchboard (English, telephone) and Spontal (Swedish, face-to-face) conversations. We also explore whether to trade off precision...

chapter

Role of nucleus based context in word-independent syllable stress classification

Harish Doddala, Om D Deshmukh, Ashish Verma

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5712 - 5715

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

An acoustic-phonetics based word-independent technique which uses syllable context for classifying the lexical syllable stress of spoken English words is presented. Nucleus based clustering is remarkably successful in moving from word-dependent syllable stress classification which is intrinsically not scalable to word-independent classification. This however is not possible without an inherent drop...

chapter

Significance of segmentation in phoneme based Tamil speech recognition system

S. Harish, P. Vijayalakshmi, T. Nagarajan

2011 3rd International Conference on Electronics Computer Technology > 3 > 212 - 215

2011 3rd International Conference on Electronics Computer Technology (ICECT)

Over the last few decades speech recognition has evolved and matured enough to be used in commercial applications. The applications include automatic dictation software, voice dialling, voice controlled navigation and simple data entry. Automatic Speech Recognition (ASR) deals with automatic conversion of acoustic signals of an utterance into text. In this work speech recognition system for Tamil...

chapter

Emphasized speech synthesis based on hidden Markov models

K. Morizane, K. Nakamura, T. Toda, H. Saruwatari, et al.

2009 Oriental COCOSDA International Conference on Speech Database and Assessments > 76 - 81

2009 Oriental COCOSDA International Conference on Speech Database and Assessments

This paper presents a statistical approach to synthesizing emphasized speech based on hidden Markov models (HMMs). Context-dependent HMMs are trained using emphasized speech data uttered by intentionally emphasizing an arbitrary accentual phrase in a sentence. To model acoustic characteristics of emphasized speech, new contextual factors describing an emphasized accentual phrase are additionally considered...

chapter

An analysis of grammatical errors in non-native speech in English

J. Lee, S. Seneff

2008 IEEE Spoken Language Technology Workshop > 89 - 92

2008 IEEE Workshop on Spoken Language Technology. SLT 2008

While a wide variety of grammatical mistakes may be observed in the speech of non-native speakers, the types and frequencies of these mistakes are not random. Certain parts of speech, for example, have been shown to be especially problematic for Japanese learners of English [1]. Modeling these errors can potentially enhance the performance of computer-assisted language learning systems. This paper...

INFONA - science communication portal
