Search results

Items from 1 to 9 out of 9 results

chapter

Discriminative Product-of-Expert acoustic mapping for cross-lingual phone recognition

Khe Chai Sim

2009 IEEE Workshop on Automatic Speech Recognition&Understanding > 546 - 551

2009 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU 2009)

This paper presents a product-of-expert framework to perform probabilistic acoustic mapping for cross-lingual phone recognition. Under this framework, the posterior probabilities of the target HMM states are modelled as the weighted product of experts, where the experts or their weights are modelled as functions of the posterior probabilities of the source HMM states generated by a foreign phone recogniser...

chapter

An Effective CALL System for Strongly Accented Mandarin Speech

Tonghai Jiang, Ming Tang, Fengpei Ge, Changliang Liu, more

2009 International Conference on Research Challenges in Computer Science > 92 - 95

2009 International Conference on Research Challenges in Computer Science (ICRCCS 2009)

In this paper, we investigate some specific acoustic problems of the computer assisted language learning (CALL) system by modifying the acoustic model and feature under the speech recognition framework. At first, in order to alleviate the distortion of channel and speaker, speaker-dependent Cepstrum Mean Normalization (Speaker CMN) is adopted, by which the average correlation coefficient (ACC) between...

chapter

Pronunciation modeling for dialectal arabic speech recognition

H. Al-Haj, R. Hsiao, I. Lane, A.W. Black, more

2009 IEEE Workshop on Automatic Speech Recognition&Understanding > 525 - 528

2009 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU 2009)

Short vowels in Arabic are normally omitted in written text which leads to ambiguity in the pronunciation. This is even more pronounced for dialectal Arabic where a single word can be pronounced quite differently based on the speaker's nationality, level of education, social class and religion. In this paper we focus on pronunciation modeling for Iraqi-Arabic speech. We introduce multiple pronunciations...

chapter

Phoneme cluster based state mapping for text-independent voice conversion

Meng Zhang, Jiaohua Tao, J. Nurminen, Jilei Tian, more

2009 IEEE International Conference on Acoustics, Speech and Signal Processing > 4281 - 4284

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

This paper takes phonetic information into account for data alignment in text-independent voice conversion. Hidden Markov models are used for representing the phonetic structure of training speech. States belonging to same phoneme are grouped together to form a phoneme cluster. A state mapped codebook based transformation is established using information on the corresponding phoneme clusters from...

article

Techware: Speech recognition software and resources on the web [Best of the Web]

P. Nguyen

IEEE Signal Processing Magazine > 2009 > 26 > 3 > 102 - 105

In this issue, "Best of the Web" presents online resources available to tackle the problem of speech recognition. Automatic speech recognition turns spoken audio into a sequence of words. It is an extraordinarily broad and multidisciplinary field, drawing primarily from statistical signal processing, machine learning, and linguistics. Although speech-recognition-oriented applications are...

chapter

Decision Fusion for Improving Mispronunciation Detection Using Language Transfer Knowledge and Phoneme-Dependent Pronunciation Scoring

W.K. Lo, A.M. Harrison, H. Meng, Lan Wang

2008 6th International Symposium on Chinese Spoken Language Processing > 1 - 4

2008 6th International Symposium on Chinese Spoken Language Processing

Application of linguistic knowledge of language transfer to automatic speech recognition (ASR) technology can enhance mispronunciation detection performance in computer-aided pronunciation training (CAPT). This is achieved by pinpointing salient pronunciation errors made by second language learners. In this work, we propose to apply decision fusion for further improvement in mispronunciation detection...

chapter

Effective modeling of acoustic confusions for Mandarin CALL system

Fengpei Ge, Fuping Pan, Changliang Liu, Bin Dong, more

2008 9th International Conference on Signal Processing > 663 - 666

2008 9th International Conference on Signal Processing (ICSP 2008)

Acoustic confusions degrade the accuracy of pronunciation assessment severely in computer assisted language learning (CALL) systems. This paper presents our recent study on effective modeling of the acoustic confusions. We change the traditional Mandarin syllable structure, which is composed of initial and final, to a novel phoneme structure. Several phoneme splitting strategies are investigated,...

chapter

A self-referential childlike model to acquire phones, syllables and words from acoustic speech

H. Brandl, B. Wrede, F. Joublin, C. Goerick

2008 7th IEEE International Conference on Development and Learning > 31 - 36

2008 7th IEEE International Conference on Development and Learning

Speech understanding requires the ability to parse spoken utterances into words. But this ability is not innate and needs to be developed by infants within the first years of their life. So far almost all computational speech processing systems neglected this bootstrapping process. Here we propose a model for early infant word learning embedded into a layered architecture comprising phone, phonotactics...

chapter

Mispronunciation detection based on cross-language phonological comparisons

Lan Wang, Xin Feng, H.M. Meng

2008 International Conference on Audio, Language and Image Processing > 307 - 311

2008 International Conference on Audio, Language and Image Processing

This paper presents a method using speech recognition with linguistic constraints to detect the mispronunciations made by Cantonese learners of English. The predicted pronunciation errors have been derived from cross-language phonological comparisons, which are used to generate the erroneous pronunciation variations in a lexicon. The acoustic models are trained with native speakerspsila speech and...

Filter options

Keywords:
ACOUSTICS
HIDDEN MARKOV MODELS
LINGUISTICS

Publication date

Set your own date range

Publication type

book (8)
article (1)

Keywords

SPEECH (8)
SPEECH RECOGNITION (8)
NATURAL LANGUAGE PROCESSING (4)
AUTOMATIC SPEECH RECOGNITION (3)
COMPUTATIONAL MODELING (3)
ARTIFICIAL NEURAL NETWORKS (2)
COMPUTER AIDED INSTRUCTION (2)
DECODING (2)
DICTIONARIES (2)
HIDDEN MARKOV MODEL (2)
MISPRONUNCIATION DETECTION (2)
PROBABILITY (2)
SPEECH PROCESSING (2)
ACOUSTIC CONFUSION MODELING (1)
ACOUSTIC MODEL (1)
ACOUSTIC PROBLEM (1)
ACOUSTIC SIGNAL PROCESSING (1)
ACOUSTIC SPEECH (1)
BEHAVIOURAL SCIENCES (1)
BIOPHYSICS (1)
BOOTSTRAPPING (1)
CALL SYSTEM (1)
CANTONESE LEARNERS (1)
CAPT (1)
CEPSTRAL ANALYSIS (1)
CHANNEL DISTORTION (1)
CHILDLIKE MODEL (1)
COMPUTATIONAL SPEECH PROCESSING (1)
COMPUTER ASSISTED LANGUAGE LEARNING (1)
COMPUTER ASSISTED LANGUAGE LEARNING SYSTEM (1)
COMPUTER BASED TRAINING (1)
COMPUTER-AIDED PRONUNCIATION TRAINING (1)
CORRELATION COEFFICIENT (1)
CROSS-LANGUAGE PHONOLOGICAL COMPARISONS (1)
CROSS-LINGUAL PHONE RECOGNITION (1)
CROSS-LINGUAL VOICE CONVERSION (1)
DATA ALIGNMENT (1)
DATA MINING (1)
DATA MODELS (1)
DECISION FUSION (1)
DECISION TREE MERGING (1)
DECISION TREES (1)
DIALECTAL SPEECH RECOGNITION (1)
DISCRIMINATIVE PRODUCT-OF-EXPERT ACOUSTIC MAPPING (1)
EDUCATION LEVEL (1)
ERROR BACKPROPAGATION (1)
FEATURE EXTRACTION (1)
FEED-FORWARD NEURAL NETWORK (1)
FEEDFORWARD NEURAL NETS (1)
HETEROSCEDASTIC LINEAR DISCRIMINATE ANALYSIS (1)
HMM STATES (1)
HONDAPSILAS ASIMO ROBOT (1)
HUMAN-MACHINE SCORING DIFFERENCE (1)
HUMANS (1)
INFANTS (1)
INTRALINGUAL VOICE CONVERSION (1)
IRAQI-ARABIC SPEECH (1)
LANGUAGE ACQUISITION (1)
LANGUAGE TRANSFER KNOWLEDGE (1)
LEARNING (ARTIFICIAL INTELLIGENCE) (1)
LINGUISTIC CONSTRAINTS (1)
LINGUISTICALLY-MOTIVATED DETECTION (1)
MACHINE LEARNING (1)
MANDARIN CALL SYSTEM (1)
MANDARIN SYLLABLE STRUCTURE (1)
MATERIALS (1)
MATHEMATICAL MODEL (1)
MAXIMUM A POSTERIORI (1)
MAXIMUM LIKELIHOOD ESTIMATION (1)
MULTIDISCIPLINARY FIELD (1)
MULTIMODAL INTEGRATION (1)
NATIVE SPEAKERS SPEECH (1)
NTIMIT DATABASE (1)
ONLINE RESOURCES (1)
ORTHOGRAPHIC TRANSCRIPTIONS (1)
PEDIATRICS (1)
PHONE SEQUENCES RECOGNITION (1)
PHONEME CLUSTER (1)
PHONEME SPLITTING STRATEGY (1)
PHONEME-DEPENDENT PRONUNCIATION SCORING (1)
PHONES (1)
PHONETIC INFORMATION (1)
POSTERIOR PROBABILITY (1)
POSTERIOR WEIGHTED PRODUCT-OF-EXPERT MODEL (1)
PROBABILISTIC ACOUSTIC MAPPING (1)
PROBABILISTIC LOGIC (1)
PRODUCT-OF-POSTERIOR (1)
PRONUNCIATION ASSESSMENT (1)
PRONUNCIATION MODELING (1)
PRONUNCIATION QUALITY ASSESSMENT (1)
PRONUNCIATION WEIGHTS (1)
ROBOTICS (1)
SENSOR FUSION (1)
SHORT VOWELS (1)
SIGNAL PROCESSING (1)
SPEAKER NATIONALITY (1)
more

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options