Search results

Items from 1 to 7 out of 7 results

chapter

Prediction of Korean Prosodic Phrase Boundary by Efficient Feature Selection in Machine Learning

Minho Kim, Youngim Jung, Hyuk-Chul Kwon

2009 21st IEEE International Conference on Tools with Artificial Intelligence > 323 - 327

2009 21st IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2009)

Prediction of the prosodic phrase boundary is a potent influence on the performance of speech recognition and voice synthesis systems. We propose a statistical approach using efficient learning features for the natural prediction of the Korean prosodic phrase boundary. These new features reflect factors that affect the generation of the prosodic phrase boundary better than existing learning features...

chapter

Comparison of sensibilities of Japanese and Koreans in recognizing emotions from speech by using Bayesian networks

Jangsik Cho, S. Kato, H. Itoh

2009 IEEE International Conference on Systems, Man and Cybernetics > 2866 - 2871

2009 IEEE International Conference on Systems, Man and Cybernetics. SMC 2009

The paper describes a comparison of the sensibility of recognizing emotions from human voices speaking Japanese and Korean. Our study focuses on the emotional elements included in the human voice, and our method uses Bayesian networks of prosodic features as models of Japanese's and Korean's sensibilities in recognizing emotions. The training datasets are prosodic features extracted from emotionally...

chapter

Speaker identification with whispered speech based on modified LFCC parameters and feature mapping

Xing Fan, J.H.L. Hansen

2009 IEEE International Conference on Acoustics, Speech and Signal Processing > 4553 - 4556

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

Much research recently in speaker recognition has been devoted to robustness due to microphone and channel effects. However, changes in vocal effort, especially whispered speech, present significant challenges in maintaining system performance. Due to the absence of any periodic excitation in whisper, the spectral structure in whisper and neutral speech will differ. Therefore, performance of speaker...

chapter

The CALO meeting speech recognition and understanding system

G. Tur, A. Stolcke, L. Voss, J. Dowding, more

2008 IEEE Spoken Language Technology Workshop > 69 - 72

2008 IEEE Workshop on Spoken Language Technology. SLT 2008

The CALO meeting assistant provides for distributed meeting capture, annotation, automatic transcription and semantic analysis of multiparty meetings, and is part of the larger CALO personal assistant system. This paper summarizes the CALO-MA architecture and its speech recognition and understanding components, which include real-time and offline speech transcription, dialog act segmentation and tagging,...

chapter

Discriminative learning using linguistic features to rescore n-best speech hypotheses

M. Georgescul, M. Rayner, P. Bouillon, N. Tsourakis

2008 IEEE Spoken Language Technology Workshop > 97 - 100

2008 IEEE Workshop on Spoken Language Technology. SLT 2008

We describe how we were able to improve the accuracy of a medium-vocabulary spoken dialog system by rescoring the list of n-best recognition hypotheses using a combination of acoustic, syntactic, semantic and discourse information. The non-acoustic features are extracted from different intermediate processing results produced by the natural language processing module, and automatically filtered. We...

chapter

Robust Classification of Dialog Acts from the Transcription of Utterances

M.S. Sorower, M. Yeasin

International Conference on Semantic Computing (ICSC 2007) > 3 - 10

2007 International Conference on Semantic Computing

This paper presents a robust classification of dialog acts from text utterances. Two different types, namely, bag-of-words and syntactic relationship among words, were used to extract the discourse level features from the transcript of utterances. Subsequently a number of feature mining methods have been used to identify the most relevant features and their roles in classifying dialog acts. The selected...

chapter

Feature extraction based on minimum classification error/generalized probabilistic descent method

A. Biem, S. Katagiri

1993 IEEE International Conference on Acoustics, Speech, and Signal Processing > 2 > 275 - 278 vol.2

Proceedings of ICASSP '93

A novel approach to pattern recognition which comprehensively optimizes both a feature extraction process and a classification process is introduced. Assuming that the best features for recognition are the ones that yield the lowest classification error rate over unknown data, an overall recognizer, consisting of a feature extractor module and a classifier module, is trained using the minimum classification...

Filter options

Data set:
ieee
Keywords:
FEATURE EXTRACTION
LEARNING (ARTIFICIAL INTELLIGENCE)
SPEECH RECOGNITION
DATA MINING

Publication date

Set your own date range

Keywords

SPEECH (5)
ACCURACY (2)
ERROR ANALYSIS (2)
FEATURE SELECTION (2)
MACHINE LEARNING (2)
SPECTRAL ANALYSIS (2)
ACTION ITEM RECOGNITION (1)
AND DISCOURSE ANALYSIS. (1)
AUTOMATIC SPEECH TRANSCRIPTION (1)
BAG-OF-WORDS (1)
BAYESIAN METHODS (1)
BAYESIAN NETWORK (1)
BAYESIAN NETWORKS (1)
BELIEF NETWORKS (1)
CALO MEETING SPEECH RECOGNITION SYSTEM (1)
CALO MEETING SPEECH UNDERSTANDING SYSTEM (1)
CALO PERSONAL MEETING ASSISTANT SYSTEM (1)
CEPSTRAL ANALYSIS (1)
CEPSTRUM (1)
CEPSTRUM COEFFICIENTS (1)
CEPSTRUM REPRESENTATION (1)
CHARACTER RECOGNITION (1)
COGNITIVE AGENT-THAT-LEARNS-AND-ORGANIZES (1)
COGNITIVE SYSTEMS (1)
COMPARISON OF SENSIBILITIES OF JAPANESE AND KOREANS (1)
CONDITIONAL RANDOM FIELDS (1)
CONDITONAL RANDOM FIELDS (1)
DATABASES (1)
DECISION EXTRACTION (1)
DIALOG ACT SEGMENTATION (1)
DIALOG ACT TAGGING (1)
DIALOG ACTS (1)
DISCRIMINATIVE FEATURE EXTRACTION (1)
DISCRIMINATIVE SUPPORT VECTOR LEARNING (1)
DISTRIBUTED MEETING ANNOTATION (1)
DISTRIBUTED MEETING CAPTURE (1)
EFFECTIVENESS (1)
ELECTRONIC MAIL (1)
EMOTION RECOGNITION (1)
EMOTION RECOGNITION FROM HUMAN VOICE (1)
FEATURE MAPPING (1)
FEATURE MAPPINGWHISPER (1)
FEATURE MINING METHOD (1)
FEATURE VECTOR MAPPING (1)
FRAME-BY-FRAME BASIS (1)
FRONT-END FEATURE COMPENSATION METHOD (1)
GAUSSIAN MIXTURE MODEL (1)
GAUSSIAN PROCESSES (1)
GENERALIZED PROBABILISTIC DESCENT (1)
INTEGRATED CIRCUITS (1)
INTELLIGENT SYSTEMS (1)
INTERACTIVE SYSTEMS (1)
KERNEL (1)
KOREAN PROSODIC PHRASE BOUNDARY (1)
LABORATORIES (1)
LINEAR FREQUENCY CEPSTRAL COEFFICIENT (1)
LINEAR SCALE (1)
LINEAR SCALE CEPSTRUM COEFFICIENTS (1)
LINGUISTIC FEATURES (1)
MACHINE LEARNING ALGORITHM (1)
MEDIUM-VOCABULARY SPOKEN DIALOG SYSTEM (1)
MEL FREQUENCY CEPSTRAL COEFFICIENT (1)
MINIMISATION (1)
MINIMUM CLASSIFICATION ERROR (1)
MODIFIED LFCC PARAMETER (1)
MULTI-AGENT SYSTEMS (1)
MULTIPARTY MEETING SEMANTIC ANALYSIS (1)
MULTIPARTY MEETINGS PROCESSING (1)
N-BEST SPEECH RECOGNITION HYPOTHESES (1)
NATURAL LANGUAGE (1)
NATURAL LANGUAGE PROCESSING (1)
NEURAL NETS (1)
NEUTRAL TRAINED SYSTEM (1)
NONVERBAL VOICE FEATURE (1)
OPTIMIZATION METHODS (1)
PATTERN CLASSIFICATION (1)
PATTERN RECOGNITION (1)
PREDICTIVE MODELS (1)
PROSODIC PHRASE BOUNDARY (1)
QUESTION-ANSWER PAIR IDENTIFICATION (1)
REAL TIME SYSTEMS (1)
ROBUST DIALOG ACTS CLASSIFICATION (1)
SERVERS (1)
SPEAKER ID SYSTEM (1)
SPEAKER IDENTIFICATION (1)
SPEAKER INDEPENDENT GMM (1)
SPEAKER RECOGNITION (1)
SPECTRAL STRUCTURE (1)
SPEECH SUMMARIZATION (1)
SPOKEN LANGUAGE UNDERSTANDING (1)
STATISTICAL ANALYSIS (1)
STATISTICAL APPROACH (1)
SUPPORT VECTOR MACHINES (1)
SYSTEM DESIGN (1)
TEXT UTTERANCE TRANSCRIPTION (1)
TRAINING (1)
more

INFONA - science communication portal

Search results

Prediction of Korean Prosodic Phrase Boundary by Efficient Feature Selection in Machine Learning

Comparison of sensibilities of Japanese and Koreans in recognizing emotions from speech by using Bayesian networks

Speaker identification with whispered speech based on modified LFCC parameters and feature mapping

The CALO meeting speech recognition and understanding system

Discriminative learning using linguistic features to rescore n-best speech hypotheses

Robust Classification of Dialog Acts from the Transcription of Utterances

Feature extraction based on minimum classification error/generalized probabilistic descent method

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options