Search results

Items from 1 to 6 out of 6 results

chapter

Active learning for rule-based and corpus-based Spoken Language Understanding models

P. Gotab, F. Bechet, G. Damnati

2009 IEEE Workshop on Automatic Speech Recognition&Understanding > 444 - 449

2009 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU 2009)

Active learning can be used for the maintenance of a deployed spoken dialog system (SDS) that evolves with time and when large collection of dialog traces can be collected on a daily basis. At the spoken language understanding (SLU) level this maintenance process is crucial as a deployed SDS evolves quickly when services are added, modified or dropped. Knowledge-based approaches, based on manually...

chapter

Language model adaptation using auto-induced semantic structures in a voice search system

Yali Li, Ta Li, Yonghong Yan

2009 IEEE International Conference on Intelligent Computing and Intelligent Systems > 3 > 350 - 353

2009 IEEE International Conference on Intelligent Computing and Intelligent Systems (ICIS 2009)

In this paper, we study how to generate in-domain data for statistical language model adaptation in a Chinese voice search dialogue system. Given limited amount of in-domain data, we use unsupervised clustering to induce semantic classes and structures from the first part of test data. These structures are further augmented with domain information to generate large amount of in-domain data. Lastly...

chapter

Improving mispronunciation detection using machine learning

Yuqiang Chen, Chao Huang, F. Soong

2009 IEEE International Conference on Acoustics, Speech and Signal Processing > 4865 - 4868

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

In this paper, we investigate the problem of mispronunciation detection by considering the influence of speaker and syllables. Machine learning techniques are used to make our method more convenient and flexible for new features, such as syllables normalization. The experimental results on our database, consisting of 9898 syllables pronounced by 100 speakers, show the effectiveness of our method by...

chapter

Impact of novel sources on content-based image and video retrieval

A. Ghoshal, S. Khudanpur, D. Klakow

2009 IEEE International Conference on Acoustics, Speech and Signal Processing > 1937 - 1940

ICASSP 2009 - 2009 IEEE International Conference on Acoustics, Speech and Signal Processing

The problem of content-based image and video retrieval with textual queries is often posed as that of visual concept classification, where classifiers for a set of predetermined visual concepts are trained using a set of manually annotated images. Such a formulation implicitly assumes that the training data has similar distributional characteristics as that of the data which need to be indexed. In...

chapter

HMM parameter reduction for practical gesture recognition

S. Rajko, Gang Qian

2008 8th IEEE International Conference on Automatic Face&Gesture Recognition > 1 - 6

2008 8th IEEE International Conference on Automatic Face & Gesture Recognition

We examine in detail some properties of gesture recognition models which utilize a reduced number of parameters and lower algorithmic complexity compared to traditional hidden Markov models. We show that the reduced parameter models are comparable to standard HMM-based gesture recognition models in their ability to effectively model gestures, and in some cases superior when training data is limited...

chapter

A novel highlight event decision approach for baseball videos

Yih-Ming Su, Shu-Jiun Liang

2008 IEEE International Symposium on Consumer Electronics > 1 - 2

2008 IEEE International Symposium on Consumer Electronics - (ISCE 2008)

A real-time highlight extraction system using the caption information has been proposed to detect and classify the highlight events of the baseball games. The system contains several stages: caption extraction, caption identification, content recognition, and model-indexing decision stages. A superimposed caption in the baseball videos is extracted using a multi-frame averaging technique. After extracting...

Filter options

Keywords:
DATA MINING
DATA MODELS
ADAPTATION MODEL
HIDDEN MARKOV MODELS

Publication date

Set your own date range

Keywords

TRAINING (4)
CONTENT-BASED RETRIEVAL (2)
IMAGE CLASSIFICATION (2)
LEARNING (ARTIFICIAL INTELLIGENCE) (2)
NATURAL LANGUAGE PROCESSING (2)
SPEECH PROCESSING (2)
SPEECH RECOGNITION (2)
VIDEO RETRIEVAL (2)
ACTIVE LEARNING (1)
ALGORITHMIC COMPLEXITY (1)
ANNOTATED CORPUS (1)
AUTO INDUCED SEMANTIC STRUCTURE (1)
AUTOMATIC MISPRONUNCIATION DETECTION (1)
AUTOMATIC MISPRONUNCIATION DETECTION (AMD) (1)
BASEBALL GAME VIDEOS (1)
BASEBALL VIDEOS (1)
CAPTION CONTENT RECOGNITION (1)
CAPTION EXTRACTION (1)
CAPTION IDENTIFICATION (1)
CAPTION INFORMATION (1)
CHINESE VOICE SEARCH DIALOGUE SYSTEM (1)
COMPUTER AIDED INSTRUCTION (1)
COMPUTER AIDED LANGUAGE LEARNING (CALL) (1)
COMPUTER ASSISTED LANGUAGE LEARNING (1)
CONTENT RECOGNITION (1)
CONTENT-BASED IMAGE RETRIEVAL (1)
CONTENT-BASED VIDEO RETRIEVAL (1)
CONTEXT (1)
CORPUS-BASED SPOKEN LANGUAGE UNDERSTANDING MODELS (1)
DATABASE (1)
DECISION MAKING (1)
EVENT DECISION APPROACH (1)
FALSE ACCEPTANCE RATE (1)
FEATURE EXTRACTION (1)
GAMES (1)
GESTURE RECOGNITION (1)
GESTURE RECOGNITION MODELS (1)
GRAMMARS (1)
HIGHLIGHT EVENTS (1)
HMM PARAMETER REDUCTION (1)
IN-DOMAIN DATA GENERATION (1)
INDEXING (1)
INFERENCE MECHANISMS (1)
INFERENCE RULES (1)
INFORMATION RETRIEVAL (1)
INTERACTIVE SYSTEMS (1)
KNOWLEDGE BASED SYSTEMS (1)
KNOWLEDGE-BASED APPROACHES (1)
KNOWLEDGE-BASED MODELS (1)
MACHINE LEARNING (1)
MANDARIN (1)
MANUALLY WRITTEN GRAMMARS (1)
MODEL INDEXING (1)
MODEL-INDEXING DECISION (1)
MULTIFRAME AVERAGING TECHNIQUE (1)
MULTIMEDIA SYSTEMS (1)
MULTIPLE VISUAL DETECTOR (1)
PATTERN CLUSTERING (1)
PREDETERMINED VISUAL CONCEPTS (1)
PROBABILITY DENSITY FUNCTION (1)
RANKED RETRIEVAL PERFORMANCE (1)
REAL-TIME HIGHLIGHT EXTRACTION SYSTEM (1)
ROBUSTNESS (1)
RULE-BASED APPROACH (1)
RULE-BASED SPOKEN LANGUAGE UNDERSTANDING MODELS (1)
RUNTIME (1)
SEMANTIC CLASS INDUCTION (1)
SLU CRITERION (1)
SPEECH (1)
SPOKEN DIALOG SYSTEM (1)
STATISTIC LANGUAGE MODEL ADAPTATION (1)
STATISTICAL LANGUAGE MODEL ADAPTATION (1)
SUPPORT VECTOR MACHINES (1)
SYLLABLES NORMALIZATION (1)
SYSTEM DESIGNERS (1)
TEXTUAL QUERIES (1)
TRAINING DATA (1)
UNSUPERVISED CLUSTERING (1)
VIDEOS (1)
VISUAL CONCEPT CLASSIFICATION (1)
VISUALIZATION (1)
VOICE SEARCH SYSTEM (1)
more

INFONA - science communication portal

Search results

Active learning for rule-based and corpus-based Spoken Language Understanding models

Language model adaptation using auto-induced semantic structures in a voice search system

Improving mispronunciation detection using machine learning

Impact of novel sources on content-based image and video retrieval

HMM parameter reduction for practical gesture recognition

A novel highlight event decision approach for baseball videos

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options