Search results

Items from 1 to 5 out of 5 results

chapter

Automatic gain control and multi-style training for robust small-footprint keyword spotting with deep neural networks

Rohit Prabhavalkar, Raziel Alvarez, Carolina Parada, Preetum Nakkiran, more

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4704 - 4708

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We explore techniques to improve the robustness of small-footprint keyword spotting models based on deep neural networks (DNNs) in the presence of background noise and in far-field conditions. We find that system performance can be improved significantly, with relative improvements up to 75% in far-field conditions

chapter

Improving HMM-Based Keyword Spotting with Character Language Models

Andreas Fischer, Volkmar Frinken, Horst Bunke, Ching Y. Suen

2013 12th International Conference on Document Analysis and Recognition > 506 - 510

2013 12th International Conference on Document Analysis and Recognition (ICDAR)

Facing high error rates and slow recognition speed for full text transcription of unconstrained handwriting images, keyword spotting is a promising alternative to locate specific search terms within scanned document images. We have previously proposed a learning-based method for keyword spotting using character hidden

chapter

Visual language model for keyword spotting on historical mongolian document images

Hongxi Wei, Guanglai Gao

2017 29th Chinese Control And Decision Conference (CCDC) > 1737 - 1742

2017 29th Chinese Control And Decision Conference (CCDC)

The Bag-of-Visual-Words (BoVW) approach has been attracted some attention in the field of keyword spotting. However, the BoVW approach discards the spatial relations of the visual words. Therefore, a visual language model is integrated into the BoVW framework in this study so as to add the spatial information. To

chapter

Realizing speech to gesture conversion by keyword spotting

Na Zhao, Hongwu Yang

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP) > 1 - 5

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP)

The paper proposed a method to realize a speech-to-gesture conversion for communication between normal and speech-impaired people. Keyword spotting was employed to recognize the keywords from input speech signals. At the same time, the three dimensional gesture models of keywords were built by 3D modeling technology

chapter

Comparison of GMM and fuzzy-GMM applied to phoneme classification

K. Abida, F. Karray, Jiping Sun

2009 3rd International Conference on Signals, Circuits and Systems (SCS) > 1 - 4

2009 3rd International Conference on Signals, Circuits and Systems (SCS 2009)

of vocabulary words in the users speech utterance. In this paper, we investigate an approach that can be deployed in keyword spotting systems. We propose a phoneme classifier that will be ultimately used to provide confidence values to be compared against existing Automatic Speech Recognizer word confidences. The end

Filter options

Keywords:
MATHEMATICAL MODEL
KEYWORD SPOTTING

Publication date

Set your own date range

Keywords

HIDDEN MARKOV MODELS (3)
SPEECH (3)
SPEECH RECOGNITION (3)
COMPUTATIONAL MODELING (2)
TRAINING (2)
ASSISTIVE TECHNOLOGY (1)
AUTOMATIC GAIN CONTROL (1)
AUTOMATIC SPEECH RECOGNIZER WORD CONFIDENCE (1)
CHARACTER RECOGNITION (1)
CLASSIFICATION ALGORITHMS (1)
CONFIDENCE VALUES (1)
DATABASES (1)
ENCODING (1)
ENGLISH PHONEMES CLASSIFICATION (1)
FUZZY GAUSSIAN MIXTURE MODELING (1)
FUZZY GMM (1)
FUZZY LOGIC (1)
FUZZY SET THEORY (1)
GAIN CONTROL (1)
GAUSSIAN MIXTURE MODELING (1)
GAUSSIAN PROCESSES (1)
GESTURE MODELING (1)
HANDWRITING RECOGNITION (1)
IMAGE CODING (1)
IMAGE RETRIEVAL (1)
JOINTS (1)
KEYWORD SPOTTING SYSTEMS (1)
KL DIVERGENCE (1)
LANGUAGE MODELS (1)
MULTI-STYLE TRAINING (1)
NATURAL HUMAN MACHINE INTERFACE (1)
NATURAL LANGUAGE PROCESSING (1)
NATURAL LANGUAGE SPEECH (1)
NATURAL SPEECH ENABLED SYSTEMS (1)
NOISE (1)
NOISE MEASUREMENT (1)
PATTERN CLASSIFICATION (1)
PHONEME CLASSIFICATION (1)
PHONEME CLASSIFIER (1)
QUERY LIKELIHOOD MODEL (1)
SMALL-FOOTPRINT MODELS (1)
SMOOTHING (1)
SMOOTHING METHODS (1)
SOLID MODELING (1)
SPEECH RECOGNITION SYSTEM (1)
SPEECH TO GESTURE CONVERSION (1)
TEXT RECOGNITION (1)
VISUAL LANGUAGE MODEL (1)
VISUALIZATION (1)
VITERBI ALGORITHM (1)
VOICE BASED REQUEST (1)
more

INFONA - science communication portal

Search results

Automatic gain control and multi-style training for robust small-footprint keyword spotting with deep neural networks

Improving HMM-Based Keyword Spotting with Character Language Models

Visual language model for keyword spotting on historical mongolian document images

Realizing speech to gesture conversion by keyword spotting

Comparison of GMM and fuzzy-GMM applied to phoneme classification

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options