Search results

Items from 1 to 6 out of 6 results

chapter

Fast speech keyword recognition based on improved filler model

Yang Wang, Jie Yang, Le Zhang

2017 IEEE 2nd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC) > 530 - 534

Most traditional template-matching-based keyword recognition methods need no training data and rely only on frame matching. However, their recognition speed is relatively slow, which makes them impractical. The LVCSR-based method must convert the speech signal into text before recognition, which has an

chapter

A robust keyword spotting system for Persian conversational telephone speech using feature and score normalization and ARMA filter

A Shokri, S Tabibian, A Akbari, B Nasersharif, et al.

2011 IEEE GCC Conference and Exhibition (GCC) > 497 - 500

Keyword spotting (KWS) refers to the detection of a limited number of given keywords in speech utterances. In this paper, we evaluate a robust keyword spotting system based on hidden Markov models for speaker-independent Persian conversational telephone speech. Performance of the baseline keyword spotter is improved by means

chapter

A robust keyword detection system for criminal scene analysis

Nengheng Zheng, Xia Li

2010 5th IEEE Conference on Industrial Electronics and Applications > 2127 - 2131

2010 5th IEEE Conference on Industrial Electronics and Applications (ICIEA 2010)

This paper presents a robust keyword detection system for criminal scene analysis. The system follows the classical keyword spotting framework. A universal background model is designed and serves as the filler model in keyword recognition and as the anti-word model in keyword verification. Specifically, we analyze the

chapter

Word detection in recorded speech using textual queries

Lukasz Laszko

2015 Federated Conference on Computer Science and Information Systems (FedCSIS) > 849 - 853

The paper presents an unsupervised method for word detection in a recorded spoken-language signal. The method is based on examining the signal similarity of two media descriptions: the registered voice and a word (textual query) synthesized using Text-to-Speech tools. The media descriptions are given by a sequence of Mel-Frequency Cepstral Coefficients or Human-Factor Cepstral Coefficients. Dynamic...

chapter

Non-negative matrix factorization for highly noise-robust ASR: To enhance or to recognize?

Felix Weninger, Martin Wollmer, Jurgen Geiger, Bjorn Schuller, et al.

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4681 - 4684

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

methodology, reaching up to 91.9% average keyword accuracy on the Challenge test set at signal-to-noise ratios from −6 to 9 dB, the best result reported so far on these data.

chapter

Text-dependent speaker recognition for Vietnamese

Diep Dao Thi Thu, Loan Trinh Van, Quang Nguyen Hong, Hung Pham Ngoc

2013 International Conference on Soft Computing and Pattern Recognition (SoCPaR) > 196 - 200

2013 International Conference of Soft Computing and Pattern Recognition (SoCPaR)

This paper presents a new method for Vietnamese text-dependent speaker recognition. Each speaker is modeled with a Gaussian mixture model (GMM). The phonemes in the keywords are represented by hidden Markov models (HMMs). The prior and posterior probabilities for keywords and speakers

Filter options

Keywords:
HIDDEN MARKOV MODELS
MEL FREQUENCY CEPSTRAL COEFFICIENT
SPEECH RECOGNITION


INFONA - science communication portal
