Search results

Items from 1 to 11 out of 11 results

chapter

Fast speech keyword recognition based on improved filler model

Yang Wang, Jie Yang, Le Zhang

2017 IEEE 2nd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC) > 530 - 534

2017 IEEE 2nd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC)

Most traditional template matching based keyword recognition methods don't need training data, just rely on frame matching. However, the recognition speed is relatively slow and it can't be used in practice. The LVCSR-based method needs to convert the speech signal into text signal before recognition, which has an

chapter

Keyword detection system with zero-resources techniques

Juan Bujjamer, Claudio Estienne, Patricia Pelle

2015 XVI Workshop on Information Processing and Control (RPIC) > 1 - 6

2015 XVI Workshop on Information Processing and Control (RPIC)

A keyword detection system with zero-resources techniques is presented. It consists of a primary alignment method and a later rescoring of its hipotheses. Both stages based on a segmental dynamic time warping method and a segmental model respectively. The resulting system is totally language independent and has no pre

chapter

A robust keyword spotting system for Persian conversational telephone speech using feature and score normalization and ARMA filter

A Shokri, S Tabibian, A Akbari, B Nasersharif, more

2011 IEEE GCC Conference and Exhibition (GCC) > 497 - 500

2011 IEEE GCC Conference and Exhibition (GCC)

Keyword spotting (KWS) refers to detection of a limited number of given keywords in speech utterances. In this paper, we evaluate a robust keyword spotting system based on hidden markov models for speaker independent Persian conversational telephone speech. Performance of base line keyword spotter is improved by means

chapter

A robust keyword detection system for criminal scene analysis

Nengheng Zheng, Xia Li

2010 5th IEEE Conference on Industrial Electronics and Applications > 2127 - 2131

2010 5th IEEE Conference on Industrial Electronics and Applications (ICIEA 2010)

This paper presents a robust keyword detection system for criminal scene analysis. The system follows the classical keyword spotting framework. A universal background model is designed and served as the filler model and anti-word model in keyword recognition and verification, respectively. Specifically, we analyze the

chapter

Word detection in recorded speech using textual queries

Lukasz Laszko

2015 Federated Conference on Computer Science and Information Systems (FedCSIS) > 849 - 853

2015 Federated Conference on Computer Science and Information Systems (FedCSIS)

The paper presents unsupervised method for word detection in recorded spoken language signal. The method is based on examining signal similarity of two analyzed media description: registered voice and a word (textual query) synthesized by using Text-to-Speech tools. The descriptions of media were given by a sequence of Mel-Frequency Cepstral Coefficients or Human-Factor Cepstral Coefficients. Dynamic...

chapter

A fast query-by-example spoken term detection for zero resource languages

Karthik Pandia D S, M S Saranya, Hema A Murthy

2016 International Conference on Signal Processing and Communications (SPCOM) > 1 - 5

2016 International Conference on Signal Processing and Communications (SPCOM)

proposed approach uses a segmental DTW, wherein search is carried out only at syllable boundaries. This reduces the search complexity by 9 times compared to conventional sliding window DTW. The first pass of the proposed method uses a minimum set of templates for a keyword to search through the segmented audio. New templates

chapter

Developing an automatic transcription and retrieval system for spoken lectures in Turkish

Ebru Arisoy

2017 25th Signal Processing and Communications Applications Conference (SIU) > 1 - 4

2017 25th Signal Processing and Communications Applications Conference (SIU)

Turkish video lectures using a large vocabulary continuous speech recognition (LVCSR) system and finding keywords on the lattices obtained from the LVCSR system using a speech retrieval system based on keyword search. While developing this system, first a state-of-the-art LVCSR system was developed for Turkish using advance

chapter

Query-by-example spoken term detection using bessel features

Drisya Vasudev, Suryakanth V Gangashetty, K. K Anish Babu, K. S Riyas

2015 IEEE International Conference on Signal Processing, Informatics, Communication and Energy Systems (SPICES) > 1 - 4

2015 IEEE International Conference on Signal Processing, Informatics, Communication and Energy Systems (SPICES)

Cepstral Coefficients(FBCC) is used in this paper. Here, from the spoken example of a keyword, segmental Dynamic Time Warping is used to compare the Gaussian Posteriorgrams, which are created from the FBCC feature vector. The keyword detection result obtained using MediaEval 2012 database shows that this system outperforms

chapter

Non-negative matrix factorization for highly noise-robust ASR: To enhance or to recognize?

Felix Weninger, Martin Wollmer, Jurgen Geiger, Bjorn Schuller, more

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4681 - 4684

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

methodology, reaching up to 91.9% average keyword accuracy on the Challenge test set at signal-to-noise ratios from −6 to 9 dB-the best result reported so far on these data.

article

Speech detection on broadcast audio

Unal Zubari, Ezgi Can Ozan, Banu Oskay Acar, Tolga Ciloglu, more

02010 00018th European Signal Processing Conference > 2010 > 85 - 89

2010 18th European Signal Processing Conference

Speech boundary detection contributes to performance of speech based applications such as speech recognition and speaker recognition. Speech boundary detector implemented in this study works on broadcast audio as a pre-processor module of a keyword spotter. Speech boundary detection is handled in 3 steps. At first

chapter

Text-dependent speaker recognition for vietnamese

Diep Dao Thi Thu, Loan Trinh Van, Quang Nguyen Hong, Hung Pham Ngoc

2013 International Conference on Soft Computing and Pattern Recognition (SoCPaR) > 196 - 200

2013 International Conference of Soft Computing and Pattern Recognition (SoCPaR)

This paper presents a new method for Vietnamese text-dependent speaker recognition. The system is modeled for each speaker using mixture model Gaussian GMM (Gaussian Mixture Model). The phonemes in the keywords are represented by hidden Markov models HMM. The prior and posterior probabilities for keywords and speakers

Filter options

Keywords:
SPEECH RECOGNITION
MEL FREQUENCY CEPSTRAL COEFFICIENT

Publication date

Set your own date range

Publication type

book (10)
article (1)

Keywords

SPEECH (10)
HIDDEN MARKOV MODELS (6)
TRAINING (4)
FEATURE EXTRACTION (3)
HEURISTIC ALGORITHMS (3)
SPEECH PROCESSING (3)
FILLER MODEL (2)
HMM (2)
KEYWORD DETECTION (2)
ADAPTATION MODELS (1)
ALIZE (1)
ANTI-WORD MODEL (1)
ARMA FILTER (1)
AUDIO INFORMATION RETRIEVAL (1)
AUTOREGRESSIVE MOVING AVERAGE (1)
AUTOREGRESSIVE MOVING AVERAGE PROCESSES (1)
CEPSTRAL ANALYSIS (1)
CEPSTRAL GAIN NORMALIZATION (1)
CEPSTRAL MEAN AND VARIANCE NORMALIZATION (1)
CMVN (1)
COMPUTATIONAL MODELING (1)
COMPUTER SCIENCE (1)
COVARIANCE MATRICES (1)
CRIMINAL SCENE ANALYSIS (1)
DATABASES (1)
DECODING (1)
DETECTORS (1)
DYNAMIC TIME WARPING (1)
ELECTRONIC MAIL (1)
FBCC (1)
FILTER BANKS (1)
FILTERING (1)
GAUSSIAN MIXTURE (1)
GAUSSIAN POSTERIORGRAM (1)
HARMONIC ANALYSIS (1)
IMAGE ANALYSIS (1)
INDEXES (1)
KEYWORD RECOGNITION (1)
KEYWORD SEARCH (1)
KEYWORD SPOTTING (1)
KEYWORD SPOTTING FRAMEWORK (1)
KEYWORD VERIFICATION (1)
LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION (1)
LATTICES (1)
LAW (1)
LDA (1)
MFCC (1)
MVA/CGN PROCESSING (1)
NANOELECTROMECHANICAL SYSTEMS (1)
NOISE (1)
NON-NEGATIVE MATRIX FACTORIZATION (1)
PATTERN MATCHING (1)
PERSIAN CONVERSATIONAL TELEPHONE SPEECH (1)
PITCH VARIATION (1)
PITCH VARIATION CHARACTERISTICS (1)
PLP (1)
PLP FEATURES (1)
QUERY (1)
ROBUST KEYWORD DETECTION SYSTEM (1)
SEGMENTAL MODELS (1)
SIGNAL TO NOISE RATIO (1)
SILICON (1)
SILICON COMPOUNDS (1)
SPEAKER RECOGNITION (1)
SPEECH ANALYSIS (1)
SPEECH AND LANGUAGE PROCESSING FOR EDUCATIONAL TECHNOLOGIES (1)
SPEECH ENHANCEMENT (1)
SPEECH RETRIEVAL (1)
SPEECH UTTERANCES (1)
SPHINX (1)
SPOKEN KEYWORDS DETECTION (1)
SPOKEN TERM DETECTION (1)
SYSTEM PERFORMANCE (1)
TANDEM SPEECH RECOGNITION (1)
TEXT-DEPENDENT (1)
TIME FACTORS (1)
UNIVERSAL BACKGROUND MODEL (1)
VIETNAMESE (1)
VOCABULARY (1)
ZERO-RESOURCES (1)
more

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options