Search results

chapter

Weakly supervised keyword learning using sparse representations of speech

Joris Driesen, Jort Gemmeke, Hugo Van hamme

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5145 - 5148

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

When applied to speech, Non-negative Matrix Factorization is capable of learning a small vocabulary of words, foregoing any prior linguistic knowledge. This makes it adequate for small-scale speech applications where flexibility is of the utmost importance, e.g. assistive technology for the speech impaired. However, its performance depends on the way its inputs are represented. We propose the use...

chapter

Improving keyword detection rate using a set of rules to merge HMM-based and SVM-based keyword spotting results

Akram Shokri, Mohammad Hossein Davarpour, Ahmad Akbari

2014 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 1715 - 1718

2014 International Conference on Advances in Computing, Communications and Informatics (ICACCI)

Evaluating the accuracy of HMM-based and SVM-based spotters in detecting keywords and recognizing the true place of keyword occurrence shows that the HMM-based spotter detects the place of occurrence more precisely than the SVM-based spotter. On the other hand, the SVM-based spotter performs much better in detecting

chapter

Improved keyword spotting based on keyword/garbage models

Qiyu Chen, Weibin Zhang, Xiangmin Xu, Xiaofen Xing

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1 - 4

2016 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

We propose two simple methods to improve the performance of a keyword spotting system. In our application, the users are allowed to change the keywords anytime if they want. Thus we focused on phone-based GMM-HMM models since they do not require keyword-specific training data. However, the GMM-HMM based models usually

chapter

ClRank: A Method for Keyword Extraction from Web Pages Using Clustering and Distribution of Nouns

Mohammad Rezaei, Najlah Gali, Pasi Franti

2015 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT) > 1 > 79 - 84

2015 IEEE / WIC / ACM International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT)

Text analysis of a web page is more difficult than the analysis of the text of normal document due to the presence of additional information, such as HTML structure, styling codes, irrelevant text, and presence of hyperlinks. In this paper, we propose an unsupervised method to extract keywords from a web page. The

chapter

Automated English mnemonic keyword suggestion for learning Japanese vocabulary

Orapin Anonthanasap, Monticha Ketna, Teerapong Leelanupab

2015 7th International Conference on Information Technology and Electrical Engineering (ICITEE) > 638 - 643

2015 7th International Conference on Information Technology and Electrical Engineering (ICITEE)

This paper proposes a new methodology that automatically generates English mnemonic keywords to support the learning of basic Japanese vocabulary. A new phonetic algorithm, called JemSoundex, is also introduced for phonetically transliterating the Japanese and English languages for phonetic matching. The effective

chapter

A keyword-aware grammar framework for LVCSR-based spoken keyword search

I-Fan Chen, Chongjia Ni, Boon Pang Lim, Nancy F. Chen, more

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5196 - 5200

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper, we proposed a method to realize the recently developed keyword-aware grammar for LVCSR-based keyword search using weight finite-state automata (WFSA). The approach creates a compact and deterministic grammar WFSA by inserting keyword paths to an existing n-gram WFSA. Tested on the evalpart1 data of the

chapter

Keyword recognition with phone confusion networks and phonological features based keyword threshold detection

A Sangwan, J H L Hansen

2010 Conference Record of the Forty Fourth Asilomar Conference on Signals, Systems and Computers > 711 - 715

2010 44th Asilomar Conference on Signals, Systems and Computers

In this study, a new keyword spotting system (KWS) that utilizes phone confusion networks (PCNs) is presented. The new system exploits the compactness and accuracy of phone confusion networks to deliver fast and accurate results. Special design considerations are provided within the new algorithm to account for phone

chapter

Keyword-specific normalization based keyword spotting for spontaneous speech

Weifeng Li, Qingmin Liao

2012 8th International Symposium on Chinese Spoken Language Processing > 233 - 237

2012 8th International Symposium on Chinese Spoken Language Processing (ISCSLP 2012)

This paper presents a novel architecture for keyword spotting in spontaneous speech, in which keyword model is trained from a small number of acoustic examples provided by a user. The word-spotting architecture relies on scoring patch feature vector sequences extracted by using sliding windows, and performing keyword

chapter

Noise robust keyword spotting for user generated video blogs

M. S. Barakat, C. H. Ritz, D. A. Stirling

2013 IEEE International Conference on Multimedia and Expo (ICME) > 1 - 6

2013 IEEE International Conference on Multimedia and Expo (ICME)

This paper presents a template-based system for speaker independent key word spotting (KWS) in continuous speech that can help in automatic analysis, indexing, search and retrieval of user generated videos by content. Extensive experiments on clean speech confirm that the proposed approach is superior to a HMM approach when applied to noisy speech with different signal-to-noise ratio (SNR) levels...

chapter

Segmental acoustic indexing for zero resource keyword search

Keith Levin, Aren Jansen, Benjamin Van Durme

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5828 - 5832

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

The task of zero resource query-by-example keyword search has received much attention in recent years as the speech technology needs of the developing world grow. These systems traditionally rely upon dynamic time warping (DTW) based retrieval algorithms with runtimes that are linear in the size of the search

chapter

A small vocabulary automatic filipino speech profanity suppression system using hybrid Hidden Markov Model/Artificial Neural Network (HMM/ANN) keyword spotting framework

Fernando I. Ablaza, Timothy Oliver D. Danganan, Bryan Paul L. Javier, Kevin S. Manalang, more

2014 International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment and Management (HNICEM) > 1 - 5

2014 International Conference on Humanoid, Nanotechnology, Information Technology, Communication and Control, Environment and Management (HNICEM)

Markov Model/ Artificial Neural Network (HMM/ANN) keyword spotting framework. The feature extraction method used was Mel-Frequency Cepstral Coefficients (MFCC). The ANN is a 3-layer feedforward neural network using Multi-Layer Perceptron (MLP). In recognizing the words, an HMM decoder was used which implemented the Viterbi

chapter

Voice activation system using acoustic event detection and keyword/speaker recognition

Namgook Cho, Taeyoon Kim, Sangwook Shin, Eun-Kyoung Kim

2011 IEEE International Conference on Consumer Electronics (ICCE) > 21 - 22

2011 IEEE International Conference on Consumer Electronics (ICCE)

We study user-friendly voice interface to consumer electronics and propose a voice activation system that can make speech recognition activated only when voice sounds from legitimate users are detected. The proposed system enables efficient operation of speech recognition in a continuous listening environment without any touch and/or key input.

chapter

Fast speech keyword recognition based on improved filler model

Yang Wang, Jie Yang, Le Zhang

2017 IEEE 2nd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC) > 530 - 534

2017 IEEE 2nd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC)

Most traditional template matching based keyword recognition methods don't need training data, just rely on frame matching. However, the recognition speed is relatively slow and it can't be used in practice. The LVCSR-based method needs to convert the speech signal into text signal before recognition, which has an

chapter

An Improved Template-Based Approach to Keyword Spotting Applied to the Spoken Content of User Generated Video Blogs

M.S. Barakat, C.H. Ritz, D.A. Stirling

2012 IEEE International Conference on Multimedia and Expo > 723 - 728

2012 IEEE International Conference on Multimedia and Expo (ICME)

This paper presents a new technique for preparing word templates to improve the performance of dynamic time warping based keyword spotting. The proposed technique selects one reference template from a small set of examples and in contrast to existing model based approaches does not require extensive training

chapter

End-to-end ASR-free keyword search from speech

Kartik Audhkhasi, Andrew Rosenberg, Abhinav Sethy, Bhuvana Ramabhadran, more

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4840 - 4844

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

sequence during training. This paper explores the design of an ASR-free end-to-end system for text query-based keyword search (KWS) from speech trained with minimal supervision. Our E2E KWS system consists of three sub-systems. The first sub-system is a recurrent neural network (RNN)-based acoustic auto-encoder trained to

chapter

Hybrid context dependent CD-DNN-HMM Keyword Spotting (KWS) in speech conversations

Vivek Tyagi

2016 IEEE 26th International Workshop on Machine Learning for Signal Processing (MLSP) > 1 - 6

2016 IEEE 26th International Workshop on Machine Learning for Signal Processing (MLSP)

corpus. Using a bigram phoneme language model, phoneme recognition experiments are performed on a two hour independent test set using the Viterbi decoding which show a relative 33.3% improvement by our CD-DNN acoustic model. We then present a filler based Hybrid DNN-HMM Keyword Spotting KWS system which to our knowledge is

chapter

Automatic gain control and multi-style training for robust small-footprint keyword spotting with deep neural networks

Rohit Prabhavalkar, Raziel Alvarez, Carolina Parada, Preetum Nakkiran, more

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4704 - 4708

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We explore techniques to improve the robustness of small-footprint keyword spotting models based on deep neural networks (DNNs) in the presence of background noise and in far-field conditions. We find that system performance can be improved significantly, with relative improvements up to 75% in far-field conditions

chapter

Learning speech semantics with keyword classification trees

R. Kuhn, R. De Mori

1993 IEEE International Conference on Acoustics, Speech, and Signal Processing > 2 > 55 - 58 vol.2

Proceedings of ICASSP '93

A linguistic analyzer based on KCTs (keyword classification trees) was trained on sentences from the ATIS (Air Travel Information System) air travel task and incorporated into the system (CHANEL) built at CRIM (Centre de Recherche Informatique de Montreal) for the Nov. 1992 ATIS benchmarks. Word sequences were

chapter

Analysis of keyword spotting performance across IARPA babel languages

William Hartmann, Damianos Karakos, Roger Hsiao, Le Zhang, more

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5765 - 5769

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

With the completion of the IARPA Babel program, it is possible to systematically analyze the performance of speech recognition systems across a wide variety of languages. We select 16 languages from the dataset and compare performance using a deep neural network-based acoustic model. The focus is on keyword spotting

chapter

Improved mandarin spoken term detection by using deep neural network for keyword verification

Xuyang Wang, Ta Li, Yeming Xiao, Jielin Pan, more

2014 10th International Conference on Natural Computation (ICNC) > 144 - 148

2014 10th International Conference on Natural Computation (ICNC)

In this paper, we propose to use Deep Neural Network (DNN), which has been proved to be the state-of-the-art technique in speech recognition, to re-estimate the confidence of keyword hypotheses in the verification stage of spoken term detection. The speech recognition system based on DNN outperforms that based on

INFONA - science communication portal

Search results

Weakly supervised keyword learning using sparse representations of speech

Improving keyword detection rate using a set of rules to merge HMM-based and SVM-based keyword spotting results

Improved keyword spotting based on keyword/garbage models

ClRank: A Method for Keyword Extraction from Web Pages Using Clustering and Distribution of Nouns

Automated English mnemonic keyword suggestion for learning Japanese vocabulary

A keyword-aware grammar framework for LVCSR-based spoken keyword search

Keyword recognition with phone confusion networks and phonological features based keyword threshold detection

Keyword-specific normalization based keyword spotting for spontaneous speech

Noise robust keyword spotting for user generated video blogs

Segmental acoustic indexing for zero resource keyword search

A small vocabulary automatic filipino speech profanity suppression system using hybrid Hidden Markov Model/Artificial Neural Network (HMM/ANN) keyword spotting framework

Voice activation system using acoustic event detection and keyword/speaker recognition

Fast speech keyword recognition based on improved filler model

An Improved Template-Based Approach to Keyword Spotting Applied to the Spoken Content of User Generated Video Blogs

End-to-end ASR-free keyword search from speech

Hybrid context dependent CD-DNN-HMM Keyword Spotting (KWS) in speech conversations

Automatic gain control and multi-style training for robust small-footprint keyword spotting with deep neural networks

Learning speech semantics with keyword classification trees

Analysis of keyword spotting performance across IARPA babel languages

Improved mandarin spoken term detection by using deep neural network for keyword verification

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options