Search results

chapter

Hybrid context dependent CD-DNN-HMM Keyword Spotting (KWS) in speech conversations

Vivek Tyagi

2016 IEEE 26th International Workshop on Machine Learning for Signal Processing (MLSP), pp. 1-6

... corpus. Using a bigram phoneme language model, phoneme recognition experiments are performed on a two-hour independent test set using Viterbi decoding, which show a relative 33.3% improvement by our CD-DNN acoustic model. We then present a filler-based hybrid DNN-HMM keyword spotting (KWS) system which to our knowledge is ...

chapter

Keyword Spotting in Online Chinese Handwritten Documents with Candidate Scoring Based on Semi-CRF Model

Heng Zhang, Xiang-Dong Zhou, Cheng-Lin Liu

2013 12th International Conference on Document Analysis and Recognition (ICDAR), pp. 567-571

For text-query-based keyword spotting from handwritten Chinese documents, the index is usually organized as a candidate lattice to overcome the ambiguity of character segmentation. Each edge in the lattice denotes a candidate character associated with a candidate class. Character similarity (between character and

chapter

Keyword Spotting in Offline Chinese Handwritten Documents Using a Statistical Model

Liang Huang, Fei Yin, Qing-Hu Chen, Cheng-Lin Liu

2011 International Conference on Document Analysis and Recognition (ICDAR), pp. 78-82

This paper proposes a method for keyword spotting in offline Chinese handwritten documents using a statistical model. On a text query word, the method measures the similarity between the query word and every candidate word in the document by combining a character classifier and four classifiers characterizing the

chapter

Acoustic keyword spotter - optimization from end-user perspective

Igor Szöke, F Grézl, J Černocký, M Fapšo, et al.

2010 IEEE Spoken Language Technology Workshop (SLT 2010), pp. 189-193

The paper deals with the development of an acoustic keyword spotter (KWS) that meets the requirements of a real user from the security community. While the basic scheme of the KWS is relatively standard, it uses novel features derived by a hierarchy of neural networks, and score normalization trained to maximize a user-like

chapter

Max-pooling loss training of long short-term memory networks for small-footprint keyword spotting

Ming Sun, Anirudh Raju, George Tucker, Sankaran Panchapagesan, et al.

2016 IEEE Spoken Language Technology Workshop (SLT), pp. 474-480

We propose a max-pooling based loss function for training Long Short-Term Memory (LSTM) networks for small-footprint keyword spotting (KWS), with low CPU, memory, and latency requirements. The max-pooling loss training can be further guided by initializing with a cross-entropy loss trained network. A posterior

chapter

Spoken term detection with Connectionist Temporal Classification: A novel hybrid CTC-DBN decoder

Martin Wöllmer, Florian Eyben, Bjorn Schuller, Gerhard Rigoll

2010 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2010), pp. 5274-5277

This paper proposes a novel system for robust keyword detection in continuous speech. Our decoder is composed of a bidirectional Long Short-Term Memory recurrent neural network using a Connectionist Temporal Classification (CTC) output layer, and a Dynamic Bayesian Network (DBN). The CTC network exploits bidirectional

chapter

Voice-activity home care system

Oscal T.-C. Chen, Y. H. Tsai, C. W. Su, P. C. Kuo, et al.

2016 IEEE-EMBS 3rd International Conference on Biomedical and Health Informatics (BHI), pp. 110-113

This work proposes a voice-activity home care system which can construct a life log associated with voices at home. Accordingly, the techniques of sound-pressure-level calculation, abnormal sound detection, noise reduction, text-independent speaker recognition and keyword spotting are developed. In abnormal sound

Keywords: KEYWORD SPOTTING, CONTEXT


INFONA - science communication portal
