Search results

Items from 1 to 7 out of 7 results

chapter

Stimulated training for automatic speech recognition and keyword search in limited resource conditions

A. Ragni, C. Wu, M. J. F. Gales, J. Vasilakes, more

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4830 - 4834

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

families, alphabets, phone sets and vocabulary sizes. In particular, it looks at ensembles of stimulated networks to ensure that improved generalisation will withstand system combination effects. In order to assess stimulated training beyond 1-best transcription accuracy, this paper looks at keyword search as a proxy for

chapter

End-to-end ASR-free keyword search from speech

Kartik Audhkhasi, Andrew Rosenberg, Abhinav Sethy, Bhuvana Ramabhadran, more

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4840 - 4844

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

sequence during training. This paper explores the design of an ASR-free end-to-end system for text query-based keyword search (KWS) from speech trained with minimal supervision. Our E2E KWS system consists of three sub-systems. The first sub-system is a recurrent neural network (RNN)-based acoustic auto-encoder trained to

chapter

Trainable frontend for robust and far-field keyword spotting

Yuxuan Wang, Pascal Getreuer, Thad Hughes, Richard F. Lyon, more

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5670 - 5674

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

of an automatic gain control based dynamic compression to replace the widely used static (such as log or root) compression. We evaluate PCEN on the keyword spotting task. On our large rerecorded noisy and far-field eval sets, we show that PCEN significantly improves recognition performance. Furthermore, we model PCEN as

chapter

An LSTM-CTC based verification system for proxy-word based OOV keyword search

Zhiqiang Lv, Jian Kang, Wei-Qiang Zhang, Jia Liu

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5655 - 5659

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Proxy-word based out of vocabulary (OOV) keyword search has been proven to be quite effective in keyword search. In proxy-word based OOV keyword search, each OOV keyword is assigned several proxies and detections of the proxies are regarded as detections of the OOV keywords. However, the confidence scores of these

article

Data Augmentation for Deep Neural Network Acoustic Modeling

Xiaodong Cui, Vaibhava Goel, Brian Kingsbury

IEEE/ACM Transactions on Audio, Speech, and Language Processing > 2015 > 23 > 9 > 1469 - 1477

) and convolutional neural networks (CNNs). The approaches are focused on increasing speaker and speech variations of the limited training data such that the acoustic models trained with the augmented data are more robust to such variations. In addition, a two-stage data augmentation scheme based on a stacked architecture

chapter

Investigations on byte-level convolutional neural networks for language modeling in low resource speech recognition

Kazuki Irie, Pavel Golik, Ralf Schluter, Hermann Ney

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5740 - 5744

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

characters, even on syllabic alphabets like Amharic. In addition, we report improvements in word error rate from rescoring lattices and evaluate keyword search performance on several languages.

chapter

Multilingual exemplar-based acoustic model for the NIST Open KWS 2015 evaluation

Van Hai Do, Xiong Xiao, Haihua Xu, Eng Siong Chng, more

2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 594 - 98

2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

In this paper, we investigate the use of the proposed non-parametric exemplar-based acoustic modeling for the NIST Open Keyword Search 2015 Evaluation. Specifically, kernel-density model is used to replace GMM in HMM/GMM (Hidden Markov Model / Gaussian Mixture Model) or DNN in HMM/DNN (Hidden Markov Model / Deep

Filter options

Keywords:
NEURAL NETWORKS
ACOUSTICS

Publication date

Set your own date range

Publication type

book (6)
article (1)

Keywords

KEYWORD SEARCH (5)
SPEECH (4)
HIDDEN MARKOV MODELS (3)
AUTOMATIC SPEECH RECOGNITION (2)
DEEP NEURAL NETWORKS (2)
FEATURE EXTRACTION (2)
SPEECH RECOGNITION (2)
TUNING (2)
AUTOMATIC GAIN CONTROL (1)
CONVOLUTION (1)
CONVOLUTIONAL NEURAL NETWORKS (1)
CTC (1)
DATA AUGMENTATION (1)
DATA MODELS (1)
END-TO-END SYSTEMS (1)
GAIN CONTROL (1)
JOINT DECODING (1)
KERNEL (1)
KEYWORD SPOTTING (1)
LANGUAGE MODELING (1)
LATTICES (1)
LIMITED RESOURCES (1)
OOV KEYWORD (1)
PROXY KEYWORD (1)
RECURRENT NEURAL NETWORKS (1)
ROAD TRANSPORTATION (1)
ROBUST AND FAR-FIELD SPEECH RECOGNITION (1)
ROBUSTNESS (1)
SMOOTHING METHODS (1)
STANDARDS (1)
STIMULATED TRAINING (1)
STOCHASTIC FEATURE MAPPING (1)
TOPOLOGY (1)
TRAINING DATA (1)
VERIFICATION (1)
more

INFONA - science communication portal

Search results

Stimulated training for automatic speech recognition and keyword search in limited resource conditions

End-to-end ASR-free keyword search from speech

Trainable frontend for robust and far-field keyword spotting

An LSTM-CTC based verification system for proxy-word based OOV keyword search

Data Augmentation for Deep Neural Network Acoustic Modeling

Investigations on byte-level convolutional neural networks for language modeling in low resource speech recognition

Multilingual exemplar-based acoustic model for the NIST Open KWS 2015 evaluation

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options