Search results

Items from 21 to 27 out of 27 results

chapter

An inner-product lower-bound estimate for dynamic time warping

Yaodong Zhang, James R. Glass

2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5660 - 5663

ICASSP 2011 - 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

similarities between posteriorgrams. In addition to deriving the lower-bound estimate, we show how it can be efficiently used in an admissible K nearest neighbor (KNN) search for spotting matching sequences. We quantify the amount of computational savings achieved by performing a set of unsupervised spoken keyword spotting

chapter

Supervised learning in the wild: Text classification for critical technologies

Arun S. Maiya, Francisco Loaiza-Lemos, Robert M. Rolfe

MILCOM 2012 - 2012 IEEE Military Communications Conference > 1 - 6

MILCOM 2012 - 2012 IEEE Military Communications Conference

, pattern recognition) to detect such critical documents. To address difficult or ambiguous instances, we supplement the text classifier with an automated keyword search. That is, we extract, in an automated fashion, discriminative terms (i.e., keywords) from the training set and match them against documents during the

chapter

Investigations on byte-level convolutional neural networks for language modeling in low resource speech recognition

Kazuki Irie, Pavel Golik, Ralf Schluter, Hermann Ney

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5740 - 5744

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

characters, even on syllabic alphabets like Amharic. In addition, we report improvements in word error rate from rescoring lattices and evaluate keyword search performance on several languages.

article

Data Augmentation for Deep Neural Network Acoustic Modeling

Xiaodong Cui, Vaibhava Goel, Brian Kingsbury

IEEE/ACM Transactions on Audio, Speech, and Language Processing > 2015 > 23 > 9 > 1469 - 1477

is proposed to combine VTLP and SFM as complementary approaches. Experiments are conducted on Assamese and Haitian Creole, two development languages of the IARPA Babel program, and improved performance on automatic speech recognition (ASR) and keyword search (KWS) is reported.

chapter

Semi-supervised training in low-resource ASR and KWS

Florian Metze, Ankur Gandhe, Yajie Miao, Zaid Sheikh, more

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4699 - 4703

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In particular for “low resource” Keyword Search (KWS) and Speech-to-Text (STT) tasks, more untranscribed test data may be available than training data. Several approaches have been proposed to make this data useful during system development, even when initial systems have Word Error Rates (WER) above 70

chapter

Knowledge distillation across ensembles of multilingual models for low-resource languages

Jia Cui, Brian Kingsbury, Bhuvana Ramabhadran, George Saon, more

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4825 - 4829

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper investigates the effectiveness of knowledge distillation in the context of multilingual models. We show that with knowledge distillation, Long Short-Term Memory(LSTM) models can be used to train standard feed-forward Deep Neural Network (DNN) models for a variety of low-resource languages. We then examine how the agreement between the teacher's best labels and the original labels affects...

chapter

Applications of Recurrent Neural Network Language Model in Offline Handwriting Recognition and Word Spotting

Nan Li, Jinying Chen, Huaigu Cao, Bing Zhang, more

2014 14th International Conference on Frontiers in Handwriting Recognition > 134 - 139

2014 14th International Conference on Frontiers in Handwriting Recognition (ICFHR)

The recurrent neural network language model (RNNLM) is a discriminative, non-Markovian model that can capture long-span word history in natural language. It has been proved to be successful in automatic speech recognition and machine translation. In this work, we applied RNNLM to the n-best rescoring stage of the state-of-the-art BBN Byblos OCR (optical character recognition) system for handwriting...

Keywords:
KEYWORD SEARCH
TRAINING

Publication date

Set your own date range

Publication type

book (24)
article (3)

Keywords

SPEECH RECOGNITION (17)
ACOUSTICS (14)
SPEECH (12)
HIDDEN MARKOV MODELS (8)
NEURAL NETWORKS (8)
LATTICES (7)
VOCABULARY (7)
FEATURE EXTRACTION (6)
DATA MODELS (5)
DECODING (5)
SPOKEN TERM DETECTION (5)
AUTOMATIC SPEECH RECOGNITION (4)
RECURRENT NEURAL NETWORKS (4)
CTC (3)
KEYWORD SPOTTING (3)
LANGUAGE MODELING (3)
MULTILINGUAL (3)
ACTIVE LEARNING (2)
DYNAMIC TIME WARPING (2)
END-TO-END SYSTEMS (2)
INDEXES (2)
INFORMATION RETRIEVAL (2)
LOW-RESOURCE (2)
LSTM (2)
MORPHOLOGY (2)
NIST (2)
SEMI-SUPERVISED TRAINING (2)
STANDARDS (2)
TRAINING DATA (2)
VGG (2)
ACOUSTIC MODEL (1)
ACOUSTIC MODELING (1)
ADAPTIVE USER INTEREST HIERARCHY (1)
AGGLUTINATIVE LANGUAGES (1)
APPROXIMATION METHODS (1)
ARTIFICIAL NEURAL NETWORKS (1)
ATTENTION MODELS (1)
ATTENTION NETWORKS (1)
AUTOMATA (1)
BIOLOGICAL NEURAL NETWORKS (1)
BOOK REVIEWS (1)
BUILDINGS (1)
CHARACTER RECOGNITION (1)
CLUSTERING ALGORITHMS (1)
COMPUTER ARCHITECTURE (1)
COMPUTERS (1)
CONVOLUTION (1)
CONVOLUTIONAL NEURAL NETWORKS (1)
DATA AUGMENTATION (1)
DEEP NEURAL NETWORK (1)
DEEP NEURAL NETWORK (DNN) (1)
DEEP NEURAL NETWORKS (1)
DIGITAL FORENSIC (1)
DIGITAL FORENSIC FIELD (1)
DISPERSION (1)
DISTANCE METRIC LEARNING (1)
ELECTRONIC EVIDENCE (1)
END-TO-END SPEECH RECOGNITION (1)
ERROR ANALYSIS (1)
GLASS (1)
GRAMMAR (1)
GRAMMAR NETWORK (1)
GRAPHONES (1)
HANDWRITING RECOGNITION (1)
HARD DISKS (1)
HISTOGRAMS (1)
HUMAN FACTORS (1)
INDEX TECHNOLOGY (1)
INDEXING (1)
INFLECTIVE LANGUAGES (1)
INVESTIGATOR-DEFINED KEYWORD (1)
JOINT DECODING (1)
KERNEL (1)
KEYWORD SELECTION (1)
KNOWLEDGE TRANSFER (1)
LANGUAGE MODEL (1)
LIMITED RESOURCES (1)
LINEAR PROGRAMMING (1)
LOGISTICS (1)
LOW RESOURCED LANGUAGES (1)
LOW-RESOURCE LANGUAGE (1)
LOW-RESOURCE LANGUAGES (1)
LOW-RESOURCE LTS (1)
LOW-RESOURCED LANGUAGES (1)
MACHINE LEARNING (1)
MACHINE LEARNING ALGORITHMS (1)
MEASUREMENT (1)
MODELING (1)
MORPHOLOGICAL ANALYSIS (1)
MULTILINGUAL TRAINING (1)
NOISE MEASUREMENT (1)
OOV KEYWORD (1)
OOV KEYWORDS (1)
OPTICAL CHARACTER RECOGNITION (1)
OPTICAL CHARACTER RECOGNITION SOFTWARE (1)
OPTIMIZATION (1)
OUT-OF-VOCABULARY (OOV) WORDS (1)
POSTERIORGRAM (1)
more

INFONA - science communication portal

Search results

An inner-product lower-bound estimate for dynamic time warping

Supervised learning in the wild: Text classification for critical technologies

Investigations on byte-level convolutional neural networks for language modeling in low resource speech recognition

Data Augmentation for Deep Neural Network Acoustic Modeling

Semi-supervised training in low-resource ASR and KWS

Knowledge distillation across ensembles of multilingual models for low-resource languages

Applications of Recurrent Neural Network Language Model in Offline Handwriting Recognition and Word Spotting

Filter options

Publication date

Publication type

Keywords

INFONA - science communication portal

Search results

An inner-product lower-bound estimate for dynamic time warping

Supervised learning in the wild: Text classification for critical technologies

Investigations on byte-level convolutional neural networks for language modeling in low resource speech recognition

Data Augmentation for Deep Neural Network Acoustic Modeling

Semi-supervised training in low-resource ASR and KWS

Knowledge distillation across ensembles of multilingual models for low-resource languages

Applications of Recurrent Neural Network Language Model in Offline Handwriting Recognition and Word Spotting

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options