Search results for: Rohit Prabhavalkar

Items from 1 to 9 out of 9 results

chapter

On the compression of recurrent neural networks with an application to LVCSR acoustic modeling for embedded speech recognition

Rohit Prabhavalkar, Ouais Alsharif, Antoine Bruguier, Ian McGraw

2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5970 - 5974

ICASSP 2016 - 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We study the problem of compressing recurrent neural networks (RNNs). In particular, we focus on the compression of RNN acoustic models, which are motivated by the goal of building compact and accurate speech recognition systems which can be run efficiently on mobile devices. In this work, we present a technique for general recurrent model compression that jointly compresses both recurrent and non-recurrent...

chapter

Personalized speech recognition on mobile devices

Ian McGraw, Rohit Prabhavalkar, Raziel Alvarez, Montse Gonzalez Arenas, et al.

2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5955 - 5959

ICASSP 2016 - 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We describe a large vocabulary speech recognition system that is accurate, has low latency, and yet has a small enough memory and computational footprint to run faster than real-time on a Nexus 5 Android smartphone. We employ a quantized Long Short-Term Memory (LSTM) acoustic model trained with connectionist temporal classification (CTC) to directly predict phoneme targets, and further reduce its...

chapter

Automatic gain control and multi-style training for robust small-footprint keyword spotting with deep neural networks

Rohit Prabhavalkar, Raziel Alvarez, Carolina Parada, Preetum Nakkiran, et al.

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4704 - 4708

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We explore techniques to improve the robustness of small-footprint keyword spotting models based on deep neural networks (DNNs) in the presence of background noise and in far-field conditions. We find that system performance can be improved significantly, with relative improvements up to 75% in far-field conditions, by employing a combination of multi-style training and a proposed novel formulation...

chapter

Discriminative articulatory models for spoken term detection in low-resource conversational settings

Rohit Prabhavalkar, Karen Livescu, Eric Fosler-Lussier, Joseph Keshet

2013 IEEE International Conference on Acoustics, Speech and Signal Processing > 8287 - 8291

ICASSP 2013 - 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We study spoken term detection (STD) - the task of determining whether and where a given word or phrase appears in a given segment of speech - using articulatory feature-based pronunciation models. The models are motivated by the requirements of STD in low-resource settings, in which it may not be feasible to train a large-vocabulary continuous speech recognition system, as well as by the need to...

chapter

An evaluation of posterior modeling techniques for phonetic recognition

Rohit Prabhavalkar, Tara N. Sainath, David Nahamoo, Bhuvana Ramabhadran, et al.

2013 IEEE International Conference on Acoustics, Speech and Signal Processing > 7165 - 7169

ICASSP 2013 - 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Several methods have been proposed recently for modeling posterior representations derived from local classifiers [1, 2]. In recent work, Sainath et al. have proposed the use of a tied-mixture-based posterior modeling approach [3] to enhance exemplar-based posterior representations for phone recognition tasks. In this work, we conduct a detailed evaluation to determine the effectiveness of this technique...

article

Conditional Random Fields in Speech, Audio, and Language Processing

Eric Fosler-Lussier, Yanzhang He, Preethi Jyothi, Rohit Prabhavalkar

Proceedings of the IEEE > 2013 > 101 > 5 > 1054 - 1075

Conditional random fields (CRFs) are probabilistic sequence models that have been applied in the last decade to a number of applications in audio, speech, and language processing. In this paper, we provide a tutorial overview of CRF technologies, pointing to other resources for more in-depth discussion; in particular, we describe the common linear-chain model as well as a number of common extensions...

chapter

A chunk-based phonetic score for mobile voice search

Rohit Prabhavalkar, Jasha Droppo

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4729 - 4732

ICASSP 2012 - 2012 IEEE International Conference on Acoustics, Speech and Signal Processing

We propose a chunk-based phonetic score for re-scoring word hypotheses for the mobile voice search task. The score is based on a novel technique for aligning decoded phone sequences with forced-alignments of hypothesized word sequences and exploits phone-boundary timing information. In experimental results, we find that the proposed approach results in a relative word error rate reduction of 4.4%...

chapter

A factored conditional random field model for articulatory feature forced transcription

Rohit Prabhavalkar, Eric Fosler-Lussier, Karen Livescu

2011 IEEE Workshop on Automatic Speech Recognition & Understanding > 77 - 82

2011 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU)

We investigate joint models of articulatory features and apply these models to the problem of automatically generating articulatory transcriptions of spoken utterances given their word transcriptions. The task is motivated by the need for larger amounts of labeled articulatory data for both speech recognition and linguistics research, which is costly and difficult to obtain through manual transcription...

chapter

Backpropagation training for multilayer conditional random field based phone recognition

Rohit Prabhavalkar, Eric Fosler-Lussier

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 5534 - 5537

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

Conditional random fields (CRFs) have recently found increased popularity in automatic speech recognition (ASR) applications. CRFs have previously been shown to be effective combiners of posterior estimates from multilayer perceptrons (MLPs) in phone and word recognition tasks. In this paper, we describe a novel hybrid Multilayer-CRF structure (ML-CRF), where a MLP-like hidden layer serves as input...

INFONA - science communication portal
