ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Pozycje od 1 do 6 spośród 6 wyników

rozdział

Reduction of acoustic model training time and required data passes via stochastic approaches to maximum likelihood and discriminative training

Petr Novak, Roman Otec, Antonio Lee, Vaibhava Goel

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5577 - 5581

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

The recent boom in use of speech recognition technology has made the access to potentially large amounts of training data easier. This, however, also constitutes a challenge in processing such large, continuously growing amount of information. Here we present a stochastic modification of traditional iterative training approach which leads to the same or even better accuracy of acoustic models and...

rozdział

RASR/NN: The RWTH neural network toolkit for speech recognition

Simon Wiesler, Alexander Richard, Pavel Golik, Ralf Schluter, więcej

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 3281 - 3285

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper describes the new release of RASR — the open source version of the well-proven speech recognition toolkit developed and used at RWTH Aachen University. The focus is put on the implementation of the NN module for training neural network acoustic models. We describe code design, configuration, and features of the NN module. The key feature is a high flexibility regarding the network topology,...

rozdział

Improving deep neural network acoustic models using generalized maxout networks

Xiaohui Zhang, Jan Trmal, Daniel Povey, Sanjeev Khudanpur

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 215 - 219

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Recently, maxout networks have brought significant improvements to various speech recognition and computer vision tasks. In this paper we introduce two new types of generalized maxout units, which we call p-norm and soft-maxout. We investigate their performance in Large Vocabulary Continuous Speech Recognition (LVCSR) tasks in various languages with 10 hours and 60 hours of data, and find that the...

rozdział

Asynchronous stochastic optimization for sequence training of deep neural networks

Georg Heigold, Erik McDermott, Vincent Vanhoucke, Andrew Senior, więcej

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5587 - 5591

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper explores asynchronous stochastic optimization for sequence training of deep neural networks. Sequence training requires more computation than frame-level training using pre-computed frame data. This leads to several complications for stochastic optimization, arising from significant asynchrony in model updates under massive parallelization, and limited data shuffling due to utterance-chunked...

rozdział

Joint training of convolutional and non-convolutional neural networks

Hagen Soltau, George Saon, Tara N. Sainath

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5572 - 5576

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We describe a simple modification of neural networks which consists in extending the commonly used linear layer structure to an arbitrary graph structure. This allows us to combine the benefits of convolutional neural networks with the benefits of regular networks. The joint model has only a small increase in parameter size and training and decoding time are virtually unaffected. We report significant...

rozdział

Context dependent state tying for speech recognition using deep neural network acoustic models

Michiel Bacchiani, David Rybach

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 230 - 234

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper proposes an algorithm to design a tied-state inventory for a context dependent, neural network-based acoustic model for speech recognition. Rather than relying on a GMM/HMM system that operates on a different feature space and is of a different model family, the proposed algorithm optimizes state tying on the activation vectors of the neural network directly. Experiments show the viability...

Opcje filtrowania

Słowa kluczowe:
ACOUSTIC MODELING

Data publikacji

Ustaw własny zakres dat

Słowa kluczowe

SPEECH RECOGNITION (4)
NEURAL NETWORKS (3)
ASYNCHRONOUS STOCHASTIC OPTIMIZATION (1)
CNN (1)
CONTEXT MODELING (1)
DEEP LEARNING (1)
DEEP NEURAL NETWORKS (1)
DISCRIMINATIVE TRAINING (1)
GPU (1)
MAXOUT NETWORKS (1)
MLP (1)
OPEN SOURCE (1)
RASR (1)
SEQUENCE TRAINING (1)
STATE TYING (1)
STOCHASTIC TRAINING (1)
więcej

INFONA - portal komunikacji naukowej

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) $("#expandableTitles").expandable();

Reduction of acoustic model training time and required data passes via stochastic approaches to maximum likelihood and discriminative training

RASR/NN: The RWTH neural network toolkit for speech recognition

Improving deep neural network acoustic models using generalized maxout networks

Asynchronous stochastic optimization for sequence training of deep neural networks

Joint training of convolutional and non-convolutional neural networks

Context dependent state tying for speech recognition using deep neural network acoustic models

Dodaj adresata

Anulowanie wysłania wiadomości

Czy na pewno chcesz anulować wysłanie wiadomości?

Wyślij wiadomość

Opcje filtrowania

Data publikacji

Ustawianie zakresu dat

Podaj zakres dat dla filtrowania wyświetlonych wyników. Możesz podać datę początkową, końcową lub obie daty. Daty możesz wpisać ręcznie lub wybrać za pomocą kalendarza.

Słowa kluczowe

Zgłaszanie błędu / nadużycia

Nieudane wysłanie zgłoszenia

Ułatwienia dostępu

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)