Search results for: Takaaki Hori

Items 1 to 5 of 5 results

chapter

BLSTM-HMM hybrid system combined with sound activity detection network for polyphonic Sound Event Detection

Tomoki Hayashi, Shinji Watanabe, Tomoki Toda, Takaaki Hori, et al.

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 766 - 770

This paper presents a new hybrid approach for polyphonic Sound Event Detection (SED) which incorporates a temporal structure modeling technique based on a hidden Markov model (HMM) with a frame-by-frame detection method based on a bidirectional long short-term memory (BLSTM) recurrent neural network (RNN). The proposed BLSTM-HMM hybrid system makes it possible to model sound event-dependent temporal...

chapter

Dialog state tracking with attention-based sequence-to-sequence learning

Takaaki Hori, Hai Wang, Chiori Hori, Shinji Watanabe, et al.

2016 IEEE Spoken Language Technology Workshop (SLT) > 552 - 558

We present an advanced dialog state tracking system designed for the 5th Dialog State Tracking Challenge (DSTC5). The main task of DSTC5 is to track the dialog state in a human-human dialog. For each utterance, the tracker emits a frame of slot-value pairs considering the full history of the dialog up to the current turn. Our system includes an encoder-decoder architecture with an attention mechanism...

chapter

ASR error detection and recognition rate estimation using deep bidirectional recurrent neural networks

Atsunori Ogawa, Takaaki Hori

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4370 - 4374

Recurrent neural networks (RNNs) have recently been applied as the classifiers for sequential labeling problems. In this paper, deep bidirectional RNNs (DBRNNs) are applied for the first time to error detection in automatic speech recognition (ASR), which is a sequential labeling problem. We investigate three types of ASR error detection tasks, i.e. confidence estimation, out-of-vocabulary word detection...

chapter

Context adaptive deep neural networks for fast acoustic model adaptation

Marc Delcroix, Keisuke Kinoshita, Takaaki Hori, Tomohiro Nakatani

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4535 - 4539

Deep neural networks (DNNs) are widely used for acoustic modeling in automatic speech recognition (ASR), since they greatly outperform legacy Gaussian mixture model-based systems. However, the levels of performance achieved by current DNN-based systems remain far too low in many tasks, e.g. when the training and testing acoustic contexts differ due to ambient noise, reverberation or speaker variability...

chapter

Spoken document retrieval by discriminative modeling in a high dimensional feature space

Takanobu Oba, Takaaki Hori, Atsushi Nakamura, Akinori Ito

2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5153 - 5156

This paper proposes discriminative modeling in a high dimensional feature space for spoken document retrieval (SDR). To estimate the parameters of a high dimensional model properly, a large quantity of data is necessary, but there is no such large corpus for document retrieval. This paper employs two approaches to overcome this problem. One is a reranking approach. A baseline system first gives each...

INFONA - science communication portal
