Search results

chapter

Improving accented Mandarin speech recognition by using recurrent neural network based language model adaptation

Hao Ni, Jiangyan Yi, Zhengqi Wen, Bin Liu, et al.

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP) > 1 - 5

In this paper, we propose adapting a recurrent neural network (RNN) based language model to improve the performance of multi-accent Mandarin speech recognition. N-gram based language models have already been applied to speech recognition systems, but they struggle to capture long-span information in a sentence and suffer from severe data sparsity. In contrast, an RNN based language model can overcome...

chapter

Prosodic annotation enriched statistical machine translation

Peidong Guo, Heyan Huang, Ping Jian, Yuhang Guo

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP) > 1 - 5

More and more linguistic information has been employed to improve the performance of machine translation, such as part of speech, syntactic structures, discourse contexts, and so on. However, conventional approaches typically ignore the key information beyond the text such as prosody. In this paper, we exploit and employ three prosodic features: pronunciation (phonetic alphabet and tone), prosodic...

chapter

Confidence estimation for speech recognition systems using conditional random fields trained with partially annotated data

Sheng Li, Xugang Lu, Shinsuke Mori, Yuya Akita, et al.

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP) > 1 - 5

Conditional random fields (CRF) can generate high-quality confidence measure scores (CMS) for speech recognition systems. However, as in many other real-world machine learning tasks, only limited annotated data is available for training, while the abundant unlabeled data requires too much human effort and expertise to annotate. To address this issue, we use a scheme of CRF training for ASR confidence...

chapter

Exploiting noisy web data by OOV ranking for low-resource keyword search

Zhipeng Chen, Ji Wu

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP) > 1 - 5

Spoken keyword search in low-resource conditions suffers from the out-of-vocabulary (OOV) problem and from insufficient text data for language model (LM) training. Web-crawled text data is used to expand the vocabulary and to augment the language model. However, the mismatch between web text and the target speech data makes effective utilization difficult. New words from web data need an evaluation to exclude...

chapter

Dictionary update for NMF-based voice conversion using an encoder-decoder network

Chin-Cheng Hsu, Hsin-Te Hwang, Yi-Chiao Wu, Yu Tsao, et al.

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP) > 1 - 5

In this paper, we propose a dictionary update method for Non-negative Matrix Factorization (NMF) with high dimensional data in a spectral conversion (SC) task. Voice conversion has been widely studied due to its potential applications such as personalized speech synthesis and speech enhancement. Exemplar-based NMF (ENMF) emerges as an effective and probably the simplest choice among all techniques...

chapter

Applying connectionist temporal classification objective function to Chinese Mandarin speech recognition

Pengrui Wang, Jie Li, Bo Xu

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP) > 1 - 5

In automatic speech recognition (ASR), connectionist temporal classification (CTC) is regarded as a method for building end-to-end systems. In fact, not only characters (Chars) but also context independent phonemes (CI-Phns) or context dependent phonemes (CD-Phns) can be used as output units of a CTC-trained neural network. The contribution of this paper mainly lies in three aspects: First, we trained...

chapter

N-best list re-ranking using semantic relatedness and syntactic score: An approach for improving speech recognition accuracy in air traffic control

Van Nhan Nguyen, Harald Holone

2016 16th International Conference on Control, Automation and Systems (ICCAS) > 1315 - 1319

In this paper, we investigate how we can take advantage of the availability of linguistic knowledge, particularly semantic knowledge, in Air Traffic Control (ATC) to reduce the Word Error Rate (WER) of Automatic Speech Recognition (ASR) systems. To facilitate this, we integrate semantic knowledge into post-processing by performing n-best list re-ranking. We first propose a feature called semantic...

chapter

Finite State Machine Based Decoding of Handwritten Text Using Recurrent Neural Networks

Cuong Tuan Nguyen, Masaki Nakagawa

2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR) > 246 - 251

This paper presents a Finite State Machine (FSM) that reduces the user's waiting time for the recognition result after finishing writing, in the recognition of online handwritten English text. The lexicon is modeled by an FSM, and then determinization and minimization are applied to reduce the number of states. The reduction of states in the FSM shortens the waiting time without degrading the recognition accuracy...

chapter

A Lexicon Verification Strategy in a BLSTM Cascade Framework

Stuner Bruno, Chatelain Clement, Paquet Thierry

2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR) > 234 - 239

Handwriting recognition has always been a difficult problem, with image related problems on the one hand and language processing on the other. Significant improvements have been made in handwriting recognition thanks to new recurrent neural networks based on LSTM cells. The high character recognition performance of these networks is almost systematically combined with linguistic knowledge,...

chapter

Variable step-size least-symbol-error-rate adaptive decision feedback turbo equalization for underwater channel

Xiaohui Zhong, Fangjiong Chen, Fei Ji, Hua Yu

OCEANS 2016 MTS/IEEE Monterey > 1 - 4

In this paper, the turbo principle is applied to the existing least symbol error rate (LSER) decision feedback equalization (DFE) for underwater channels. The performance of the DFE adaptive algorithms is aided by soft information delivered from the channel decoder. We introduce a variable step size scheme that takes soft information into account to obtain a more suitable step size, which can reduce...

chapter

A fully convolutional deep auditory model for musical chord recognition

Filip Korzeniowski, Gerhard Widmer

2016 IEEE 26th International Workshop on Machine Learning for Signal Processing (MLSP) > 1 - 6

Chord recognition systems depend on robust feature extraction pipelines. While these pipelines are traditionally hand-crafted, recent advances in end-to-end machine learning have begun to inspire researchers to explore data-driven methods for such tasks. In this paper, we present a chord recognition system that uses a fully convolutional deep auditory model for feature extraction. The extracted features...

chapter

Adaptive Post Filter for Reducing Block Artifacts in High Efficiency Video Coding

Antoine Chauvet, Tomo Miyazaki, Yoshihiro Sugaya, Shinichiro Omachi

2016 International Conference on Multimedia Systems and Signal Processing (ICMSSP) > 22 - 25

This paper describes an adaptive deblocking postfilter based on neural networks for use in H.265 High Efficiency Video Coding (HEVC). Blocking noise is a common problem in video coding caused by the division of the frame into blocks. The filter is adaptive because it uses different filter parameters depending on block characteristics. We use a modified HEVC decoder to export the block information...

chapter

Multiple description vector quantizer design based on redundant representation of central code

Akinori Ito

2016 24th European Signal Processing Conference (EUSIPCO) > 106 - 109

A design method of a multiple description vector quantizer (VQ) is proposed. VQ is widely used for data compression, transmission and other processing. Here, we assume transmission channels with data erasure such as a packet-based network. Multiple description coding is a coding method used to achieve “graceful degradation” when transmitting signals through lossy channels. The proposed method is inspired...

chapter

Adaptive decoding using local field potentials in a brain-machine interface

Rosa So, Camilo Libedinsky, Kai Keng Ang, Wee Chiek Clement Lim, et al.

2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC) > 5721 - 5724

Brain-machine interface (BMI) systems have the potential to restore function to people who suffer from paralysis due to a spinal cord injury. However, in order to achieve long-term use, BMI systems have to overcome two challenges — signal degeneration over time, and non-stationarity of signals. Effects of loss in spike signals over time can be mitigated by using local field potential (LFP) signals...

chapter

ECoG data analyses to inform closed-loop BCI experiments for speech-based prosthetic applications

Tejaswy Pailla, Werner Jiang, Benjamin Dichter, Edward F. Chang, et al.

2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC) > 5713 - 5716

Brain Computer Interfaces (BCIs) assist individuals with motor disabilities by enabling them to control prosthetic devices with their neural activity. Performance of closed-loop BCI systems can be improved by using design strategies that leverage structured and task-relevant neural activity. We use data from high density electrocorticography (ECoG) grids implanted in three subjects to study sensory-motor...

chapter

Big data challenges in decoding cortical activity in a human with quadriplegia to inform a brain computer interface

David A. Friedenberg, Chad E. Bouton, Nicholas V. Annetta, Nicholas Skomrock, et al.

2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC) > 3084 - 3087

Recent advances in Brain Computer Interfaces (BCIs) have created hope that one day paralyzed patients will be able to regain control of their paralyzed limbs. As part of an ongoing clinical study, we have implanted a 96-electrode Utah array in the motor cortex of a paralyzed human. The array generates almost 3 million data points from the brain every second. This presents several big data challenges...

chapter

Neural decoding of code modulated visual evoked potentials by spatio-temporal inverse filtering for brain computer interfaces

Jun-ichi Sato, Yoshikazu Washizawa

2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC) > 1484 - 1487

This study addresses neural decoding of code modulated visual evoked potentials (c-VEPs). The c-VEP was recently developed and applied to brain computer interfaces (BCIs). A c-VEP BCI exhibits faster communication speed than existing VEP-based BCIs. In c-VEP BCIs, canonical correlation analysis (CCA), which maximizes the correlation between an averaged signal and single trial signals, is often used for...

chapter

Hybrid autoencoder and classifier for label-deficient semi-supervised learning: Case study

Rastin Rastgoufard, AbdulRahman Alsamman

2016 19th International Conference on Information Fusion (FUSION) > 79 - 83

Label-deficient semi-supervised learning is a challenging setting in which there is an abundance of unlabeled data but a dearth of labeled data. A hybrid network that mixes an autoencoder, capable of extracting information from unlabeled data, and a neural network classifier, which incorporates information from labeled data, can be useful in a label-deficient setting. In this case study, we examine...

chapter

A neural words encoding model

Dayiheng Liu, Jiancheng Lv, Xiaofeng Qi, Jiangshu Wei

2016 International Joint Conference on Neural Networks (IJCNN) > 532 - 536

This paper proposes a neural network model and learning algorithm that can be applied to encode words. The model performs word encoding and decoding, which can be applied to text encryption/decryption and word-based compression. The model is based on Deep Belief Networks (DBNs) and differs from traditional DBNs in that it is asymmetrically structured and its output is a binary...

chapter

Sparsely connected autoencoder

Kavya Gupta, Angshul Majumdar

2016 International Joint Conference on Neural Networks (IJCNN) > 1940 - 1947

This work proposes to learn autoencoders with sparse connections. Prior studies on autoencoders enforced sparsity on the neuronal activity; these are different from our proposed approach - we learn sparse connections. Sparsity in connections helps in learning (and keeping) the important relations while trimming the irrelevant ones. We have tested the performance of our proposed method on two tasks...

INFONA - science communication portal
