Search results for: Xiangang Li

Items from 1 to 14 out of 14 results

article

A comparative study on selecting acoustic modeling units in deep neural networks based large vocabulary Chinese speech recognition

Xiangang Li, Yuning Yang, Zaihu Pang, Xihong Wu

Neurocomputing > 2015 > 170 > C > 251-256

This paper compared the performance of different acoustic modeling units in deep neural networks (DNNs) based large vocabulary continuous speech recognition (LVCSR) systems for Chinese. Recently, the deep neural networks based acoustic modeling method has achieved very competitive performance for many speech recognition tasks, and has become the focus of current LVCSR research. Some previous work...

chapter

Integrating prosodic information into recurrent neural network language model for speech recognition

Tong Fu, Yang Han, Xiangang Li, Yi Liu, more

2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 1194 - 1197

2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

Prosody is a kind of cues that are critical to human speech perception and comprehension, so it is plausible to integrate prosodic information into machine speech recognition. However, as a result of the supra-segmental nature, it is hard to integrate prosodic information with conventional acoustic features. Recently, RNNLMs have shown to be the state-of-the-art language model in many tasks. We thus...

chapter

Chinese syllable-to-character conversion with recurrent neural network based supervised sequence labelling

Yi Liu, Jing Hua, Xiangang Li, Tong Fu, more

2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 350 - 353

2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

Chinese Syllable-to-Character (S2C) conversion is the important component for Input Methods, and the key problem in Chinese S2C conversion is the serious phenomenon in Chinese language. In order to disambiguate homophones to improve Chinese S2C conversion, in this paper, Chinese S2C conversion is treated as a sequence labelling task, and the recurrent neural network (RNN) based on supervise sequence...

article

Removal of low-concentration benzene in indoor air with plasma-MnO2 catalysis system

Hui Ge, Dongxue Hu, Xiangang Li, Ying Tian, more

Journal of Electrostatics > 2015 > 76 > C > 216-221

Non-thermal plasma (NTP) and combined plasma-MnO₂ catalytic (CPMC) air cleaners were tested for removal of low-concentration benzene in air. Both air cleaners were made of stainless steel needle matrix plate and used DC corona discharger. The effects of discharge power and relative humidity (RH) on benzene removal efficiency were investigated in a closed chamber. The intermediate products produced...

chapter

Constructing long short-term memory based deep recurrent neural networks for large vocabulary speech recognition

Xiangang Li, Xihong Wu

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4520 - 4524

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Long short-term memory (LSTM) based acoustic modeling methods have recently been shown to give state-of-the-art performance on some speech recognition tasks. To achieve a further performance improvement, in this research, deep extensions on LSTM are investigated considering that deep hierarchical model has turned out to be more efficient than a shallow one. Motivated by previous research on constructing...

chapter

Improving long short-term memory networks using maxout units for large vocabulary speech recognition

Xiangang Li, Xihong Wu

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4600 - 4604

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

s have been shown to give state-of-the-art performance on many speech recognition tasks. To achieve a further performance improvement, in this paper, maxout units are proposed to be integrated with the LSTM cells, considering those units have brought significant improvements to deep feed-forward neural networks. A novel architecture was constructed by replacing the input activation units (generally...

chapter

Decision tree based state tying for speech recognition using DNN derived embeddings

Xiangang Li, Xihong Wu

The 9th International Symposium on Chinese Spoken Language Processing > 123 - 127

2014 9th International Symposium on Chinese Spoken Language Processing (ISCSLP)

Recently, context dependent (CD)-deep neural network (DNN)-hidden Markov model (HMM) obtains significant improvements in many automatic speech recognition (ASR) tasks. In the standard training procedure for CD-DNN-HMM, the Gaussian mixture models (GMM) based ASR system has to be firstly built to pre-segment the training data and to define the CD states as the targets for DNN. In this paper, we propose...

chapter

Error-driven pronunciation dictionary construction for Mandarin speech recognition

Yi Liu, Xiangang Li, Xihong Wu

The 9th International Symposium on Chinese Spoken Language Processing > 88 - 92

2014 9th International Symposium on Chinese Spoken Language Processing (ISCSLP)

Aiming at constructing the pronunciation dictionary for Mandarin speech recognition, an automatic error-driven and incremental approach is proposed based on the acoustic confusion network. This method considers both of the acoustic and language information, constructs a dictionary through words selection and composition to optimal the performance of ASR directly. During the process, removing and splitting...

chapter

Labeling unsegmented sequence data with DNN-HMM and its application for speech recognition

Xiangang Li, Xihong Wu

The 9th International Symposium on Chinese Spoken Language Processing > 10 - 14

2014 9th International Symposium on Chinese Spoken Language Processing (ISCSLP)

Recently, deep neural network (DNN) with hidden Markov model (HMM) has turned out to be a superior sequence learning framework, based on which significant improvements were achieved in many application tasks, such as automatic speech recognition (ASR). However, the training of DNN-HMM requires the pre-segmented training data, which can be generated using Gaussian Mixture Model (GMM) in ASR tasks....

chapter

Recurrent neural network language model with part-of-speech for Mandarin speech recognition

Caixia Gong, Xiangang Li, Xihong Wu

The 9th International Symposium on Chinese Spoken Language Processing > 459 - 463

2014 9th International Symposium on Chinese Spoken Language Processing (ISCSLP)

Recurrent neural network language models (RNNLMs) have been successfully applied in a variety of language processing applications ranging from speech recognition to machine translation. They can fight the curse of dimensionality by learning a distributed representation (word vector). The components of these vectors measure the co-occurrence of the word with context features over a corpus. However,...

chapter

Query-based composition for large-scale language model in LVCSR

Yang Han, Chenwei Zhang, Xiangang Li, Yi Liu, more

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4898 - 4902

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

This paper describes a query-based composition algorithm that can integrate an ARPA format language model in the unified WFST framework, which avoids the memory and time cost of converting the language models to WFST and optimizing the WFST of language models. The proposed algorithm is applied to on-the-fly one-pass decoder and rescoring decoder. Both modified decoder require less memory during decoding...

chapter

Error feedback based lexical entity extraction for Chinese language modeling

Yi Liu, Jing Hua, Xiangang Li, Xihong Wu

2013 6th International Congress on Image and Signal Processing (CISP) > 3 > 1298 - 1303

2013 6th International Congress on Image and Signal Processing (CISP)

Chinese, which is quite different from western languages, has no standard definition of word. Therefore, choosing suitable lexicon plays an important role in Chinese language modeling. This paper proposes a novel method of constructing the lexicon automatically. Other than depending on statistical measures of text features, this method is directly based on the feedback of errors from the corresponding...

chapter

Deep neural networks for syllable based acoustic modeling in Chinese speech recognition

Xiangang Li, Caifu Hong, Yuning Yang, Xihong Wu

2013 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference > 1 - 4

2013 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

Recently, the deep neural networks (DNNs) based acoustic modeling methods have been successfully applied to many speech recognition tasks. This paper reports the work about applying DNNs for syllable based acoustic modeling in Chinese automatic speech recognition (ASR). Compared with initial/finals (IFs), syllable can implicitly model the intra-syllable variations in better accuracy. However, the...

chapter

The effect of part-of-speech on Mandarin speech recognition

Caixia Gong, Xiangang Li, Xihong Wu

2013 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference > 1 - 4

2013 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

This paper concentrates on the effect of part-of-speech on Mandarin speech recognition by incorporating it into language model and pronunciation dictionary. This work is motivated by the two benefits of part-of-speech, one is to reduce the lexical ambiguity in language model to some extent and the other is to provide some information about the pronunciation of heteronyms. The experiments conducted...

Filter options

Publication date

Set your own date range

INFONA - science communication portal

Search results for: Xiangang Li

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Data set

Journal

Reporting an error / abuse

Sending the report failed

Accessibility options