Search results for: Wei Qiang

Items from 21 to 40 out of 89 results

chapter

The THUEE system for the openKWS14 keyword search evaluation

Meng Cai, Zhiqiang Lv, Beili Song, Yongzhe Shi, more

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4734 - 4738

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

The OpenKWS14 keyword search evaluation is one of the most challenging and influential evaluations in the field of speech recognition. Its goal is to build a high-performance keyword search system for a minority language with limited training data in a short period of time. We present the system of the Department of Electronic Engineering, Tsinghua University (THUEE team) for the OpenKWS14 keyword...

chapter

Neuron sparseness versus connection sparseness in deep neural network for large vocabulary speech recognition

Jian Kang, Cheng Lu, Meng Cai, Wei-Qiang Zhang, more

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4954 - 4958

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Exploiting sparseness in deep neural networks is an important method for reducing the computational cost. In this paper, we study neuron sparseness in deep neural networks for acoustic modeling. For the feed-forward stage, we only activate neurons whose input values are larger than a given threshold, and set the outputs of inactive nodes to zero. Thus, only a few nonzero outputs are fed to the next...

chapter

A generator control strategy for reducing the load shedding amount after the UHVDC commutation failure

Yi Xiaoyu, Wang Yuanyuan, Guo Weimin, Wei Qiang, more

2014 International Conference on Power System Technology > 1001 - 1007

2014 International Conference on Power System Technology (POWERCON)

In the power grid of "strong DC and weak AC", the commutation failure of the ultra-high voltage (UHV) line will result in angle swing between two regional power grids. Our analysis shows that, unlike the scenario in the single machine infinite bus system, the speed of angle swing between two regional power grids after the fault is slow since the power vacancy is not very large (compared with the...

chapter

Phonotactic language recognition based on DNN-HMM acoustic model

Wei-Wei Liu, Meng Cai, Hua Yuan, Xiao-Bei Shi, more

The 9th International Symposium on Chinese Spoken Language Processing > 153 - 157

2014 9th International Symposium on Chinese Spoken Language Processing (ISCSLP)

A recently introduced deep neural network (DNN) has achieved some unprecedented gains in many challenging automatic speech recognition (ASR) tasks. In this paper, a deep neural network hidden Markov model (DNN-HMM) acoustic model is introduced to phonotactic language recognition and outperforms artificial neural network hidden Markov model (ANN-HMM) and Gaussian mixture model hidden Markov model (GMM-HMM)...

chapter

Discriminative boosting regression backend for phonotactic language recognition

Wei-Wei Liu, Wei-Qiang Zhang, Jia Liu

The 9th International Symposium on Chinese Spoken Language Processing > 148 - 152

2014 9th International Symposium on Chinese Spoken Language Processing (ISCSLP)

In spoken language recognition (SLR), discriminatively trained models always outperform non-discriminative models but are computationally expensive and complex to implement. In this paper, we explore a novel approach to discriminative vector space model (VSM) training by using a boosting regression framework, in which an ensemble of VSMs is trained sequentially. The effectiveness of our boosting variation...

chapter

Multi-scale kernels for short utterance speaker recognition

Wei-Qiang Zhang, Junhong Zhao, Wen-Lin Zhang, Jia Liu

The 9th International Symposium on Chinese Spoken Language Processing > 414 - 417

2014 9th International Symposium on Chinese Spoken Language Processing (ISCSLP)

Short utterances are a great challenge for speaker recognition, because very limited data can be used for training and testing. To give a robust estimation, the number of model parameters for the short utterance should be less than that for the long utterance; however, this may impede the model's descriptive capability. In this paper, we propose a multi-scale kernel (MSK) approach to solve this...

chapter

A new fast and memory effective i-vector extraction based on factor analysis of KLD derived GMM supervector

Zhi-Yi Li, Wei-Qiang Zhang, Yao Tian, Jia Liu

The 9th International Symposium on Chinese Spoken Language Processing > 163 - 167

2014 9th International Symposium on Chinese Spoken Language Processing (ISCSLP)

At present, the i-vector model has become the state-of-the-art technology for speaker recognition. It represents a speech utterance as a low-dimensional, fixed-length compact i-vector. For some real applications, the i-vector extraction procedure is relatively slow and requires too much memory. Some numerical approximation based fast extraction methods have been proposed to speed up the computation and to save...

chapter

Speaker verification using Fisher vector

Yao Tian, Liang He, Zhi-yi Li, Wei-lan Wu, more

The 9th International Symposium on Chinese Spoken Language Processing > 419 - 422

2014 9th International Symposium on Chinese Spoken Language Processing (ISCSLP)

This paper introduces an approach based on Fisher vector feature representation for speaker verification. The Fisher vector originates from the Fisher kernel and represents each utterance as a high-dimensional vector by encoding the derivatives of the log-likelihood of the UBM model with respect to its mean and variances. This representation captures the average first and second order differences between...

chapter

Improved phonotactic language recognition based on RNN feature reconstruction

Wei-Wei Liu, Wei-Qiang Zhang, Yongzhe Shi, An Ji, more

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5322 - 5326

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Nowadays phone recognition followed by support vector machine (PR-SVM) has been proposed in language recognition tasks and shown encouraging results. However, it still suffers from the problems such as the curse of dimensionality led by the increasing order of the N-gram feature supervector, the fast increasing number of possible parameters because of fast exact match of the phoneme history, etc....

chapter

Variance regularization of RNNLM for speech recognition

Yongzhe Shi, Wei-Qiang Zhang, Meng Cai, Jia Liu

2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4893 - 4897

ICASSP 2014 - 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Recurrent neural network language models (RNNLMs) have been proved superior to many other competitive language modeling techniques in terms of perplexity and word error rate. The remaining problem is the great computational complexity of RNNLMs in the output layer, resulting in long evaluation times. Typically, a class-based RNNLM with the output layer factorized was proposed for speedup, which...

chapter

Compact acoustic modeling based on acoustic manifold using a mixture of factor analyzers

A. Wen-Lin Zhang, C. Bi-Cheng Li, B. Wei-Qiang Zhang

2013 IEEE Workshop on Automatic Speech Recognition and Understanding > 37 - 42

2013 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU)

A compact acoustic model for speech recognition is proposed based on nonlinear manifold modeling of the acoustic feature space. Acoustic features of the speech signal are assumed to form a low-dimensional manifold, which is modeled by a mixture of factor analyzers. Each factor analyzer describes a local area of the manifold using a low-dimensional linear model. For an HMM-based speech recognition system,...

chapter

LDPC coded MSK-BOC modulation for satellite navigation system

Xue Rui, Xu Xichao, Wei Qiang, Xing Daiyu

2013 IEEE 11th International Conference on Electronic Measurement & Instruments > 2 > 771 - 775

2013 IEEE 11th International Conference on Electronic Measurement & Instruments (ICEMI)

Signal structure is one of the decisive factors in the inherent performance of a satellite navigation system; it is also one of the critical technologies that must be resolved during system design and upgrading. In order to improve code tracking precision and achieve better bit error rate (BER) performance at the same time, we combine low-density parity-check (LDPC) codes and minimum shift...

chapter

THUEE system for the Albayzin 2012 language recognition evaluation

Weiwei Liu, Wei-Qiang Zhang, Liang He, Jiaming Xu, more

2013 IEEE China Summit and International Conference on Signal and Information Processing > 109 - 112

2013 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP)

The Albayzin 2012 language recognition evaluation (LRE) is one of the most challenging language recognition evaluations, which is mainly reflected in: (1) the target languages are more confusable with other languages, which might push down the system performance; (2) development and test data are heterogeneous regarding duration, number of speakers, ambient noise/music, channel conditions, etc.; (3) signals...

chapter

From query log to competitive advertising: A business intelligence method for elaborating consideration set of keywords

Wei Ying, Wei Qiang, Zhang Jin

2013 International Conference on Management Science and Engineering 20th Annual Conference Proceedings > 179 - 185

2013 International Conference on Management Science and Engineering (ICMSE)

With the rapid development of keyword advertising on search engine platforms, competitive advertising becomes a novel strategy for advertisers to gain more potential market share. Though keyword suggestion methods can help match the keywords chosen by the advertisers and the queries in search engine, mainstream keyword suggestion methods suggest keywords by directly extending seed keywords and cannot...

chapter

Improving deep neural network acoustic models using unlabeled data

Meng Cai, Wei-Qiang Zhang, Jia Liu

2013 IEEE China Summit and International Conference on Signal and Information Processing > 137 - 141

2013 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP)

The Context-Dependent Deep-Neural-Network HMM, or CD-DNN-HMM, is a powerful acoustic modeling technique. Its training process typically involves unsupervised pre-training and supervised fine-tuning. In the paper, we demonstrate that the performance of DNNs can be improved by utilizing a large amount of unlabeled data in the training procedure. In our method, CD-DNN-HMM trained using 309 hours of unlabeled...

chapter

The research about video surveillance platform based on cloud computing

Xiang Chen, Jie-Bin Xu, Wei-Qiang Guo

2013 International Conference on Machine Learning and Cybernetics > 2 > 979 - 983

2013 International Conference on Machine Learning and Cybernetics (ICMLC)

With the rapid development of the Smart City, the video surveillance platform, a critical part of it, is now under substantial pressure. Current video surveillance platforms appear to have several bottlenecks: overloaded streaming media servers, weak fault tolerance, and poor scalability. After an investigation on the architecture of the current stand-alone streaming media server, this paper proposes...

chapter

Temporal kernel neural network language model

YongZhe Shi, Wei-Qiang Zhang, Meng Cai, Jia Liu

2013 IEEE International Conference on Acoustics, Speech and Signal Processing > 8247 - 8251

ICASSP 2013 - 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Using neural networks to estimate the probabilities of word sequences has shown significant promise for statistical language modeling. Typical modeling methods include multi-layer neural networks, log-bilinear networks and recurrent neural networks, etc. In this paper, we propose the temporal kernel neural network language model, a variant of models mentioned above. This model explicitly captures...

chapter

Automatic pitch accent detection using auto-context with acoustic features

Junhong Zhao, Wei-Qiang Zhang, Hua Yuan, Jia Liu, more

2012 8th International Symposium on Chinese Spoken Language Processing > 247 - 251

2012 8th International Symposium on Chinese Spoken Language Processing (ISCSLP 2012)

In the field of prosody event detection, many local acoustic features have been proposed for representing the prosody characteristics of speech units. The context information that represents some possible regularities underlying neighboring prosody events, however, hasn't been used effectively. The main difficulty in utilizing prosodic context is that it's hard to capture the long-distance sequential dependency...

chapter

Image Denoising Based on Wave Atoms and Cycle Spinning

Zhang Wei-Qiang, Song Yi-Mei, Feng Ji-Qiang

2012 Eighth International Conference on Computational Intelligence and Security > 310 - 313

2012 Eighth International Conference on Computational Intelligence and Security (CIS)

A new method for image denoising was presented, which combined the strengths of the wave atoms transform and Cycle Spinning. Due to the lack of translation invariance of the wave atoms transform, image denoising by coefficient thresholding would lead to Pseudo-Gibbs phenomena. Cycle Spinning was employed to avoid the artifacts. Experimental results show that the method can remove...

chapter

GPU accelerated GMM supervectors for speaker and language recognition

Wang Fuqiu, Wei-Qiang Zhang, Liu Jia

2012 IEEE 11th International Conference on Signal Processing > 1 > 536 - 539

2012 11th International Conference on Signal Processing (ICSP 2012)

Computing supervectors from many sliced utterance feature vectors as the inputs to support vector machine is used in many state-of-the-art systems for speaker and language recognition. This feature recombination method can achieve very good recognition results, but is also very time-consuming. By analyzing the supervectors computation procedure, we found great data-parallel potential. We can use vector/matrix...


INFONA - science communication portal
