Search results for: Yong Lu

Items from 1 to 7 out of 7 results

chapter

VTS feature compensation based on two-layer GMM structure for robust speech recognition

Lin Zhou, Haijing Li, Ying Chen, Zhenyang Wu, more

2016 8th International Conference on Wireless Communications & Signal Processing (WCSP) > 1 - 5

2016 8th International Conference on Wireless Communications & Signal Processing (WCSP)

In this paper, a two-layer Gaussian Mixed Model (GMM) structure for Vector Taylor Series (VTS) feature compensation is proposed for robust speech recognition. Since GMM with the numerous mixture components is used for VTS, the computation complexity of VTS is extremely huge. To deal with this issue, we propose two-layer GMM structure for VTS. In detail, the GMM with fewer mixture components is utilized...

chapter

Research on a kind of Noisy Tibetan speech recognition algorithm based on WNN

Yong Lu, Haining Huang

2011 Seventh International Conference on Natural Computation > 2 > 605 - 608

2011 Seventh International Conference on Natural Computation (ICNC)

The research on noisy Tibetan speech recognition algorithm based on wavelet neural network (WNN) combined with auditory feature was carried out in this paper. The recognition classifier based on WNN was designed, and Mel Frequency Cepstrum Constant (MFCC) feature was given. Then the simulation on the given algorithm was run under the different signal to noise ratios (SNR), and the results illustrated...

chapter

Active Learning and Semi-Supervised Learning in Tibetan Language Speech Recognition

Xiuqin Pan, Yongcun Cao, Yong Lu

2010 International Conference on Artificial Intelligence and Computational Intelligence > 1 > 369 - 372

2010 International Conference on Artificial Intelligence and Computational Intelligence (AICI 2010)

A key challenge in rapidly building Tibetan language speech recognition applications is minimizing the manual effort required in transcribing and labeling speech data. Accurate labeling of Tibetan speech utterances is extremely time consuming and requires trained linguists. For alleviate this problem, we present an approach that aims at reducing the amount of manually transcribed speech data required...

chapter

Tibetan Language Speech Recognition Model Based on Active Learning and Semi-Supervised Learning

Xiuqin Pan, Yongcun Cao, Yong Lu, Yue Zhao

2010 10th IEEE International Conference on Computer and Information Technology > 1225 - 1228

2010 IEEE 10th International Conference on Computer and Information Technology (CIT)

In the researches on Tibetan language speech recognition, accurate labeling of Tibetan speech utterances is extremely time consuming and requires trained linguists. For alleviate this problem, we present an approach that can use few labeled Tibetan speech utterances to construct the effective recognition model. The experimental results show that our approach has better performance than traditional...

article

Robust speech recognition using improved vector taylor series algorithm for embedded systems

Yong Lu, Haiyang Wu, Zhenyang Wu

IEEE Transactions on Consumer Electronics > 2010 > 56 > 2 > 764 - 769

This paper proposes a novel robust speech recognition technique using improved vector Taylor series (VTS) algorithm for embedded systems. It uses a hidden Markov model (HMM) to replace the Gaussian mixture model (GMM) for estimating the clean speech feature, and gives the closed-form solutions of the noise parameters including the mean and variance at each expectation-maximization (EM) iteration....

chapter

Maximum likelihood model adaptation using piecewise linear transformation for robust speech recognition

Yong Lu, Zhenyang Wu

2009 IEEE 13th International Symposium on Consumer Electronics > 608 - 610

2009 IEEE 13th International Symposium on Consumer Electronics (ISCE)

This paper presents a new model adaptation algorithm using piecewise linear transformation (PLT) for robust speech recognition. In this algorithm, the nonlinear relationship between training and testing mean vectors is approximated by a set of piecewise linear transformations. The PLT coefficients are estimated from adaptation data by the expectation-maximization (EM) algorithm and maximum likelihood...

chapter

Research on the Algorithm of Noisy-Robust Tibetan Speech Recognition Based on RBF

Xiuqin Pan, Yong Lu, Yongcun Cao, Hong Zhang, more

2008 International Symposium on Intelligent Information Technology Application Workshops > 416 - 419

2008 International Symposium on Intelligent Information Technology Application Workshops

Aiming at the problem of Tibetan speech recognition under the condition of resistance from noise, a kind of Tibetan speech recognition algorithm, combining RBF network with auditory feature was presented in this paper. The description for the Tibetan speech signals was carried out with Mel frequency cepstrum constant (MFCC), and the recognition classifier was designed based on RBF network with the...

Filter options

Keywords:
SPEECH

Publication date

Set your own date range

Publication type

book (6)
article (1)

Keywords

HIDDEN MARKOV MODELS (3)
LEARNING (ARTIFICIAL INTELLIGENCE) (3)
SUPERVISED LEARNING (3)
ACCURACY (2)
ACTIVE LEARNING (2)
ADAPTATION MODEL (2)
CLASSIFICATION ALGORITHMS (2)
COMPUTATIONAL MODELING (2)
ESTIMATION (2)
FEATURE EXTRACTION (2)
LABELING (2)
MATHEMATICAL MODEL (2)
NOISE (2)
SEMI-SUPERVISED LEARNING (2)
SEMISUPERVISED LEARNING (2)
SIGNAL TO NOISE RATIO (2)
TIBETAN LANGUAGE SPEECH RECOGNITION (2)
ARTIFICIAL NEURAL NETWORKS (1)
AUDITORY FEATURE (1)
ENTROPY (1)
EQUATIONS (1)
EXPECTATION-MAXIMISATION ALGORITHM (1)
EXPECTATION-MAXIMIZATION ALGORITHM (1)
FEATURE COMPENSATION (1)
GMM MODEL (1)
GRADIENT DESCENT METHODS (1)
GRADIENT METHODS (1)
MANUALLY TRANSCRIBED SPEECH DATA (1)
MAXIMUM LIKELIHOOD ESTIMATION (1)
MAXIMUM LIKELIHOOD MODEL ADAPTATION ALGORITHM (1)
MEL FREQUENCY CEPSTRAL COEFFICIENT (1)
MEL FREQUENCY CEPSTRUM CONSTANT (1)
MODEL ADAPTATION (1)
NATURAL LANGUAGE PROCESSING (1)
NEURAL NETWORK (1)
NOISE MEASUREMENT (1)
NOISY-ROBUST (1)
NOISY-ROBUST TIBETAN SPEECH RECOGNITION (1)
PARAMETER ESTIMATION (1)
PIECEWISE LINEAR TECHNIQUES (1)
PIECEWISE LINEAR TRANSFORMATION (1)
PLT COEFFICIENT ESTIMATION (1)
RADIAL BASIS FUNCTION NETWORKS (1)
RBF (1)
RBF NETWORK (1)
RECOGNITION (1)
RECOGNITION CLASSIFIER (1)
ROBUST SPEECH RECOGNITION (1)
ROBUST SPEECH RECOGNITION, VECTOR TAYLOR SERIES, FEATURE COMPENSATION, HIDDEN MARKOV MODEL (1)
SIGNAL PROCESSING ALGORITHMS (1)
TIBETAN LANGUAGE SPEECH RECOGNITION MODEL (1)
TIBETAN SPEECH PROCESSING (1)
TIBETAN SPEECH UTTERANCES (1)
VECTOR TAYLOR SERIES (1)
WAVELET (1)
more

INFONA - science communication portal

Search results for: Yong Lu

VTS feature compensation based on two-layer GMM structure for robust speech recognition

Research on a kind of Noisy Tibetan speech recognition algorithm based on WNN

Active Learning and Semi-Supervised Learning in Tibetan Language Speech Recognition

Tibetan Language Speech Recognition Model Based on Active Learning and Semi-Supervised Learning

Robust speech recognition using improved vector taylor series algorithm for embedded systems

Maximum likelihood model adaptation using piecewise linear transformation for robust speech recognition

Research on the Algorithm of Noisy-Robust Tibetan Speech Recognition Based on RBF

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options