Search results for: Jia Liu

Items from 1 to 18 out of 18 results

chapter

Ivec-PLDA-AHC priors for VB-HMM speaker diarization system

Liang He, Xianhong Chen, Can Xu, Tianyu Liang, more

2017 IEEE International Workshop on Signal Processing Systems (SiPS) > 1 - 6

2017 IEEE International Workshop on Signal Processing Systems (SiPS)

This paper proposes a hybrid speaker diarization system. The main body is a variational Bayes — hidden Markov model (VB-HMM) speaker diarization system. The VB-HMM speaker diarization system avoids making premature hard decision and takes advantages of soft speaker information in an iterative way. Thus, it outperforms most of mainstream speaker diarization systems. Unfortunately, this system is sensitive...

chapter

Lattice based transcription loss for end-to-end speech recognition

Jian Kang, Wei-Qiang Zhang, Jia Liu

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP) > 1 - 5

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP)

End-to-end speech recognition systems have been successfully implemented and have become competitive replacements for hybrid systems. A common loss function to train end-to-end systems is connectionist temporal classification (CTC). This method maximizes the log likelihood between the feature sequence and the associated transcription sequence. However there are some weaknesses with CTC training. The...

chapter

Gated recurrent units based hybrid acoustic models for robust speech recognition

Jian Kang, Wei-Qiang Zhang, Jia Liu

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP) > 1 - 5

2016 10th International Symposium on Chinese Spoken Language Processing (ISCSLP)

Recurrent neural networks (RNNs) have shown an ability to model temporal dependencies. However the problem of exploding or vanishing gradients has limited their application. In recent years, long short-term memory RNNs (LSTM RNNs) have been proposed to solve this problem, and have achieved excellent results. However, because of the large size of LSTM RNNs, they more easily suffer from overfitting,...

chapter

Neuron sparseness versus connection sparseness in deep neural network for large vocabulary speech recognition

Jian Kang, Cheng Lu, Meng Cai, Wei-Qiang Zhang, more

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4954 - 4958

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Exploiting sparseness in deep neural networks is an important method for reducing the computational cost. In this paper, we study neuron sparseness in deep neural networks for acoustic modeling. For the feed-forward stage, we only activate neurons whose input values are larger than a given threshold, and set the outputs of inactive nodes to zero. Thus, only a few nonzero outputs are fed to the next...

chapter

Optimization of the Routes to Realize Non-Fossil Fuel Development Goal in China Based on Linear Dynamic Programming Model in Multi-constraints

Jia Liu, Xuying Qin

2013 Sixth International Conference on Business Intelligence and Financial Engineering > 591 - 595

2013 Sixth International Conference on Business Intelligence and Financial Engineering (BIFE)

A linear dynamic programming model was developed to study the optimization of the routes to realize non-fossil fuel development goal in China. To deal with the flip-flop phenomenon when applying linear programming to describe and solve above managerial and engineering optimization problem, a model for optimizing the routes of non-fossil fuel developing (MORN model) was applied in multi-constraints...

chapter

THUEE system for the Albayzin 2012 language recognition evaluation

Weiwei Liu, Wei-Qiang Zhang, Liang He, Jiaming Xu, more

2013 IEEE China Summit and International Conference on Signal and Information Processing > 109 - 112

2013 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP)

Albayzin 2012 language recognition evaluation (LRE) is one of the most challenging language recognition evaluation, which is mainly reflected in: (1) the target languages are more confusable with other languages, which might push down the system performance; (2) developing and test data is heterogeneous regarding duration, number of speakers, ambient noise/music, channel conditions, etc. (3) signals...

chapter

Improve low-resource non-native mispronunciation detection with native speech by articulatory-based tandem feature

Hua Yuan, Ji Xu, Junhong Zhao, Jia Liu

2013 IEEE China Summit and International Conference on Signal and Information Processing > 127 - 131

2013 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP)

In this paper, we propose a method to improve detecting the mispronunciation type of the non-native learners. In order to cope with the low-resource condition of non-native speech and the difference of native and non-native speech, the following efforts are made: 1) train acoustic model with the low-resource non-native data; 2) introduce the articulatory-based tandem feature; 3) pool auxiliary native...

chapter

Improving deep neural network acoustic models using unlabeled data

Meng Cai, Wei-Qiang Zhang, Jia Liu

2013 IEEE China Summit and International Conference on Signal and Information Processing > 137 - 141

2013 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP)

The Context-Dependent Deep-Neural-Network HMM, or CD-DNN-HMM, is a powerful acoustic modeling technique. Its training process typically involves unsupervised pre-training and supervised fine-tuning. In the paper, we demonstrate that the performance of DNNs can be improved by utilizing a large amount of unlabeled data in the training procedure. In our method, CD-DNN-HMM trained using 309 hours of unlabeled...

chapter

Improve mispronunciation detection with Tandem feature

Hua Yuan, Junhong Zhao, Jia Liu

2012 8th International Symposium on Chinese Spoken Language Processing > 184 - 187

2012 8th International Symposium on Chinese Spoken Language Processing (ISCSLP 2012)

This paper presents a method to improve the mispronunciation detection performance for low-resource acoustic model. The 1h speech data is randomly selected from CU-CHLOE to imitate the low-resource non-native English situation. The Tandem feature derived from articulatory based Multi-Layer Perception (MLP) is employed to replace the traditional spectral feature (e.g. PLP). Further, motivated by similar...

chapter

HMM-Based Predictive Power Saving Mechanism in WiMAX

Jia Liu, Chuang Lin, Fengyuan Ren

2010 IEEE/ACIS 9th International Conference on Computer and Information Science > 459 - 464

2010 IEEE/ACIS 9th International Conference on Computer and Information Science (ICIS 2010)

Several studies have showed that network features (e.g., packet interval and packet size) may be well modeled by a hidden Markov model (HMM) with appropriate hidden variables that capture the current state of the network. In this paper, we propose a prediction mechanism on the basis of the HMM model to assist the Power Saving (PS) in WiMAX. In comparison with prior models whose analyses are often...

chapter

Modeling and Analyzing Gap-Utilized Handover in Wireless Networks

Jia Liu, Chuang Lin, Fengyuan Ren

2010 International Conference on Communications and Mobile Computing > 1 > 503 - 507

2010 International Conference on Communications and Mobile Computing (CMC 2010)

Handover interruption as a critical issue has long been studied in wireless networks towards a seamless and lossless target. This paper proposes a model for gap-utilized handover, using traffic-pattern learning based on HMM, and presents the analytic details regarding to the QoS performance. The handover exploiting traffic gaps, i.e., periods of no packet transferred, can reduce packet loss/delay...

chapter

Phone modeling and combining discriminative training for mandarinenglish bilingual speech recognition

Yanmin Qian, Jia Liu

2010 IEEE International Conference on Acoustics, Speech and Signal Processing > 4918 - 4921

2010 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2010

Automatic multilingual speech recognition is always a difficult task. This paper presents recent work on the development of a Mandarin-English bilingual speech recognition system. A unified single set of bilingual acoustic models based on a novel State-Time-Alignment (STA) method is proposed to balance the performance and the complexity of the bilingual speech recognition system, and a comparison...

chapter

Delivery packets reliably and efficiently over error prone channel of WSNs

Bin-bin Xiong, Jia Liu

2009 International Conference on Intelligent Sensors, Sensor Networks and Information Processing (ISSNIP) > 1 - 6

2009 Fifth International Conference on Intelligent Sensors, Sensor Networks and Information Processing (ISSNIP 2009)

In wireless sensor networks, it has been proved that the reliable transmission protocols sending redundant packets to the upstream neighbour hop-by-hop have advantage on energy efficiency compared with those using end-to-end error recovery and control scheme. It provides an opportunity for applications to find a trade off point regarding transmission probability and energy consumption. The problem...

chapter

Universal Steganalysis to Images with WBMC Model

Xiaoyuan Yang, Shifeng Wang, Jia Liu

2009 Fifth International Conference on Information Assurance and Security > 2 > 627 - 630

2009 Fifth International Conference on Information Assurance and Security (IAS)

We propose a Wavelet based Markov Chain (WBMC) model for nature images, which can present statistic divergence between cover image and steg image prominently. Based on Markov chain empirical matrix, we discussed the difference between low frequency domain and high frequency domain generalized by steg process, and then defined two models: WBMC_L model and WBMC_H model respective to construct our WBMC...

chapter

A novel speech recognition system-on-chip

Haijie Yang, Jing Yao, Jia Liu

2008 International Conference on Audio, Language and Image Processing > 764 - 768

2008 International Conference on Audio, Language and Image Processing

This paper introduces a novel isolated word speech recognition system-on-chip (SoC). An Application Specific Integrated Circuit (ASIC) with a unique vector accelerator is designed in the SoC to realize Continuous density Hidden Markov Model (CHMM) recognition algorithm based on the Mel-Frequency Cepstral Coefficients (MFCC) feature. Due to a hardware and software co-design, the cost of the ASIC is...

chapter

A prediction model based on neural network and fuzzy Markov chain

Jia Liu, Shunxiang Li, Shusheng Jia

2008 7th World Congress on Intelligent Control and Automation > 790 - 793

2008 7th World Congress on Intelligent Control and Automation

In order to solve the problem of random and fluctuation of experiment errors and predication errors of neural network, a neural network model modified by a fuzzy Markov chain was introduced, When neural network was used to predict, the prediction errors between actual value and output value of the network were distributed randomly. That can be simulated by a Markov chain. According to the forecasting...

chapter

Speech Emotion Recognition using an Enhanced Co-Training Algorithm

Jia Liu, Chun Chen, Jiajun Bu, Mingyu You, more

Multimedia and Expo, 2007 IEEE International Conference on > 999 - 1002

2007 IEEE International Conference on Multimedia and Expo

In previous systems of speech emotion recognition, supervised learning are frequently employed to train classifiers on lots of labeled examples. However, the labeling of abundant data requires much time and many human efforts. This paper presents an enhanced co-training algorithm to utilize a large amount of unlabeled speech utterances for building a semi-supervised learning system. It uses two conditionally...

chapter

A New Framework For Large Vocabulary Keyword Spotting Using Two-Pass Confidence Measure

Yingna Chen, Tao Hou, Sha Meng, Shan Zhong, more

The Proceedings of the Multiconference on "Computational Engineering in Systems Applications" > 1 > 68 - 71

Multiconference on "Computational Engineering in Systems Applications"

In this paper, a new framework for large vocabulary keyword spotting is proposed, which involves three phases. In the first phase, N-best sub-word lattice is generated by hidden Markov model (HMM). Keyword candidates are hypothesized by dynamic keyword matching during the second phase. In the last phase, two-pass confidence measure, which provides complementary information, is used for keyword verification...

Filter options

Keywords:
HIDDEN MARKOV MODELS

Publication date

Set your own date range

Keywords

SPEECH RECOGNITION (9)
TRAINING (7)
ACOUSTICS (5)
SPEECH (5)
FEATURE EXTRACTION (4)
DATA MODELS (3)
HIDDEN MARKOV MODEL (3)
PREDICTIVE MODELS (3)
ACOUSTIC MODELING (2)
ADAPTATION MODELS (2)
ARTICULATORY FEATURE (2)
ARTIFICIAL NEURAL NETWORKS (2)
DEEP NEURAL NETWORK (2)
DELAY (2)
ENERGY CONSUMPTION (2)
ERROR ANALYSIS (2)
ERROR STATISTICS (2)
HIDDEN MARKOV MODEL (HMM) (2)
HMM (2)
MANGANESE (2)
MARKOV CHAIN (2)
MARKOV PROCESSES (2)
NEURAL NETWORKS (2)
SUPPORT VECTOR MACHINES (2)
TANDEM FEATURE (2)
VOCABULARY (2)
ACCURACY (1)
AGGLOMERATIVE HIERARCHICAL CLUSTERING (AHC) (1)
ALBAYZIN 2012 LANGUAGE RECOGNITION EVALUATION (LRE) (1)
APPLICATION SPECIFIC INTEGRATED CIRCUIT (1)
APPLICATION SPECIFIC INTEGRATED CIRCUITS (1)
ASIC (1)
AUTOMATIC MULTILINGUAL SPEECH RECOGNITION (1)
BER (1)
BILINGUAL SPEECH RECOGNITION (1)
BIT ERROR RATE (1)
BIT ERROR RATE (BER) (1)
BP NEURAL NETWORK (1)
BRAIN MODELING (1)
BRAIN MODELS (1)
CALL (1)
CAPACITY PLANNING (1)
CELLULAR RADIO (1)
CELLULAR SYSTEMS (1)
CHANNEL MODELS (1)
CLASSIFICATION NOISE REDUCTION (1)
CLUSTERING METHODS (1)
CO-TRAINING ALGORITHM (1)
COMPLEXITY THEORY (1)
COMPUTER ARCHITECTURE (1)
CONDITIONALLY INDEPENDENT ATTRIBUTE VIEWS (1)
CONFIDENCE (1)
CONNECTIONIST TEMPORAL CLASSIFICATION (1)
CONTEXT (1)
CONTINUOUS DENSITY HIDDEN MARKOV MODEL (1)
COVARIANCE MATRICES (1)
DATA MINING (1)
DATABASES (1)
DEPARTMENT OF ELECTRONIC ENGINEERING (1)
DETECTORS (1)
DISCRIMINATIVE TRAINING (1)
DTW (1)
DYNAMIC KEYWORD MATCHING (1)
EDUCATIONAL INSTITUTIONS (1)
ELECTRICITY (1)
EMOTION RECOGNITION (1)
END-TO-END SYSTEM (1)
ERROR PRONE CHANNEL (1)
ESTIMATION (1)
FADING AVOID TRANSMISSION PROTOCOL (1)
FINITE MARKOV CHAINS WITH ABSORBING STATES (FSMC) (1)
FMPE (1)
FORECASTING PROPERTY (1)
FORECASTING THEORY (1)
FREQUENCY DOMAIN (1)
FREQUENCY-DOMAIN ANALYSIS (1)
FUELS (1)
FUZZY MARKOV CHAIN (1)
GAP (1)
GAP-UTILIZED HANDOVER (1)
GATED RECURRENT UNITS (1)
HANDOVER (1)
HANDOVER INTERRUPTION (1)
HMM CLASSIFIER (1)
HOP BY-HOP RELIABLE TRANSMISSION PROTOCOLS (1)
I-VECTOR (IVEC) (1)
IMAGE DATABASES (1)
IMAGE PROCESSING (1)
ITERATION METHOD (1)
ITERATIVE METHODS (1)
KEYWORD VERIFICATION (1)
LARGE VOCABULARY KEYWORD SPOTTING (1)
LATTICE (1)
LATTICES (1)
LIGHT EMITTING DIODES (1)
LINEAR DYNAMIC PROGRAMMING; MULTI-CONSTRAINTS; FLIPFLOP PHENOMENON; NON-FOSSIL FUEL; REALIZING ROUTES. (1)
LOGIC GATES (1)
LONG SHORT-TERM MEMORY (1)
LOSSLESS TARGET (1)
more

INFONA - science communication portal

Search results for: Jia Liu

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options