Search results

chapter

An Improved Tibetan Lhasa Speech Recognition Method Based on Deep Neural Network

Wenbin Ruan, Zhenye Gan, Bin Liu, Yin Guo

2017 10th International Conference on Intelligent Computation Technology and Automation (ICICTA) > 303 - 306

2017 10th International Conference on Intelligent Computation Technology and Automation (ICICTA)

Deep Neural Networks (DNN) are the dominant technique widely used in English and Chinese speech recognition currently. However, Tibetan speech recognition research starts late and mainly uses Hidden Markov Model (HMM). In this paper, We show a better method of replacing Gaussian Mixture Models (GMM) by DNN to Tibetan Lhasa dialect speech recognition system. The system contains seven layers of features...

chapter

Single-channel speech separation based on deep clustering with local optimization

Taotao Fu, Ge Yu, Lili Guo, Yan Wang, more

2017 3rd International Conference on Frontiers of Signal Processing (ICFSP) > 44 - 49

2017 3rd International Conference on Frontiers of Signal Processing (ICFSP)

There are many challenges in single-channel multi-person mixed speech separation, such as modeling the temporal continuity of the speech signals and improving the frame separation performance simultaneously. In this paper, a separation method based on Deep Clustering with local optimization by the improved Non-Negative Matrix Factorization (NMF) combined with Factorial Conditional Random Fields (FCRF)...

chapter

The hierarchical classification model using Support Vector Machine with multiple kernels in human behavioral pattern recognition

Sorin Soviany, Virginia Sandulescu, Sorin Puscoci

2017 E-Health and Bioengineering Conference (EHB) > 683 - 686

2017 E-Health and Bioengineering Conference (EHB)

The paper proposes a classification model for human behavioral patterns recognition in which the decisions are provided based on several Support Vector Machines classifiers within a multi-level decision structure. SVMs are suitable for applications in which the input data feature spaces are very large, involving many features. The human behavior recognition is a relevant example of such application...

chapter

A comprehensive approach for validating p53 binding site predictions

Tansel Ozyer, Reda Alhajj

2017 8th International Conference on Information Technology (ICIT) > 846 - 853

2017 8th International Conference on Information Technology (ICIT)

Predicting the locations of Response Elements (RE) has received considerable attention in the field of gene sequence analysis and bioinformatics. Protein53 (p53) has a prominent role in the cell cycle and cancer prevention; it functions as a transcription factor and binds with p53 REs in the DNA. The identification of p53 response elements enlightens the unknown functions and characteristics of p53...

chapter

Arabic handwriting recognition using sequential minimal optimization

Hanadi Hassen, Somaya Al-Maadeed

2017 1st International Workshop on Arabic Script Analysis and Recognition (ASAR) > 79 - 84

2017 1st International Workshop on Arabic Script Analysis and Recognition (ASAR)

Due to the variability of writing styles and to other problems related to the nature of Arabic scripts, the recognition of Arabic handwriting is still awaiting accurate results. Segmentation of Arabic handwritten words into graphemes poses a major challenge in Arabic handwriting recognition and is highly error prone. In this paper, we adopt the holistic approach which handles the whole word image...

chapter

Joint optimisation of tandem systems using Gaussian mixture density neural network discriminative sequence training

C. Zhang, P. C. Woodland

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 5015 - 5019

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

The use of deep neural networks (DNNs) for feature extraction and Gaussian mixture models (GMMs) for acoustic modelling is often termed a tandem system configuration and can be viewed as a Gaussian mixture density neural network (MDNN). Compared to the direct use of DNN output probabilities in the acoustic model, the tandem approach suffers from a major weakness in that the feature extraction stage...

chapter

Automated structure discovery and parameter tuning of neural network language model based on evolution strategy

Tomohiro Tanaka, Takafumi Moriya, Takahiro Shinozaki, Shinji Watanabe, more

2016 IEEE Spoken Language Technology Workshop (SLT) > 665 - 671

2016 IEEE Spoken Language Technology Workshop (SLT)

Long short-term memory (LSTM) recurrent neural network based language models are known to improve speech recognition performance. However, significant effort is required to optimize network structures and training configurations. In this study, we automate the development process using evolutionary algorithms. In particular, we apply the covariance matrix adaptation-evolution strategy (CMA-ES), which...

chapter

A Re-estimation Brain Storm Optimization to Train Hidden Markov Model for Transcription Factor Binding Site Analysis

Xinyuan Ma, Ning Xian

2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA) > 134 - 139

2016 15th IEEE International Conference on Machine Learning and Applications (ICMLA)

Computational analysis of transcription factor binding site (TFBS) is one of the most challenging topics in bioinformatics. A set of TFBS sequences is a type of multiple sequence alignment (MSA). Thus, the hidden Markov model (HMM), as a powerful tool to model MSA, has been extensively applied in TFBS analysis. However, with the sizes of TFBS problems, training HMM in a deterministic way is computationally...

chapter

A resource-constrained HCRF modeling for a large-scale speaker identification task

Wei-Tyng Hong

2016 IEEE 5th Global Conference on Consumer Electronics > 1 - 2

2016 IEEE 5th Global Conference on Consumer Electronics

This paper proposes an efficient algorithm on the training of hidden conditional random fields (HCRFs) for large-scale speaker recognition in which a speaker identification task with around 1000 speakers is investigated. HCRFs are a type of direct models in pattern recognition and thus iterative procedures are usually required to estimate the model parameters. The key method in this paper is to perform...

chapter

Variance reduction for optimization in speech recognition

Jen-Tzung Chien, Pei-Wen Huang

2016 IEEE 26th International Workshop on Machine Learning for Signal Processing (MLSP) > 1 - 6

2016 IEEE 26th International Workshop on Machine Learning for Signal Processing (MLSP)

Deep neural network (DNN) is trained according to a mini-batch optimization based on the stochastic gradient descent algorithm. Such a stochastic learning suffers from instability in parameter updating and may easily trap into local optimum. This study deals with the stability of stochastic learning by reducing the variance of gradients in optimization procedure. We upgrade the optimization from the...

chapter

Continuous fundamental frequency prediction with deep neural networks

Balint Pal Toth, Tamas Gabor Csapo

2016 24th European Signal Processing Conference (EUSIPCO) > 1348 - 1352

2016 24th European Signal Processing Conference (EUSIPCO)

Deep learning is proven to outperform other machine learning methods in numerous research fields. However, previous approaches, like multispace probability distribution hidden Markov models still surpass deep learning methods in the prediction accuracy of speech fundamental frequency (F0), inter alia, due to its discontinuous behavior. The current research focuses on the application of feedforward...

chapter

An optimized classification method for human behavioral patterns recognition

Sorin Soviany, Sorin Puscoci

2015 E-Health and Bioengineering Conference (EHB) > 1 - 4

2015 E-Health and Bioengineering Conference (EHB)

The paper proposes an innovative supervised learning method for human behavioral recognition in which the behavioral patterns are classified according to the classes importance. A detector classifier is trained to recognize the human behavioral patterns belonging to the most important class. The optimization is performed by fixing the classifier operating point to provide the appropriate performance...

chapter

Wear process lifetime prediction based on parametric model applied to experimental data

Nejra Beganovic, Dirk Soffker

2015 IEEE Conference on Prognostics and Health Management (PHM) > 1 - 6

2015 IEEE Conference on Prognostics and Health Management (PHM)

Lifetime prediction of a technical system plays a significant role also with respect to the avoidance of breakdowns. The first part of this contribution is a brief review of lifetime models followed by an introduction of a new parametric lifetime model. Experimental data for the lifetime model training and evaluation are taken from a tribological system describing a wear process. The main focus of...

chapter

Nonlinear discriminant analysis with neural networks for speech recognition

Vincent Fontaine, Christophe Ris, Henri Leich

1996 8th European Signal Processing Conference (EUSIPCO 1996) > 1 - 4

1996 8th European Signal Processing Conference (EUSIPCO 1996)

Linear Discriminant Analysis (LDA) has been applied successfully to speech recognition tasks, improving accuracy and robustness against some types of noise. However, it is well known that LDA suffers from some weaknesses if the distributions are not unimodal or when the mean of the distributions are shared. In this paper, we propose to take advantage of the nonlinear discriminant properties of the...

chapter

Investigations on sequence training of neural networks

Simon Wiesler, Pavel Golik, Ralf Schluter, Hermann Ney

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 4565 - 4569

ICASSP 2015 - 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper we present an investigation of sequence-discriminative training of deep neural networks for automatic speech recognition. We evaluate different sequence-discriminative training criteria (MMI and MPE) and optimization algorithms (including SGD and Rprop) using the RASR toolkit. Further, we compare the training of the whole network with that of the output layer only. Technical details...

chapter

A dialog management methodology based on evolving Fuzzy-rule-based (FRB) classifiers

David Griol, Jose Antonio Iglesias, Agapito Ledezma, Araceli Sanchis

2014 IEEE Conference on Evolving and Adaptive Intelligent Systems (EAIS) > 1 - 8

2014 IEEE Conference on Evolving and Adaptive Intelligent Systems (EAIS)

This paper proposes a statistical methodology based on evolving Fuzzy-rule-based (FRB) classifiers to develop dialog managers for spoken dialog systems. The dialog managers developed by means of our proposal select the next system action by considering a set of dynamic rules that are automatically obtained by means of the application of the FRB classification process. Our approach has the main advantage...

chapter

A Sentence-Pitch-Contour Model for Indiginous Language (Galo) Using Vector Quantization (VQ) and Hidden Markov Model

Akalpita Das, Laba Kr. Thakuria, Purnendu Acharjee, P.H. Thakdar

2014 Fourth International Conference on Communication Systems and Network Technologies > 944 - 947

2014 International Conference on Communication Systems and Network Technologies (CSNT)

A model is proposed to developed a Indigenous language (Galo) sentence's pitch-contour with sentence-wide optimization, called the sentence pitch-contour using HMM(Hidden Markov Model) & VQ (vector quantization). To develop a sentence pitch-contour (SPC-HMM), each training sentence are normalized for the pitch-contours of the syllables. Our model is effective for pitch height normalization...

chapter

Phone classification using HMM/SVM system and normalization technique

Mohammed Sidi Yakoub, Roger Nkambou, Sid-Ahmed Selouani

IEEE International Symposium on Signal Processing and Information Technology > 96 - 101

2013 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT)

Support vector machines (SVM) were originally developed for binary classification and extended for multi-class classification. Due to their powerfulness and adaptation to hard classification problems, we have chosen them for automatic speech recognition (ASR). The aim of this paper is to investigate the use of SVM multi-class classification coupled with HMM for TIMIT phones. SVM requires that all...

chapter

Pattern Identification Using Reconstructed Phase Space and Hidden Markov Model

Wenjing Zhang, Xin Feng

2012 11th International Conference on Machine Learning and Applications > 1 > 374 - 379

2012 Eleventh International Conference on Machine Learning and Applications (ICMLA)

In this paper we present a method for identification of temporal patterns that are predictive of events in a dynamic data system. The proposed new MRPS-HMM method applies a hybrid model using Reconstructed Phase Space (RPS) and stochastic state estimation via Hidden Markov Model (HMM) to search predictive patterns. This method constructs a multivariate phase space by embedding each data sequence with...

chapter

The optimization model of sensors set based on hidden Markov model

Yan Tian, Jian-Min Zhao, Xia Tian, Zhong-Hua Cheng

2012 International Conference on Quality, Reliability, Risk, Maintenance, and Safety Engineering > 603 - 606

2012 International Conference on Quality, Reliability, Risk, Maintenance, and Safety Engineering (QR2MSE)

The design processes and methods of PHM system based on HMM have been investigated. HMM has some advantage in terms of dealing with small sample size and high discerning accuracy. The rationality of sensors set which is based on the hidden Markov model has been evaluated from quantitative point of view. Then the evaluating method of different sensor sets based on HMM has been put forward. At last,...

INFONA - science communication portal

Search results

An Improved Tibetan Lhasa Speech Recognition Method Based on Deep Neural Network

Single-channel speech separation based on deep clustering with local optimization

The hierarchical classification model using Support Vector Machine with multiple kernels in human behavioral pattern recognition

A comprehensive approach for validating p53 binding site predictions

Arabic handwriting recognition using sequential minimal optimization

Joint optimisation of tandem systems using Gaussian mixture density neural network discriminative sequence training

Automated structure discovery and parameter tuning of neural network language model based on evolution strategy

A Re-estimation Brain Storm Optimization to Train Hidden Markov Model for Transcription Factor Binding Site Analysis

A resource-constrained HCRF modeling for a large-scale speaker identification task

Variance reduction for optimization in speech recognition

Continuous fundamental frequency prediction with deep neural networks

An optimized classification method for human behavioral patterns recognition

Wear process lifetime prediction based on parametric model applied to experimental data

Nonlinear discriminant analysis with neural networks for speech recognition

Investigations on sequence training of neural networks

A dialog management methodology based on evolving Fuzzy-rule-based (FRB) classifiers

A Sentence-Pitch-Contour Model for Indiginous Language (Galo) Using Vector Quantization (VQ) and Hidden Markov Model

Phone classification using HMM/SVM system and normalization technique

Pattern Identification Using Reconstructed Phase Space and Hidden Markov Model

The optimization model of sensors set based on hidden Markov model

Filter options

Publication date

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options