The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
This paper introduces a novel paradigm to impute missing data that combines a decision tree with an auto-associative neural network (AANN) based model and a principal component analysis-neural network (PCA-NN) based model. For each model, the decision tree is used to predict search bounds for a genetic algorithm that minimise an error function derived from the respective model. The modelspsila ability...
In this work we use support vector machine to predict polyadenylation sites (Poly (A) sites) in human DNA and mRNA sequences by analyzing features around them. Two models are created. The first model identifies the possible location of the Poly (A) site effectively. The second model distinguishes between true and false Poly (A) sites, hence effectively detect the region where Poly (A) sites and transcription...
The restricted structure of fuzzy grid type based partitioning commonly employed in fuzzy model is limiting the fuzzy model on the whole to accurately describe the underlying distribution of data points in feature space. Common solution via the use of more linguistic terms to finely describe the feature space would confute the whole idea of introducing approximate reasoning. This paper proposes the...
In this paper, a new sub-domain intelligent modeling method based on neural networks is proposed for modeling the nonlinear multivariate process. The new modeling method decomposes the process into several levels sub-models and the low level models are the sub-model of the high level models. Since the modeling method is step by step to build the sub-models from low level models to high level models,...
The research on unsupervised feature selection is scarce in comparison to that for supervised models, despite the fact that this is an important issue for many clustering problems. An unsupervised feature selection method for general Finite Mixture Models was recently proposed and subsequently extended to generative topographic mapping (GTM), a manifold learning constrained mixture model that provides...
The purpose of this study was to develop a product design model for impact toughness estimation of low-alloy steel plates. Based on these estimates, the rejection probability of steel plates can be approximated. The target variable was formulated from three Charpy-V measurements with a LIB transformation, because the mean of the measurements would have lost valuable information.The method is suitable...
This paper first reviews extreme learning machine (ELM) in light of coverpsilas theorem and interpolation for a comparative study with radial-basis function (RBF) networks. To improve generalization performance, a novel method of combining a set of single ELM networks using stacked generalization is proposed. Comparisons and experiment results show that the proposed stacking ELM outperforms a single...
This study combines wavelet-based feature extractions with kernel partial least square (PLS) regression for international stock index forecasting. Wavelet analysis is utilized as a preprocessing step to decompose and extract most important time scale features from high dimensional input data. Owing to the high dimensionality and heavy multi-collinearity of the input data, a kernel PLS regression model...
In this paper we describe a model for classifying binary data using classifiers based on Bernoulli mixture models. We show how Bernoulli mixtures can be used for feature extraction and dimensionality reduction of raw input data. The extracted features are then used for training a classifier for supervised labeling of individual sample points. We have applied this method to two different types of datasets,...
The ensemble paradigm for machine learning has been studied for more than two decades and many methods, techniques and algorithms have been developed, and increasingly used in various applications. Nevertheless, there are still some fundamental issues remaining to be addressed, and an important one is what factors affect the accuracy of an ensemble, and to what extent they do, which is thus taken...
The Self-Organizing Map (SOM) is a famous algorithm for the unsupervised learning and visualization introduced by Teuvo Kohonen. This study proposes the Lazy Self-Organizing Map (LSOM) algorithm which reflects the world of worker ants. In LSOM, three kinds of neurons exist: worker neurons, lazy neurons and indecisive neurons. We apply LSOM to various input data set and confirm that LSOM can obtain...
Probabilistic models were developed to provide predictive distributions of daily maximum surface level ozone concentrations. Five forecast models were compared at two stations (Chilliwack and Surrey) in the Lower Fraser Valley of British Columbia, Canada, with local meteorological variables used as predictors. The models were of two types, conditional density models and Bayesian models. The Bayesian...
Pattern classification is an important task in speech recognition and speaker verification. Given the feature vectors of an input the goal is to capture the characteristics of these features unique to each class. This paper deals with exploring Auto Associative Neural Network (AANN) models for the task of speaker verification and speech recognition. We show that AANN models produce comparable performance...
Humans have a drive to maximize knowledge of the world, yet decision making data also suggest a contrary drive to minimize cognitive effort using simplifying heuristics. The trade-off between maximizing knowledge and minimizing effort is modeled by simulation of a challenging decision task. The task is to choose which of two gambles has the highest probability of success when the alternative with...
The study and inference of biological pathways and gene regulation mechanisms has become a vital component of modern medicine and drug discovery. Gene expression studies make it possible to understand these mechanisms by simultaneously measuring the expression level of thousands of genes. These data though rich in information are also prone to many quality control issues that ultimately result in...
Automatically inferring drug gene regulatory networks models from microarray time series data is a challenging task. The ordinary differential equations models are sensible, but difficult to build. We extended our reverse engineering algorithm for gene networks (RODES), based on genetic programming, by adding a neural networks feedback linearization component. Thus, RODES automatically discovers the...
Automatic pattern classifiers that allow for incremental learning can adapt internal class models efficiently in response to new information, without having to retrain from the start using all the cumulative training data. In this paper, the performance of two such classifiers - the fuzzy ARTMAP and Gaussian ARTMAP neural networks - are characterize and compared for supervised incremental learning...
A challenging task is to classify Internet customers based on their heterogeneous search histories of shopping in the Internet. The problem is the data pattern itself. Each transition of a customer from one page to the next in purchasing a commodity is considered as an attribute and this is a pair of data. The purchase patterns consist of usually different length for different customers. We cannot...
Adaptive resonance theory (ART) is an unsupervised neural network. Fuzzy ART (FART) is a variation of ART, allows both binary and continuous input patterns. However, fuzzy ART has the category proliferation problem. In this study, to solve this problem, we propose a new fuzzy ART algorithm: fuzzy ART combining overlapped category in consideration of connections (C-FART). C-FART has two important features...
This paper presents an on-line, continuously learning mechanism for sequence data. The proposed approach is based on SOINN-DTW method (Okada and Hasegawa, 2007), which is designed for learning of sequence data. It is based on self-organizing incremental neural network (SOINN) and dynamic time warping (DTW). Using SOINNpsilas function represents the topological structure of online input data, the output...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.