The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Hybrid models combine different technologies to obtain a product that shares their advantages and minimizes their deficiencies. The solutions given by a case-based system (CBS) rely on similar past experiences, which are commonly described in terms of both symbolic and continuous attributes. The nearest neighbor (NN) principle commonly followed to develop CBS for classification task proceeds from...
This paper presents a hybrid optimization method based on the fusion of the clonal selection algorithm (CSA) and harmony search (HS) technique. The CSA is employed to improve the members of the harmony memory in the HS method. The hybrid optimization algorithm is further used to optimize a fuzzy classification system for the Fisher Iris data classification. Computer simulations results demonstrate...
This paper presents an application of neural network interleaved training algorithm proposed in in the domain of chess. In order to use the referenced learning method a structure of metric space is introduced in the space of chess moves. Neural network is used as a classifier of a distance from a given move to the optimal one, leading to significant limitation of the set of moves potentially worth...
Ensemble methods like bagging combine the decisions of multiple classifiers in order to obtain more accuracy than a single classifier. This paper studies the use of bagging for a region oriented symbolic classifier. Experiments with two artificial data sets, generated according to bi-variate normal distributions have been performed in order to show the usefulness of bagging for this symbolic classifier...
The class imbalance problem (when one of the classes has much less samples than the others) is of great importance in machine learning, because it corresponds to many critical applications. In this work we introduce the recursive partitioning of the majority class (REPMAC) algorithm, a new hybrid method to solve imbalanced problems. Using a clustering method, REPMAC recursively splits the majority...
Information extraction (IE) aims to extract from textual documents only the fragments which correspond to datafields required by the user. In this paper, we present new experiments evaluating a hybrid machine learning approach for IE that combines text classifiers and hidden Markov models (HMM). In this approach, a text classifier technique generates an initial output, which is refined by an HMM,...
In this paper, we study a single objective extension of support vector machines for multicategory classification. Extending the dual formulation of binary SVMs, the algorithm looks for minimizing the sum of all the pairwise distances among a set of prototypes, each one constrained to one of the convex-hulls enclosing a class of examples. The final discriminant system is built looking for an appropriate...
Machines learning techniques have been applied in several different problems in bioinformatics. Similarly, pattern discovery algorithms have also been used to uncover hidden motifs in protein sequences, contributing greatly to the understanding of the problem of protein classification. G-protein coupled receptors (GPCRs) represent one of the largest protein families in Human Genome. Most of these...
Machine Learning techniques have been largely applied to the problem of class prediction in microarray data. Nevertheless, current approaches to select appropriate methods for such task often result unsatisfactory in many ways, instigating the need for the development of tools to automate the process. In this context, the authors introduce the use of metalearning in the specific domain of gene expression...
In this work our aim is to increase the performance of fuzzy rule based classifications systems in the framework of imbalanced data-sets by means of the application of a genetic tuning step. We focus on the imbalanced data-set problem since it appears in many real application areas and, for this reason, it has become a relevant topic in the area of machine learning. This problem occurs when the number...
Multi-objective meta-heuristics permit to conceive a complete novel approach to induce classifiers, where the properties of the rules can be expressed in different objectives, and then the algorithm finds these rules in an unique run by exploring Pareto dominance concepts. Furthermore, these rules can be used as an unordered classifier, in this way, the rules are more intuitive and easier to understand...
Multi-dimensional classification is a generalization of supervised classification that considers more than one class variable to classify. In this paper we review the existing multi-dimensional Bayesian classifiers and introduce a new one: the KDB multi-dimensional classifier. Then we define different classification rules for multi-dimensional scope. Finally, we introduce a structural learning approach...
In this paper, we highlight the use of synthetic data sets to analyze learners behavior under bounded complexity. We propose a method to generate synthetic data sets with a specific complexity, based on the length of the class boundary. We design a genetic algorithm as a search technique and find it useful to obtain class labels according to the desired complexity. The results show the suitability...
Radial basis function networks (RBFNs) have shown their capability to be used in classification problems, so that many data mining algorithms have been developed to configure RBFNs. These algorithms need to be given a suitable set of parameters for every problem they face, thus methods to automatically search the values of these parameters are required. This paper shows the robustness of a meta-algorithm...
For a given data set, different learning algorithms typically provide different classifiers. Although it is possible to simply select the most successful classifier, the less successful classifiers could have potentially valuable information that may be wasted. This work proposes GAESC, an algorithm for evolving a set of classifiers into a single symbolic classifier using genetic algorithms. Individuals...
This paper presents an efficient distributed genetic algorithm for classification rules extraction in data mining, which is based on a new method of dynamic data distribution applied to parallelism using networks of computers in order to mine large datasets. The presented algorithm shows many advantages when compared with other distributed algorithms proposed in the specific literature. In this way,...
Rapid developments in computing-related technologies have enabled the collection of large amounts of data at unprecedented rates from diverse systems, both natural and engineered. The availability of such data has motivated the development of intelligent systems to gain new insights into how these systems work, leading thereby to superior decision making. In this paper we present recent advances in...
The use of feature selection can improve accuracy, efficiency, applicability and understandability of a learning process and the resulting learner. For this reason, many methods of automatic feature selection have been developed. By using the modularization of feature selection process, this paper evaluates a wide spectrum of these methods and some additional ones created by combination of different...
Logistic regression (LR) has become a widely used and accepted method to analyse binary or multiclass outcome variables, since it is a flexible tool that can predict the probability for the state of a dichotomous variable. A recently proposed LR method is based on the hybridisation of a linear model and evolutionary product-unit neural network (EPUNN) models for binary classification. This produces...
The main objective of this work is to automatically design neural network models with sigmoidal basis units for classification tasks, so that classifiers are obtained in the most balanced way possible in terms of CCR and sensitivity (given by the lowest percentage of examples correctly predicted to belong to each class). We present a memetic Pareto evolutionary NSGA2 (MPENSGA2) approach based on the...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.