The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
One of the interesting and important subjects among researchers in the field of medical and computer science is diagnosing illness by considering the features that have the most impact on recognitions. The subject discusses a new concept which is called Medical Data Mining (MDM). Indeed, data mining methods use different ways such as classification and clustering to classify diseases and their symptoms...
In this paper, a novel approach for classification rule mining is presented. The remarkable relationship between the rule extraction procedure and the concept of multiobjective optimization is emphasized. The range values of features composing the rules are handled as decision variables in the modelled multiobjective optimization problem. The proposed method is applied to three well-known datasets...
Web recommendation systems are helpful in overcoming the excess information on web by retrieving the information required by the user with respect to user's or similar users' preferences and interests. In order to make web recommendation system work, web users have to be clustered based on their common interest. The web user clusters are used to obtain the knowledge about the web pages accessed. This...
Feature selection is an important preprocessing in data mining, it aims to reduce the computational complexity of learning algorithm, and to improve the performance of data mining algorithms by removing irrelevant and redundant features. In the framework of discrete-valued feature selection, this paper experimentally compares two feature selection methods which are based on generic algorithm. The...
This paper presents a new hybrid HPSO-DE classification algorithm that combines the advantages of particle swarm optimization algorithm and differential evolution algorithm. Major improvements achieved by this combination are 1) flight improvement — flight behaviors are more and better diversified because each of the top 3 particles gets put into 3 different groups of the rest and then each group...
Data mining is an advanced technology, which is the process of discovering actionable information from large set of data, which is used to analyze large volumes of data and extracts patterns that can be converted to useful knowledge. Medical data mining has a great potential for exploring the hidden patterns in the data sets of medical domain. These patterns can be utilized to do clinical diagnosis...
Feature selection reduces a data set into a subset which also represents the entire data with less computational complexity and performance does not affect much. However, to extract such a subset is a nontrivial task although there are a number of methods to handle this problem. In the near past an approach based on rough set have been used for feature selection. The dependency measure is one of the...
Utilization of machine learning algorithms in time-series data analysis is crucial to effective decision making in today's dynamic and competitive environment. One data type of growing interest is the electricity consumer load profile (LP) data. Owing to advances in the smart grid, immense amount of LP data became available to policymakers as potential to improving the electricity sector. Due to the...
Features selection (FS) techniques have an apparent need in many complex engineering applications especially the bearing fault diagnosis of low-speed industrial motor. The main goal of an FS algorithm is to select the most discriminant features subset from a high-dimension features vector that increases the model performance by reducing the redundant and irrelevant fault features. This paper proposes...
In this article, we present an application of metaheuristics optimization approaches to improve medical classifier performance. Genetic Algorithm (GA), Simulated Annealing (SA) and Particle Swarm Optimization (PSO) have been applied in conjunction with Least Square Support Vector Machine (LS-SVM) approach to optimize the total misclassification error in term of False Positive and False Negative rates...
The feature subset selection, along with the parameters of classifier significantly influences the classification accuracy. In order to ensure the optimal classification performance, the artificial bee colony (ABC) algorithm is proposed to simultaneously optimize the feature subset and the parameters of support vector machines (SVM), meanwhile for improving the optimizing performance of ABC algorithm,...
Development of microarray technology makes the number of research about Bioinformatics will increase as well. Microarray dataset contains genetic information and can be used to analyze thousands of samples and features. Especially in cancer research, the cancer data is generated by microarray technology, and will be a primary data for training and testing in the machine learning process. The main...
Generally, information system handles huge volume of dataset. Classifiers provide poor performance when such dataset are feed into it for categorization due to their high dimension. The most important attributes are extracted from the dataset prior to classification for efficient classifier design. It is also true that there may be many classifiers of a particular system, some provide better accuracy...
Recent advancements in sensor technology offer opportunities to manage business processes in a proactive manner. To enable an effective and real-time monitoring, sensor data have to be treated and processed in an event processing manner. Complex Event Processing is an efficient technology that detects useful complex events by matching primitive sensor events using event patterns. Event patterns can...
Student performance classification is a challenging task for teacher and stakeholder for better academic planning and management. Data mining can be used to find knowledge from student data to improve the performance of classifying model. Before applying a classification model, feature selection method is proposed in data preprocessing process to find out the most significant and intrinsic features...
This paper presents the improved algorithm for the Hybrid Approach of Neural network and Level-2 Fuzzy set (HANN-L2F). The main structure is including 2 parts. The first part is Neuro-Fuzzy system, including the MLP Neural network with the combination of the level-2 Fuzzy system. The second part is using k-nearest neighbor to classify the output from Neuro-fuzzy. The HANN-L2F is an algorithm with...
The challenge to choose the best algorithm and its best parameters for a given problem is known as Combined Algorithm Selection and Hyperparameter Optimization Problem. Among all the classification algorithms available are those based on human comprehensible representations, such as decision trees and classification rule induction. These algorithms are usually chosen by the clarity of the results...
With the advent of the computer science, the data volume that needed to be processed under many practical situations increases dramatically, challenging many traditional machine learning techniques. Bearing this in mind, we made an intensive study on the optimization of decision tree algorithm and its corresponding porting to the big data analysis in this paper. An optimized genetic algorithm is merged...
Data mining has been an active area of research for the past couple of decades. Classification is an important data mining technique that consists of assigning a data instance to one of the several predefined categories. Various successful methods have already been suggested and tested to solve the problems of the classification. In this paper, author proposed a new hybrid classifier by combining...
Genetic programming (GP) has been applied to solve data classification problems numerous times in previous studies and the findings in the literature confirm that GP is able to perform well. In more recent studies, researchers have shown that using a team of classifiers can outperform a single classifier. These teams are referred to as ensembles. Previously, several different attempts at creating...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.