The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
In this paper, we present a consumption pattern recognition system based on SVM. It can produce an optimized classification pattern using SVM algorithm and use the pattern to predict consumer behaviors. In this system, three dimension reduction methods including Principal Component Analysis (PCA), correlation analysis and data cubes are applied to reduce dimension of features and two training methods...
Non-wood forest is a kind of important forest resource. This paper focused on the information extraction of non-wood forest based on Advanced Land Observation Satellite (ALOS) data. Band characteristics were analyzed to get understanding of this data wholly by information content, correlation coefficient and Optimum Index Factor (OIF). A new set of data with eight bands were obtained by the fusion...
We present an approach for learning models that obtain accurate classification of large scale data objects, collected in spatiotemporal domains. The model generation is structured in three phases: pixel selection (spatial dimension reduction), spatiotemporal features extraction and feature selection. Novel techniques for the first two phases are presented, with two alternatives for the middle phase...
This paper introduces a new extension of outlier detection approaches and a new concept, class separation through variance. We show that accumulating information about the outlierness of points in multiple subspaces leads to a ranking in which classes with differing variance naturally tend to separate. Exploiting this leads to a highly effective and efficient unsupervised class separation approach,...
We explore the framework of permutation-based p-values for assessing the behavior of the classification error. In this paper we study two simple permutation tests. The first test estimates the null distribution by permuting the labels in the data; this has been used extensively in classification problems in computational biology. The second test produces permutations of the features within classes,...
Text classification categories Web documents in large collections into predefined classes based on their contents. Unfortunately, the classification process can be time-consuming and users are still required to spend considerable amount of time scanning through the classified Web documents to identify the ones that satisfy their information needs. In solving this problem, we first introduce CorSum,...
Data-related issues represent the main causes for insufficient performance in data mining. Existing strategies for tackling these issues include procedures for handling incomplete data - mandatory in various schemes, and feature selection, both augmenting the learning process. Our previous work on data imputation has shown that a good imputation policy for strongly correlated attributes with the class...
One of the most important issues in any association rule mining is the interpretation and evaluation of discovered rules. Thus, most algorithms employ the support-confidence framework for evaluating association and classification rules. Unfortunately, recent studies show that the support and confidence measures are insufficient for filtering out uninteresting association rules, for instance, even...
Ensemble approaches to classification have attracted a great deal of interest in recent years. Many methods have been developed to create the diversity among the classifiers. At present, there are two kinds of diversity creation methods: data partitioning and attributes partitioning. In some applications, attribute partitioning methods are capable of performance superior to data partitioning methods...
In classification problems, the class imbalance problem will cause a bias on the training of classifiers and will result in the lower sensitivity of detecting the minority class examples. The Mahalanobis-Taguchi System (MTS) is a diagnostic and forecasting technique for multivariate data. MTS establishes a classifier by constructing a continuous measurement scale rather than directly learning from...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.