Wyniki wyszukiwania

Pozycje od 1 do 20 spośród 23 wyników

Poprzednia

Następna

rozdział

Classification Models with Global Constraints for Ordinal Data

J S Cardoso, R Sousa

2010 Ninth International Conference on Machine Learning and Applications > 71 - 77

2010 Ninth International Conference on Machine Learning and Applications (ICMLA 2010)

Ordinal classification is a form of multi-class classification where there is an inherent ordering between the classes, but not a meaningful numeric difference between them. Although conventional methods, designed for nominal classes or regression problems, can be used to solve the ordinal data problem, there are benefits in developing models specific to this kind of data. This paper introduces a...

rozdział

A gene selection approach for classifying diseases based on microarray datasets

T H A Soliman, A A Sewissy, H AbdelLatif

2010 2nd International Conference on Computer Technology and Development > 626 - 631

2nd International Conference on Computer Technology and Development (ICCTD 2010)

Gene Selection is very important problem in the classification of serious diseases in clinical information systems. A limitation of these gene selection methods is that they may result in gene sets with some redundancy and yield an unnecessary large number of candidate genes for classification analysis. In the current work, a hybrid approach is presented in order to classify diseases, such as colon...

rozdział

County Level of Basic Public Services Classification Based on Support Vector Machine: Taking Guanzhong Urban Agglomeration as the Example

Zhao Jing, Dang Xinghua

2010 3rd International Conference on Information Management, Innovation Management and Industrial Engineering > 1 > 654 - 657

2010 3rd International Conference on Information Management, Innovation Management and Industrial Engineering (ICIII 2010)

The county level of basic public services analysis and classification play an important role in county economic growth and improve benefit of healthy development of urbanization in China. According to the county level of basic public services data which is large scale and imbalance, this paper presented a support vector machine model to classify the county level of basic public services. The method...

rozdział

An Evaluation of Rule-Based Classification Models Induced by a Fuzzy Method and Two Classic Learning Algorithms

M E Cintra, M C Monard, H de Arruda Camargo

2010 Eleventh Brazilian Symposium on Neural Networks > 188 - 193

2010 Eleventh Brazilian Symposium on Neural Networks (SBRN 2010)

Classification is a widely researched area in the machine learning and fuzzy communities with several approaches proposed by both communities. Some of the most relevant rule-based approaches from the machine learning community might include decision trees and rule inducers. The fuzzy community has also proposed many rule-based approaches, such as fuzzy decision trees and genetic fuzzy systems. This...

rozdział

An improved algorithm of decision tree for classifying large data set based on rainforest framework

B Thangaparvathi, D Anandhavalli

2010 INTERNATIONAL CONFERENCE ON COMMUNICATION CONTROL AND COMPUTING TECHNOLOGIES > 800 - 805

2010 International Conference on Communication Control and Computing Technologies

Data mining or Knowledge discovery is seen as an increasingly important tool by modern business to transform data into an informational advantage. Mining is a process of finding correlations among dozens of fields in large relational databases and extracts useful information that can be used to increase revenue, cuts costs, or both. Classification is a supervised machine learning procedure and an...

rozdział

Data Mining in census data with CART

Bin Sheng, Sun Gengxin

2010 3rd International Conference on Advanced Computer Theory and Engineering(ICACTE) > 3 > V3-260 - V3-264

2010 3rd International Conference on Advanced Computer Theory and Engineering (ICACTE 2010)

Census can provide the fundamental population data of the whole nation. The census data are rich with hidden information that can be used for the investigation of national conditions and national power. Data Mining aims at extract the implicit, previously unknown, and potentially useful knowledge from voluminous, non-complete, fuzzy, stochastic data. Using Data Mining in census data can make full...

rozdział

Text Separation from Mixed Documents Using a Tree-Structured Classifier

Xujun Peng, Srirangaraj Setlur, Venu Govindaraju, Ramachandrula Sitaram

2010 20th International Conference on Pattern Recognition > 241 - 244

2010 20th International Conference on Pattern Recognition (ICPR 2010)

In this paper, we propose a tree-structured multi-class classifier to identify annotations and overlapping text from machine printed documents. Each node of the tree-structured classifier is a binary weak learner. Unlike normal decision tree(DT) which only considers a subset of training data at each node and is susceptible to over-fitting, we boost the tree using all training data at each node with...

rozdział

Review of decision trees

Xie Niuniu, Liu Yuxun

2010 3rd International Conference on Computer Science and Information Technology > 5 > 105 - 109

2010 3rd IEEE International Conference on Computer Science and Information Technology (ICCSIT 2010)

The decision tree algorithm is a hot point in the field of data mining, which is usually used to form classifiers and prediction models. In practice, it has a wide application. This paper describes the decision tree technology and its development process, focuses on typical decision tree algorithms, analyzes their advantages and disadvantages, compares several algorithms, and finally discusses the...

rozdział

A framework of classifying maintenance requests based on learning techniques

Naghmeh Mahmoodian, Rusli Abdullah, Masrah Azrifah Azmi Murad

2010 International Conference on Information Retrieval&Knowledge Management (CAMP) > 245 - 249

2010 International Conference on Information Retrieval and Knowledge Management (CAMP 2010)

Classify maintenance request is one of the processes in the large software system to support maintainers in doing their daily maintenance tasks more effectively. Categorizing these maintenance requests are an essential requirement in managing the maintenance request for software maintainer and need a great effort as well as determining classification. Hence, this paper presents the framework from...

rozdział

Empirical study on the performance of the classifiers based on various criteria using ROC curve in medical health care

E Chandra Blessie, E Karthikeyan, B Selvaraj

2010 International Conference on Communication and Computational Intelligence (INCOCCI) > 515 - 518

2010 International Conference on Communication and Computational Intelligence (INCOCCI)

Classification is one of the most efficient data mining techniques in Machine Learning. In classification, Decision trees can handle high dimensional data. But, decision trees yield poor performance in medical health care. So, In this paper, we investigate the use of Receiver Operating Characteristic (ROC) curve for the evaluation of machine learning algorithms. In particular, we investigate the use...

rozdział

Using data mining predictive models to classify credit card applicants

Yap Bee Wah, Irma Rohaiza Ibrahim

2010 6th International Conference on Advanced Information Management and Service (IMS) > 394 - 398

2010 6th International Conference on Advanced Information Management and Service (IMS 2010)

Credit scoring using predictive models can help in the process of assessing credit worthiness during the credit evaluation process. The objective of credit scoring models is to assign credit risk score to determine if a customer is likely to default on the financial obligation. Construction of credit scoring models requires data mining techniques. Using historical data on payments, demographic characteristics...

rozdział

Usage of association rules and classification techniques in knowledge extraction of diabetes

S M Nuwangi, C R Oruthotaarachchi, J M P P Tilakaratna, H A Caldera

2010 6th International Conference on Advanced Information Management and Service (IMS) > 372 - 377

2010 6th International Conference on Advanced Information Management and Service (IMS 2010)

This research paper uses association rules and classification techniques to extract undiscovered information of diabetes. Previous phase of this research included the preliminary results of some undiscovered decision factors and side effects of diabetes, by considering diabetes type 1 and type 2 patients' data set. Advanced and reliable data mining techniques are used throughout this research to the...

rozdział

Fuzzy clustering decision tree for classifying working wafers of ion implanter

Shih-Cheng Horng, Yu-Liang Hsiao

2009 IEEE International Conference on Industrial Engineering and Engineering Management > 703 - 707

2009 IEEE International Conference on Industrial Engineering and Engineering Management (IEEM 2009)

In this paper, we propose a fuzzy clustering decision tree (FCDT) for the classification problem with large number of classes and continuous attributes. A hierarchical clustering concept is introduced to achieve a finer fuzzy partition. The proposed clustering algorithm split the data set into leaf clusters using splitting attributes based on a separation matrix and fuzzy rules. The leaf clusters...

rozdział

City Scientific and Technological Progress Level Classification Based on Support Vector Machine

Jing Zhao, Xinghua Dang

2009 International Conference on Information Management, Innovation Management and Industrial Engineering > 1 > 110 - 113

2009 International Conference on Information Management, Innovation Management and Industrial Engineering (ICIII 2009)

City scientific and technological progress level classification and promotion play a central role in spurring city income growth and reducing poverty. Based on the Chinese city data availability, this paper built evaluation index system on the level of city scientific and technological progress. According to the city scientific and technological progress data which is large scale and imbalance, this...

rozdział

Classification for talent management using Decision Tree Induction techniques

H. Jantan, A.R. Hamdan, Z.A. Othman

2009 2nd Conference on Data Mining and Optimization > 15 - 20

2009 2nd Conference on Data Mining and Optimization

Classification is one of the tasks in data mining. Nowadays, there are many classification techniques being used to solve classification problems such as neural network, genetic algorithm, Bayesian and others. In this article, we attempt to present a study on how talent management can be implemented using decision tree induction techniques. By using this approach, talent performance can be predicted...

rozdział

Discovery of Biomarker Genes from Earthworm Microarray Data by Discriminant Analysis and Clustering

Ying Li, Nan Wang, E.J. Perkins, Ping Gong

2009 International Joint Conference on Bioinformatics, Systems Biology and Intelligent Computing > 23 - 29

2009 International Joint Conference on Bioinformatics, Systems Biology and Intelligent Computing (IJCBS)

Monitoring, assessment and prediction of environmental risks that chemicals pose demand rapid and accurate diagnostic assays. One important goal of microarray experiments is to discover novel biomarkers for toxicity evaluation. A variety of toxicological effects have been associated with explosive compounds 2,4,6-trinitrotoluene (TNT) and 1,3,5-trinitro-1,3,5-triazacyclohexane (RDX). Here we developed...

rozdział

Applying Distributed Classification Algorithms to Wireless Sensor Networks A Brief View into the Application of the SPRINT Algorithm Family

B. Lantow

Seventh International Conference on Networking (icn 2008) > 52 - 59

2008 Seventh International Conference on Networking (ICN '08)

The SPRINT algorithm describes a distributed way to construct a decision tree for classification in large data sets. It can be applied to in-network classification tree construction. The costly data transfer of sensor data to the sink can be avoided while execution time is still acceptable. The SPRINT algorithm and its extensions are introduced. Furthermore, different scenarios that implement classification...

rozdział

Investigating Learning Methods for Binary Data

S. Visa, A. Ralescu, M. Ionescu

NAFIPS 2007 - 2007 Annual Meeting of the North American Fuzzy Information Processing Society > 441 - 445

NAFIPS 2007. Annual Meeting of the North American Fuzzy Information Processing Society

Michie et al. show in [1] that decision trees perform better than twenty other classification algorithms in classifying binary data. In this paper we further investigate this hypothesis by comparing the decision trees with a fuzzy set-based classifier and the naive Bayes on real and artificial datasets.

artykuł

Toward Exploratory Test-Instance-Centered Diagnosis in High-Dimensional Classification

C.C. Aggarwal

IEEE Transactions on Knowledge and Data Engineering > 2007 > 19 > 8 > 1001 - 1015

High-dimensional data is a difficult case for most subspace-based classification methods because of the large number of combinations of dimensions, which have discriminatory power. This is because there are an exponential number of combinations of dimensions that could decide the correct class instance, and this combination could vary with data locality and test instance. Therefore, most summarized...

rozdział

Naive Bayes Classification Given Probability Estimation Trees

Zengchang Qin

2006 5th International Conference on Machine Learning and Applications (ICMLA'6) > 34 - 42

2006 International Conference on Machine Learning and Applications

Tree induction is one of the most effective and widely used models in classification. Unfortunately, decision trees such as C4.5 have been found to provide poor probability estimates. By the empirical studies, Provost and Domingos found that probability estimation trees (PETs) give a fairly good probability estimation. However, different from normal decision trees, pruning reduces the performances...

Poprzednia

Następna

Opcje filtrowania

Słowa kluczowe:
PATTERN CLASSIFICATION
DECISION TREE

Data publikacji

Ustaw własny zakres dat

INFONA - portal komunikacji naukowej

Wyniki wyszukiwania

Dodaj adresata

Anulowanie wysłania wiadomości

Czy na pewno chcesz anulować wysłanie wiadomości?

Wyślij wiadomość

Opcje filtrowania

Data publikacji

Ustawianie zakresu dat

Podaj zakres dat dla filtrowania wyświetlonych wyników. Możesz podać datę początkową, końcową lub obie daty. Daty możesz wpisać ręcznie lub wybrać za pomocą kalendarza.

Dostępność treści

Typ publikacji

Słowa kluczowe

Zgłaszanie błędu / nadużycia

Nieudane wysłanie zgłoszenia

Ułatwienia dostępu