2008 IEEE International Conference on Data Mining Workshops

Items from 1 to 7 out of 7 results

chapter

A Comparative Study of Data Sampling and Cost Sensitive Learning

C. Seiffert, T.M. Khoshgoftaar, J. Van Hulse, A. Napolitano

2008 IEEE International Conference on Data Mining Workshops > 46 - 52

2008 IEEE International Conference on Data Mining Workshops

Two common challenges data mining and machine learning practitioners face in many application domains are unequal classification costs and class imbalance. Most traditional data mining techniques attempt to maximize overall accuracy rather than minimize cost. When data is imbalanced, such techniques result in models that highly favor the over represented class, the class which typically carries a...

chapter

Region Classification with Decision Trees

J. van Prehn, E.N. Smirnov

2008 IEEE International Conference on Data Mining Workshops > 53 - 59

2008 IEEE International Conference on Data Mining Workshops

The region-classification task is to construct class regions containing the correct classes of the objects being classified with a given probability. To turn a point classifier into a region classifier, the conformal framework is used . However, applying the framework requires a non-conformity function. This function estimates the instances' non-conformity for the point classifier used. This paper...

chapter

A Case Study on Classification Reliability

Honghua Dai

2008 IEEE International Conference on Data Mining Workshops > 69 - 73

2008 IEEE International Conference on Data Mining Workshops

The reliability of an induced classifier can be affected by several factors including the data oriented factors and the algorithm oriented factors. In some cases, the reliability could also be affected by knowledge oriented factors. In this paper, we analyze three special cases to examine the reliability of the discovered knowledge. Our case study results show that (1) in the cases of mining from...

chapter

Chi-Square Test Based Decision Trees Induction in Distributed Environment

Jie Ouyang, N. Patel, I.K. Sethi

2008 IEEE International Conference on Data Mining Workshops > 477 - 485

2008 IEEE International Conference on Data Mining Workshops

The decision tree-based classification is a popular approach for pattern recognition and data mining. Most decision tree induction methods assume training data being present at one central location. Given the growth in distributed databases at geographically dispersed locations, the methods for decision tree induction in distributed settings are gaining importance. This paper describes one distributed...

chapter

Co-training by Committee: A New Semi-supervised Learning Framework

M. Hady, F. Schwenker

2008 IEEE International Conference on Data Mining Workshops > 563 - 572

2008 IEEE International Conference on Data Mining Workshops

For many data mining applications, it is necessary to develop algorithms that use unlabeled data to improve the accuracy of the supervised learning. Co-Training is a popular semi-supervised learning algorithm. It assumes that each example is represented by two or more redundantly sufficient sets of features (views) and these views are independent given the class. However, these assumptions are not...

chapter

G-REX: A Versatile Framework for Evolutionary Data Mining

R. Konig, U. Johansson, L. Niklasson

2008 IEEE International Conference on Data Mining Workshops > 971 - 974

2008 IEEE International Conference on Data Mining Workshops

This paper presents G-REX, a versatile data mining framework based on genetic programming. What differs G-REX from other GP frameworks is that it doesn't strive to be a general purpose framework. This allows G-REX to include more functionality specific to data mining like preprocessing, evaluation- and optimization methods, but also a multitude of predefined classification and regression models. Examples...

chapter

GeoDMA - A Novel System for Spatial Data Mining

T.S. Korting, L.M.G. Fonseca, M.I.S. Escada, F.C. da Silva, more

2008 IEEE International Conference on Data Mining Workshops > 975 - 978

2008 IEEE International Conference on Data Mining Workshops

Although a huge amount of remote sensing data has been provided by Earth observation satellites, few data manipulation techniques and information extraction in large data sets have been developed. In this context, the present paper aims to show a new system for spatial data mining, and two test cases applied to land use change in the Brazilian Amazon region. We present the operational environment...

INFONA - science communication portal

2008 IEEE International Conference on Data Mining Workshops

A Comparative Study of Data Sampling and Cost Sensitive Learning

Region Classification with Decision Trees

A Case Study on Classification Reliability

Chi-Square Test Based Decision Trees Induction in Distributed Environment

Co-training by Committee: A New Semi-supervised Learning Framework

G-REX: A Versatile Framework for Evolutionary Data Mining

GeoDMA - A Novel System for Spatial Data Mining

Filter options

Publication date

Keywords

INFONA - science communication portal

2008 IEEE International Conference on Data Mining Workshops $("#expandableTitles").expandable();

A Comparative Study of Data Sampling and Cost Sensitive Learning

Region Classification with Decision Trees

A Case Study on Classification Reliability

Chi-Square Test Based Decision Trees Induction in Distributed Environment

Co-training by Committee: A New Semi-supervised Learning Framework

G-REX: A Versatile Framework for Evolutionary Data Mining

GeoDMA - A Novel System for Spatial Data Mining

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2008 IEEE International Conference on Data Mining Workshops