2008 IEEE International Conference on Data Mining Workshops

Items from 1 to 14 out of 14 results

chapter

A Comparative Study of Data Sampling and Cost Sensitive Learning

C. Seiffert, T.M. Khoshgoftaar, J. Van Hulse, A. Napolitano

2008 IEEE International Conference on Data Mining Workshops > 46 - 52

2008 IEEE International Conference on Data Mining Workshops

Two common challenges data mining and machine learning practitioners face in many application domains are unequal classification costs and class imbalance. Most traditional data mining techniques attempt to maximize overall accuracy rather than minimize cost. When data is imbalanced, such techniques result in models that highly favor the over represented class, the class which typically carries a...

chapter

A Case Study on Classification Reliability

Honghua Dai

2008 IEEE International Conference on Data Mining Workshops > 69 - 73

2008 IEEE International Conference on Data Mining Workshops

The reliability of an induced classifier can be affected by several factors including the data oriented factors and the algorithm oriented factors. In some cases, the reliability could also be affected by knowledge oriented factors. In this paper, we analyze three special cases to examine the reliability of the discovered knowledge. Our case study results show that (1) in the cases of mining from...

chapter

k-Nearest Neighbor Classification on First-Order Logic Descriptions

S. Ferilli, M. Biba, T. Basile, N. Di Mauro, more

2008 IEEE International Conference on Data Mining Workshops > 202 - 210

2008 IEEE International Conference on Data Mining Workshops

Classical attribute-value descriptions induce a multi-dimensional geometric space. One way for computing the distance between descriptions in such a space consists in evaluating an Euclidean distance between tuples of coordinates. This is the ground on which a large part of the Machine Learning literature has built its methods and techniques. However, the complexity of some domains require the use...

chapter

Semi-supervised Collaborative Clustering with Partial Background Knowledge

G. Forestier, C. Wemmert, P. Gancarski

2008 IEEE International Conference on Data Mining Workshops > 211 - 217

2008 IEEE International Conference on Data Mining Workshops

In this paper we present a new algorithm for semisupervised clustering. We assume to have a small set of labeled samples and we use it in a clustering algorithm to discover relevant patterns. We study how our algorithm works against two other semisupervised algorithms when the data are multimodal. Then, we study the case where the user is able to produce few samples for some classes but not for each...

chapter

Clustering Events on Streams Using Complex Context Information

YongChul Kwon, Wing Yee Lee, M. Balazinska, Guiping Xu

2008 IEEE International Conference on Data Mining Workshops > 238 - 247

2008 IEEE International Conference on Data Mining Workshops

Monitoring applications play an increasingly important role in many domains. They detect events in monitored systems and take actions such as invoke a program or notify an administrator. Often administrators must then manually investigate events to figure out the source of a problem. Stream processing engines (SPEs) are general purpose data management systems for monitoring applications. They provide...

chapter

Extraction of Discriminative Features from Hyperspectral Data

H. Kalkan, Y. Yardimci

2008 IEEE International Conference on Data Mining Workshops > 414 - 419

2008 IEEE International Conference on Data Mining Workshops

This paper presents a method to discover the discriminative patterns or features in hyperspectral data for classification. The proposed method searches the data space along both spectral and spatial frequency axis and combines the adjacent spectral and spatial frequency bands so that a simpler but more effective feature set is achieved. The algorithm is tested on hyperspectral images of hazelnut kernels...

chapter

Chi-Square Test Based Decision Trees Induction in Distributed Environment

Jie Ouyang, N. Patel, I.K. Sethi

2008 IEEE International Conference on Data Mining Workshops > 477 - 485

2008 IEEE International Conference on Data Mining Workshops

The decision tree-based classification is a popular approach for pattern recognition and data mining. Most decision tree induction methods assume training data being present at one central location. Given the growth in distributed databases at geographically dispersed locations, the methods for decision tree induction in distributed settings are gaining importance. This paper describes one distributed...

chapter

Efficient Distance Computation Using SQL Queries and UDFs

S.K. Pitchaimalai, C. Ordonez, C. Garcia-Alvarado

2008 IEEE International Conference on Data Mining Workshops > 533 - 542

2008 IEEE International Conference on Data Mining Workshops

Distance computation is one of the most computationally intensive operations employed by many data mining algorithms. Performing such matrix computations within a DBMS creates many optimization challenges. We propose techniques to efficiently compute Euclidean distance using SQL queries and user-defined functions (UDFs). We concentrate on efficient Euclidean distance computation for the well-known...

chapter

Reclassification Rules

Li-Shiang Tsay, Z.W. Ras, Seunghyun Im

2008 IEEE International Conference on Data Mining Workshops > 619 - 627

2008 IEEE International Conference on Data Mining Workshops

The ultimate goal of knowledge discovery (KD) is to extract sets of patterns leading to useful knowledge for obtaining user desirable outcomes. The key characteristics of knowledge usefulness is that these patterns are actionable. In the last decade, KD algorithms such as mining for association rules, clustering, and classification rules, have made a tremendous progress and have been demonstrated...

chapter

ARUBAS: An Association Rule Based Similarity Framework for Associative Classifiers

B. Depaire, K. Vanhoof, G. Wets

2008 IEEE International Conference on Data Mining Workshops > 692 - 699

2008 IEEE International Conference on Data Mining Workshops

This article introduces ARUBAS, a new framework to build associative classifiers. In contrast with many existing associative classifiers, it uses class association rules to transform the feature space and uses instance-based reasoning to classify new instances. The framework allows the researcher to use any association rule mining algorithm to produce the class association rules. Every aspect of the...

chapter

ZCS Revisited: Zeroth-Level Classifier Systems for Data Mining

F.A. Tzima, P.A. Mitkas

2008 IEEE International Conference on Data Mining Workshops > 700 - 709

2008 IEEE International Conference on Data Mining Workshops

Learning classifier systems (LCS) are machine learning systems designed to work for both multi-step and single-step decision tasks. The latter case presents an interesting,though not widely studied, challenge for such algorithms,especially when they are applied to real-world data mining problems. The present investigation departs from the popular approach of applying accuracy-based LCS to data mining...

chapter

The Set Classification Problem and Solution Methods

Xia Ning, G. Karypis

2008 IEEE International Conference on Data Mining Workshops > 720 - 729

2008 IEEE International Conference on Data Mining Workshops

This paper focuses on developing classification algorithms for problems in which there is a need to predict the class based on multiple observations (examples) of the same phenomenon (class). These problems give rise to a new classification problem, referred to as set classification, that requires the prediction of a set of instances given the prior knowledge that all the instances of the set belong...

chapter

If Constraint-Based Mining is the Answer: What is the Constraint? (Invited Talk)

J.-F. Boulicaut

2008 IEEE International Conference on Data Mining Workshops > 730

2008 IEEE International Conference on Data Mining Workshops

Constraint-based mining has been proven to be extremely useful. It has been applied not only to many pattern discovery settings (e.g., for sequential pattern mining) but also, recently, on classification and clustering tasks (see, e.g., ). It appears as a key technology for an inductive database perspective on knowledge discovery in databases (KDD), and constraint-based mining is indeed an answer...

chapter

Using Contextual Information to Decrease the Cost of Incorrect Predictions in On-line Customer Behavior Modeling

M. Gorgoglione, C. Palmisano, S. Lombardi

2008 IEEE International Conference on Data Mining Workshops > 780 - 788

2008 IEEE International Conference on Data Mining Workshops

The performance of user profiling models depends on both the predictive accuracy and the cost of incorrect predictions. In this paper we study whether including contextual information leads to a decrease in the misclassification cost. Several experimental analyses were done by varying the cost ratio, the market granularity and the granularity of context. The experimental results show that context...

Filter options

Keywords:
PATTERN CLASSIFICATION

Publication date

Set your own date range

Keywords

CLASSIFICATION ALGORITHMS (11)
DATA MINING (9)
LEARNING (ARTIFICIAL INTELLIGENCE) (5)
ACCURACY (4)
PATTERN CLUSTERING (4)
CLUSTERING ALGORITHMS (3)
CONFERENCES (3)
DECISION TREES (3)
DISTANCE MEASUREMENT (3)
KNOWLEDGE DISCOVERY (3)
LEARNING SYSTEMS (3)
MACHINE LEARNING (3)
TRAINING (3)
TRAINING DATA (3)
ASSOCIATION RULES (2)
CLASSIFICATION (2)
EUCLIDEAN DISTANCE (2)
KERNEL (2)
PREDICTIVE MODELS (2)
QUERY PROCESSING (2)
ACTIONABLE PATTERNS (1)
ADJACENT SPECTRAL BANDS (1)
ANALYSIS OF VARIANCE (1)
APPLICATION DOMAIN (1)
ARUBAS-SCHEFFER ALGORITHM (1)
ASSOCIATION RULE MINING ALGORITHM (1)
ASSOCIATION RULE-BASED SIMILARITY FRAMEWORK (1)
ASSOCIATIVE CLASSIFIER (1)
BAND PASS FILTERS (1)
BENCHMARK TESTING (1)
BINARY TREES (1)
BIOLOGICAL SYSTEM MODELING (1)
BOOSTING (1)
BUILDINGS (1)
CDM (1)
CHAID ALGORITHM (1)
CHAPTERS (1)
CHI SQUARE TEST (1)
CHI-SQUARE TEST (1)
CLASS IMBALANCE (1)
CLASSICAL ATTRIBUTE-VALUE DESCRIPTIONS (1)
CLASSIFICATION RELIABILITY (1)
CLASSIFICATION TASK (1)
CLASSIFICATION TECHNIQUE (1)
CLASSIFICATION TREE ANALYSIS (1)
CLUSTERING (1)
CLUSTERING METHODS (1)
CLUSTERING TASK (1)
CLUSTERING TECHNIQUE (1)
COLLABORATION (1)
COLLABORATIVE CLUSTERING (1)
COMPANIES (1)
COMPLEX CONTEXT INFORMATION (1)
COMPLEXITY THEORY (1)
COMPUTER AIDED SOFTWARE ENGINEERING (1)
CONCEPTUAL LEARNING SYSTEMS (1)
CONSTRAINT BACK PROPAGATION (1)
CONSTRAINT RELAXATION STRATEGY (1)
CONSTRAINT-BASED DATA MINING QUERY (1)
CONSTRAINTS (1)
CONSUMER BEHAVIOUR (1)
CONTEXT DISTANCE MEASURE (1)
CONTEXT GRANULARITY (1)
CONTEXT MODELING (1)
CONTEXTUAL INFORMATION (1)
COST SENSITIVE LEARNING (1)
COSTING (1)
DATA CLASSIFICATION (1)
DATA INTEGRITY (1)
DATA MANAGEMENT SYSTEMS (1)
DATA MINING ALGORITHMS (1)
DATA MODELS (1)
DATA ORIENTED FACTORS (1)
DATA SAMPLING (1)
DATA STREAM (1)
DATABASES (1)
DBMS (1)
DECISION TREES INDUCTION (1)
DECLARATIVE SEMANTICS (1)
DEDUCTIVE DATABASES (1)
DISCOVERY RELIABILITY (1)
DISCRIMINATIVE FEATURE EXTRACTION (1)
DISCRIMINATIVE FEATURES (1)
DISCRIMINATIVE PATTERNS (1)
DISTANCE (1)
DISTANCE MEASURES (1)
DISTRIBUTED DATABASES (1)
DISTRIBUTED DECISION TREE (1)
DISTRIBUTED ENVIRONMENT (1)
EARTH (1)
EUCLIDEAN DISTANCE COMPUTATION (1)
EVENT CLUSTERING (1)
EVENT CONTEXT (1)
EVENT CONTEXT DATA MODEL (1)
EVENT DETECTION (1)
EVENT PROCESSING (1)
FEATURE EXTRACTION (1)
FEATURE SPACE TRANSFORM (1)
FETS (1)
more

INFONA - science communication portal

2008 IEEE International Conference on Data Mining Workshops $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2008 IEEE International Conference on Data Mining Workshops