2008 IEEE International Conference on Data Mining Workshops

Items from 1 to 19 out of 19 results

chapter

Online Reliability Estimates for Individual Predictions in Data Streams

P.P. Rodrigues, J. Gama, Z. Bosnic

2008 IEEE International Conference on Data Mining Workshops > 36 - 45

Several predictive systems are nowadays vital for operations and decision support. The quality of these systems is most of the time defined by their average accuracy, which carries little or no information about the estimated error of each individual prediction. In many sensitive applications, users should be allowed to associate a measure of reliability with each prediction. In the case of batch systems,...

chapter

A Comparative Study of Data Sampling and Cost Sensitive Learning

C. Seiffert, T.M. Khoshgoftaar, J. Van Hulse, A. Napolitano

2008 IEEE International Conference on Data Mining Workshops > 46 - 52

Two common challenges data mining and machine learning practitioners face in many application domains are unequal classification costs and class imbalance. Most traditional data mining techniques attempt to maximize overall accuracy rather than minimize cost. When data is imbalanced, such techniques result in models that highly favor the overrepresented class, the class which typically carries a...

chapter

Behavior Informatics and Analytics: Let Behavior Talk

Longbing Cao

2008 IEEE International Conference on Data Mining Workshops > 87 - 96

Behavior is increasingly recognized as a key component in business intelligence and problem-solving. Unlike traditional behavior analysis, which mainly focuses on implicit behavior and explicit business appearance as a result of business usage and customer demographics, this paper proposes the field of Behavior Informatics and Analytics (BIA), to support explicit behavior involvement through...

chapter

Multiple-Instance Regression with Structured Data

K.L. Wagstaff, T. Lane, A. Roper

2008 IEEE International Conference on Data Mining Workshops > 291 - 300

We present a multiple-instance regression algorithm that models internal bag structure to identify the items most relevant to the bag labels. Multiple-instance regression (MIR) operates on a set of bags with real-valued labels, each containing a set of unlabeled items, in which the relevance of each item to its bag label is unknown. The goal is to predict the labels of new bags from their contents...

chapter

A Spatio-temporal Simulation Model for Movement Data Generation

D. Alberg, M. Last, S. Elnekave

2008 IEEE International Conference on Data Mining Workshops > 320 - 325

The real-world process of generating a large spatio-temporal data collection presents a very difficult technical problem. First, this process is very expensive, requiring many high-technology software tools and modern hardware infrastructure installations (sensors, servers, GPS infrastructure, etc.); second, the recorded trajectories sometimes cannot represent any special traffic or movement...

chapter

Data Mining for Climate Change and Impacts

A.R. Ganguly, K. Steinhaeuser

2008 IEEE International Conference on Data Mining Workshops > 385 - 394

Knowledge discovery from temporal, spatial and spatiotemporal data is critical for climate change science and climate impacts. Climate statistics is a mature area. However, recent growth in observations and model outputs, combined with the increased availability of geographical data, presents new opportunities for data miners. This paper maps climate requirements to solutions available in temporal,...

chapter

RE-SPaM: Using Regular Expressions for Sequential Pattern Mining in Trajectory Databases

L.I. Gomez, A.A. Vaisman

2008 IEEE International Conference on Data Mining Workshops > 395 - 398

In sequential pattern mining, languages based on regular expressions (RE) were proposed to restrict frequent sequences to the ones that satisfy user-specified constraints. In these languages, REs are applied over items. We propose a much more powerful language, based on regular expressions, denoted RE-SPaM, where the basic elements are constraints over the attributes of the items. Expressions in this language...

chapter

Web Query Prediction by Unifying Model

Ning Liu, J. Yan, Shuicheng Yan, Weiguo Fan, et al.

2008 IEEE International Conference on Data Mining Workshops > 436 - 441

Recently, many commercial products, such as Google Trends and Yahoo! Buzz, have been released to monitor past search engine query frequency trends. However, little research has been devoted to predicting the upcoming query trend, which is of great importance in providing guidelines for future business planning. In this paper, a unified solution is presented for such a purpose. Besides the classical...

chapter

Distributed Data Mining Models as Services on the Grid

E. Cesario, D. Talia

2008 IEEE International Conference on Data Mining Workshops > 486 - 495

This paper describes how distributed data mining models, such as collective learning, ensemble learning, and meta-learning models, can be implemented as WSRF mining services by exploiting the Grid infrastructure. Our goal is to design a general distributed architectural model that can be exploited for different distributed mining algorithms deployed as Grid services for the analysis of dispersed data...

chapter

Stream-Close: Fast Mining of Closed Frequent Itemsets in High Speed Data Streams

B.N. Ranganath, M.N. Murty

2008 IEEE International Conference on Data Mining Workshops > 516 - 525

With the emergence of large-volume and high-speed streaming data, the recent techniques for stream mining of CFIs (closed frequent itemsets) will become inefficient. When concept drift occurs at a slow rate in high speed data streams, the rate of change of information across different sliding windows will be negligible. So, the user won't be devoid of change in information if we slide window...

chapter

Distributed Linear Programming and Resource Management for Data Mining in Distributed Environments

H. Dutta, H. Kargupta

2008 IEEE International Conference on Data Mining Workshops > 543 - 552

Advances in computing and communication have resulted in very large scale distributed environments in recent years. They are capable of storing large volumes of data and often have multiple compute nodes. However, the inherent heterogeneity of data components, the dynamic nature of distributed systems, the need for information synchronization and data fusion over a network and security and access control...

chapter

Bounding and Estimating Association Rule Support from Clusters on Binary Data

C. Ordonez, Kai Zhao, Zhibo Chen

2008 IEEE International Conference on Data Mining Workshops > 609 - 618

The theoretical relationship between association rules and machine learning techniques needs to be studied in more depth. This article studies the use of clustering as a model for association rule mining. The clustering model is exploited to bound and estimate association rule support and confidence. We first study the efficient computation of the clustering model with K-means; we show the sufficient...

chapter

A Logical Formulation of the Granular Data Model

Tuan-Fang Fan, Churn-Jung Liau, Tsau-Young Lin, K. Lee

2008 IEEE International Conference on Data Mining Workshops > 628 - 634

In data mining problems, data is usually provided in the form of data tables. To represent knowledge discovered from data tables, decision logic (DL) is proposed in rough set theory. While DL is an instance of propositional logic, we can also describe data tables by other logical formalisms. In this paper, we use a kind of many-sorted logic, called attribute value-sorted logic, to study association...

chapter

Exploiting Data Semantics to Discover, Extract, and Model Web Sources

J.L. Ambite, C.A. Knoblock, K. Lerman, A. Plangprasopchok, et al.

2008 IEEE International Conference on Data Mining Workshops > 771 - 779

We describe Deimos, a system that automatically discovers and models new sources of information. The system exploits four core technologies developed by our group that make an end-to-end solution to this problem possible. First, given an example source, Deimos finds other similar sources online. Second, it invokes and extracts data from these sources. Third, given the syntactic structure of a source,...

chapter

Simultaneous Co-segmentation and Predictive Modeling for Large, Temporal Marketing Data

M. Deodhar, J. Ghosh

2008 IEEE International Conference on Data Mining Workshops > 806 - 815

Several marketing problems involve prediction of customer purchase behavior and forecasting future preferences. We consider predictive modeling of large scale, bi-modal or multimodal temporal marketing data, for instance, datasets consisting of customer spending behavior over time. Such datasets are characterized by variability in purchase patterns across different customer subgroups and shifting...

chapter

Combining Behavioral and Social Network Data for Online Advertising

A. Bagherjeiran, R. Parekh

2008 IEEE International Conference on Data Mining Workshops > 837 - 846

There are two main requirements for effective advertising in social networks. The first is that links in the social network are relevant to the targeted ads. The second is that social information can be easily incorporated with existing targeting methods to predict response rates. Our purpose in this paper is to investigate these requirements. We measure the relevance of a social network, the Yahoo!...

chapter

Temporal Evolution of the UK Web

I. Bordino, P. Boldi, D. Donato, M. Santini, et al.

2008 IEEE International Conference on Data Mining Workshops > 909 - 918

Recently, a new temporal dataset has been made public: it consists of a series of twelve 100M-page snapshots of the .uk domain. The Web graphs of the twelve snapshots have been merged into a single time-aware graph that provides constant-time access to temporal information. In this paper we present the first statistical analysis performed on this graph, with the goal of checking whether the information...

chapter

G-REX: A Versatile Framework for Evolutionary Data Mining

R. Konig, U. Johansson, L. Niklasson

2008 IEEE International Conference on Data Mining Workshops > 971 - 974

This paper presents G-REX, a versatile data mining framework based on genetic programming. What distinguishes G-REX from other GP frameworks is that it doesn't strive to be a general purpose framework. This allows G-REX to include more functionality specific to data mining, like preprocessing, evaluation and optimization methods, but also a multitude of predefined classification and regression models. Examples...

chapter

A Data Stream Mining System

H. Thakkar, B. Mozafari, C. Zaniolo

2008 IEEE International Conference on Data Mining Workshops > 987 - 990

Online data stream mining has attracted much research interest, but systems that can be used as a workbench for online mining have not been researched, since they pose many difficult research challenges. The proposed system addresses these challenges by an architecture based on three main technical advances: (i) introduction of new constructs and synoptic data structures whereby complex KDD queries...
