2008 IEEE International Conference on Data Mining Workshops

Items from 21 to 40 out of 140 results

chapter

Scalable Sparse Bayesian Network Learning for Spatial Applications

T. Liebig, C. Korner, M. May

2008 IEEE International Conference on Data Mining Workshops > 420 - 425

2008 IEEE International Conference on Data Mining Workshops

Traffic routes through a street network contain patterns and are no random walks. Such patterns exist for instance along streets or between neighbouring street segments. The extraction of these patterns is a challenging task due to the enormous size of city street networks, the large number of required training data and the unknown distribution of the latter. We apply Bayesian Networks to model the...

chapter

A Semi-supervised Learning Algorithm for Recognizing Sub-classes

R.R. Vatsavai, S. Shekhar, B. Bhaduri

2008 IEEE International Conference on Data Mining Workshops > 458 - 467

2008 IEEE International Conference on Data Mining Workshops

In many practical situations it is not feasible to collect labeled samples for all available classes in a domain. Especially in supervised classification of remotely sensed images it is impossible to collect ground truth information over large geographic regions for all thematic classes. As a result often analysts collect labels for aggregate classes (e.g., Forest, Agriculture, Urban). In this paper...

chapter

A Vector-Geometry Based Spatial kNN-Algorithm for Traffic Frequency Predictions

M. May, D. Hecker, C. Korner, S. Scheider, more

2008 IEEE International Conference on Data Mining Workshops > 442 - 447

2008 IEEE International Conference on Data Mining Workshops

We introduce s-kNN, a nearest neighbor based spatial data mining algorithm. It belongs to the class of vector-geometry based algorithms that reason on complex spatial objects instead of point measurements. In contrast to most methods in this class, it does on the fly spatial computations that cannot be replaced by a pre-processing step without sacrificing efficiency. The key is a partial evaluation...

chapter

Discovering Triggering Events from Longitudinal Data

C. Loglisci, D. Malerba

2008 IEEE International Conference on Data Mining Workshops > 248 - 256

2008 IEEE International Conference on Data Mining Workshops

Longitudinal data consist of the repeated measurements of some variables which describe the dynamics of a domain(process or phenomenon) over time. They can be analyzed in order to explain what event may cause the transition from a state into the next one during the evolution of the domain. Generally, approaches to this explanation problem rely on the exclusive usage of domain knowledge, while an analysis...

chapter

A Spatio-temporal Simulation Model for Movement Data Generation

D. Alberg, M. Last, S. Elnekave

2008 IEEE International Conference on Data Mining Workshops > 320 - 325

2008 IEEE International Conference on Data Mining Workshops

The real-world process of generating a large spatio-temporal data collection presents a very difficult technical problem. First, this process is very expensive, requiring a lot of various high-technology software tools and modern hardware infrastructure (sensors, servers, GPS infrastructure etc.) installations; second, the recorded trajectories sometimes cannot represent any special traffic or movement...

chapter

Discovery of Internal and External Hyperclique Patterns in Complex Graph Databases

T. Yamamoto, T. Ozaki, T. Ohkawa

2008 IEEE International Conference on Data Mining Workshops > 301 - 309

2008 IEEE International Conference on Data Mining Workshops

In some applications, the whole structure of the target data can be represented naturally in "multi-structured graphs" that are complex graphs whose vertices consist of aset of structured data such as itemsets, sequences and so on. To catch the strong affinity relationship in multi-structured graphs, in this paper, we propose an algorithm named HFMG to discover novel and meaningful frequent...

chapter

Multiple-Instance Regression with Structured Data

K.L. Wagstaff, T. Lane, A. Roper

2008 IEEE International Conference on Data Mining Workshops > 291 - 300

2008 IEEE International Conference on Data Mining Workshops

We present a multiple-instance regression algorithm that models internal bag structure to identify the items most relevant to the bag labels. Multiple-instance regression (MIR) operates on a set of bags with real-valued labels, each containing a set of unlabeled items, in which the relevance of each item to its bag label is unknown. The goal is to predict the labels of new bags from their contents...

chapter

Association Action Rules

Z.W. Ras, A. Dardzinska, L.-S. Tsay, H. Wasyluk

2008 IEEE International Conference on Data Mining Workshops > 283 - 290

2008 IEEE International Conference on Data Mining Workshops

Action rules describe possible transitions of objects from one state to another with respect to a distinguished attribute. Previous research on action rule discovery usually required the extraction of classification rules before constructing any action rule. This paper gives anew approach for generating association-type action rules. The notion of frequent action sets and Apriori-like strategy generating...

chapter

Extension of Partitional Clustering Methods for Handling Mixed Data

Y. Naija, S. Chakhar, K. Blibech, R. Robbana

2008 IEEE International Conference on Data Mining Workshops > 257 - 266

2008 IEEE International Conference on Data Mining Workshops

Clustering is an active research topic in data mining and different methods have been proposed in the literature. Most of these methods are based on the use of a distance measure defined either on numerical attributes or on categorical attributes. However, in fields such as road traffic and medicine, datasets are composed of numerical and categorical attributes. Recently, there have been several proposals...

chapter

Word Sense Discovery for Web Information Retrieval

T. Nykiel, H. Rybinski

2008 IEEE International Conference on Data Mining Workshops > 267 - 274

2008 IEEE International Conference on Data Mining Workshops

Word meaning disambiguation has always been an important problem in many computer science tasks, such as information retrieval and extraction. One of the problems,faced in automatic word sense discovery, is the number of different senses a word can have. Often, senses are dominated by some other, more frequent ones. Discovering such dominated meanings can significantly improve quality of many text-related...

chapter

Geographic Knowledge Discovery in INGENS: An Inductive Database Perspective

A. Appice, A. Ciampi, A. Lanza, D. Malerba, more

2008 IEEE International Conference on Data Mining Workshops > 326 - 331

2008 IEEE International Conference on Data Mining Workshops

INGENS is a prototype of GIS which integrates a geographic knowledge discovery engine to mine several kinds of spatial KDD objects from the topographic maps stored in a spatial database. In this paper we describe the main principles of an inductive spatial database in INGENS. Inductive database allows to keep permanent KDD objects and integrate database technology with systems for the geographic knowledge...

chapter

Mining Correlated Pairs of Patterns in Multidimensional Structured Databases

T. Ozaki, T. Ohkawa

2008 IEEE International Conference on Data Mining Workshops > 275 - 282

2008 IEEE International Conference on Data Mining Workshops

Structured data is becoming increasingly abundant in many application domains recently. In this paper, as one of the correlation mining, we propose new data mining problems of finding frequent and correlated pairs of patterns in structured databases. First, we consider the problem of finding all frequent and correlated pattern pairs in two dimensional structured databases. Then, two kinds of top-k...

chapter

The Impact of Structural Changes on Predictions of Diffusion in Networks

M. Lahiri, A.S. Maiya, R. Sulo, Habiba, more

2008 IEEE International Conference on Data Mining Workshops > 939 - 948

2008 IEEE International Conference on Data Mining Workshops

In a typical realistic scenario, there exist some past data about the structure of the network which are analyzed with respect to some possibly future spreading process, such as behavior, opinion, disease, or computer malware. How sensitive are the predictions made about spread and spreaders to the changes in the structure of the network? We investigate the answer to this question by considering seven...

chapter

DMDM 2008 Message and Committee

2008 IEEE International Conference on Data Mining Workshops > xxxv - xxxvii

2008 IEEE International Conference on Data Mining Workshops

chapter

Domain Driven Data Mining (D3M)

Longbing Cao

2008 IEEE International Conference on Data Mining Workshops > 74 - 76

2008 IEEE International Conference on Data Mining Workshops

In deploying data mining into the real-world business, we have to cater for business scenarios, organizational factors, user preferences and business needs. However, the current data mining algorithms and tools often stop at the delivery of patterns satisfying expected technical interestingness. Business people are not informed about how and what to do to take over the technical deliverables. The...

chapter

A Case Study on Classification Reliability

Honghua Dai

2008 IEEE International Conference on Data Mining Workshops > 69 - 73

2008 IEEE International Conference on Data Mining Workshops

The reliability of an induced classifier can be affected by several factors including the data oriented factors and the algorithm oriented factors. In some cases, the reliability could also be affected by knowledge oriented factors. In this paper, we analyze three special cases to examine the reliability of the discovered knowledge. Our case study results show that (1) in the cases of mining from...

chapter

One-Class Classification of Text Streams with Concept Drift

Yang Zhang, Xue Li, M. Orlowska

2008 IEEE International Conference on Data Mining Workshops > 116 - 125

2008 IEEE International Conference on Data Mining Workshops

Research on streaming data classification has been mostly based on the assumption that data can be fully labelled. However, this is impractical. Firstly it is impossible to make a complete labelling before all data has arrived. Secondly it is generally very expensive to obtain fully labelled data by using man power. Thirdly user interests may change with time so the labels issued earlier may be inconsistent...

chapter

TransRank: A Novel Algorithm for Transfer of Rank Learning

Depin Chen, Jun Yan, Gang Wang, Yan Xiong, more

2008 IEEE International Conference on Data Mining Workshops > 106 - 115

2008 IEEE International Conference on Data Mining Workshops

Recently, learning to rank technique has attracted much attention. However, the lack of labeled training data seriously limits its application in real-world tasks. In this paper, we propose to break this bottleneck by considering the cross-domain ldquotransfer of rank learningrdquo problem. Simultaneously, we propose a novel algorithm called TransRank, which can effectively utilize the labeled data...

chapter

Post-Processing of Discovered Association Rules Using Ontologies

C. Marinica, F. Guillet, H. Briand

2008 IEEE International Conference on Data Mining Workshops > 126 - 133

2008 IEEE International Conference on Data Mining Workshops

In Data Mining, the usefulness of association rules is strongly limited by the huge amount of delivered rules. In this paper we propose a new approach to prune and filter discovered rules. Using Domain Ontologies, we strengthen the integration of user knowledge in the post-processing task. Furthermore, an interactive and iterative framework is designed to assist the user along the analyzing task....

chapter

Hierarchical Text Categorization in a Transductive Setting

M. Ceci

2008 IEEE International Conference on Data Mining Workshops > 184 - 191

2008 IEEE International Conference on Data Mining Workshops

Transductive learning is the learning setting that permits to learn from "particular to particular'' and to consider both labelled and unlabelled examples when taking classification decisions. In this paper, we investigate the use of transductive learning in the context of hierarchical text categorization. At this aim, we exploit a modified version of an inductive hierarchical learning framework...

Publication date

Set your own date range

Content availability

Available (139)
None (1)

Keywords

DATA MINING (86)
CLASSIFICATION ALGORITHMS (29)
DATABASES (23)
DATA MODELS (19)
LEARNING (ARTIFICIAL INTELLIGENCE) (19)
CLUSTERING ALGORITHMS (18)
TRAINING (18)
DISTANCE MEASUREMENT (17)
FEATURE EXTRACTION (16)
ACCURACY (15)
PATTERN CLUSTERING (15)
ALGORITHM DESIGN AND ANALYSIS (14)
CONFERENCES (14)
PATTERN CLASSIFICATION (14)
ASSOCIATION RULES (13)
QUERY PROCESSING (11)
INTERNET (10)
ITEMSETS (10)
INDEXES (8)
KNOWLEDGE DISCOVERY (8)
PREDICTIVE MODELS (8)
STATISTICAL ANALYSIS (8)
COMPUTATIONAL MODELING (7)
CORRELATION (7)
DATABASE MANAGEMENT SYSTEMS (7)
DECISION TREES (7)
ESTIMATION (7)
GRAPH THEORY (7)
KERNEL (7)
MATHEMATICAL MODEL (7)
ONTOLOGIES (ARTIFICIAL INTELLIGENCE) (7)
TEXT ANALYSIS (7)
TRAINING DATA (7)
OPTIMIZATION (6)
RELIABILITY (6)
VISUAL DATABASES (6)
WEB SERVICES (6)
BIOLOGICAL SYSTEM MODELING (5)
CLASSIFICATION (5)
CLUSTERING (5)
FILTERING (5)
GRAPH MINING (5)
MACHINE LEARNING (5)
MARKETING (5)
MATRIX ALGEBRA (5)
MERGING (5)
ONTOLOGIES (5)
PROTEINS (5)
SPATIAL DATABASES (5)
APPROXIMATION METHODS (4)
BIOLOGY (4)
BUILDINGS (4)
BUSINESS (4)
CITIES AND TOWNS (4)
DATA ANALYSIS (4)
ENGINES (4)
EQUATIONS (4)
HIDDEN MARKOV MODELS (4)
HUMANS (4)
IMAGE CLASSIFICATION (4)
LABELING (4)
LEARNING SYSTEMS (4)
METEOROLOGY (4)
NOISE (4)
PEDIATRICS (4)
PROBABILITY (4)
PROPOSALS (4)
REDUNDANCY (4)
REGRESSION ANALYSIS (4)
REMOTE SENSING (4)
SET THEORY (4)
SOCIAL NETWORK SERVICES (4)
SOFTWARE (4)
SUPPORT VECTOR MACHINES (4)
TIME SERIES ANALYSIS (4)
WEB PAGES (4)
AGRICULTURE (3)
AMINO ACIDS (3)
ANALYTICAL MODELS (3)
ANOMALY DETECTION (3)
ATMOSPHERIC MEASUREMENTS (3)
BENCHMARK TESTING (3)
CLASSIFICATION TREE ANALYSIS (3)
COMPANIES (3)
COMPLEXITY THEORY (3)
COMPUTER SCIENCE (3)
CONSUMER BEHAVIOUR (3)
DATA HANDLING (3)
DATA VISUALISATION (3)
DATA VISUALIZATION (3)
DELAY (3)
DISTRIBUTED DATABASES (3)
EVOLUTION (BIOLOGY) (3)
GEOGRAPHIC INFORMATION SYSTEMS (3)
GRAPHICS (3)
IMAGE COLOR ANALYSIS (3)
IMAGE SEQUENCES (3)
INFORMATION EXTRACTION (3)
IP NETWORKS (3)
KNOWLEDGE ENGINEERING (3)
more

INFONA - science communication portal

2008 IEEE International Conference on Data Mining Workshops

Scalable Sparse Bayesian Network Learning for Spatial Applications

A Semi-supervised Learning Algorithm for Recognizing Sub-classes

A Vector-Geometry Based Spatial kNN-Algorithm for Traffic Frequency Predictions

Discovering Triggering Events from Longitudinal Data

A Spatio-temporal Simulation Model for Movement Data Generation

Discovery of Internal and External Hyperclique Patterns in Complex Graph Databases

Multiple-Instance Regression with Structured Data

Association Action Rules

Extension of Partitional Clustering Methods for Handling Mixed Data

Word Sense Discovery for Web Information Retrieval

Geographic Knowledge Discovery in INGENS: An Inductive Database Perspective

Mining Correlated Pairs of Patterns in Multidimensional Structured Databases

The Impact of Structural Changes on Predictions of Diffusion in Networks

DMDM 2008 Message and Committee

Domain Driven Data Mining (D3M)

A Case Study on Classification Reliability

One-Class Classification of Text Streams with Concept Drift

TransRank: A Novel Algorithm for Transfer of Rank Learning

Post-Processing of Discovered Association Rules Using Ontologies

Hierarchical Text Categorization in a Transductive Setting

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

2008 IEEE International Conference on Data Mining Workshops $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2008 IEEE International Conference on Data Mining Workshops