2008 IEEE International Conference on Data Mining Workshops

Items from 41 to 60 out of 139 results

chapter

k-Nearest Neighbor Classification on First-Order Logic Descriptions

S. Ferilli, M. Biba, T. Basile, N. Di Mauro, more

2008 IEEE International Conference on Data Mining Workshops > 202 - 210

Classical attribute-value descriptions induce a multi-dimensional geometric space. One way of computing the distance between descriptions in such a space is to evaluate a Euclidean distance between tuples of coordinates. This is the ground on which a large part of the Machine Learning literature has built its methods and techniques. However, the complexity of some domains requires the use...

chapter

Semi-supervised Collaborative Clustering with Partial Background Knowledge

G. Forestier, C. Wemmert, P. Gancarski

2008 IEEE International Conference on Data Mining Workshops > 211 - 217

In this paper we present a new algorithm for semi-supervised clustering. We assume that a small set of labeled samples is available and use it in a clustering algorithm to discover relevant patterns. We study how our algorithm performs against two other semi-supervised algorithms when the data are multimodal. Then, we study the case where the user is able to produce a few samples for some classes but not for each...

chapter

Mining Temporal Patterns with Quantitative Intervals

T. Guyet, R. Quiniou

2008 IEEE International Conference on Data Mining Workshops > 218 - 227

In this paper we consider the problem of discovering frequent temporal patterns in a database of temporal sequences, where a temporal sequence is a set of items with associated dates and durations. Since the quantitative temporal information appears to be fundamental in many contexts, it is taken into account in the mining processes and returned as part of the extracted knowledge. To this end, we...

chapter

Plant Protein Localization Using Discriminative and Frequent Partition-Based Subsequences

S.V. Jazayeri, O.R. Zaiane

2008 IEEE International Conference on Data Mining Workshops > 228 - 237

The function of proteins in living cells varies with their localization. Extracellular plant proteins are responsible for vital functions such as nutrition acquisition, protection from pathogens, communication with other soil organisms, etc. Hence, characterizing these proteins and distinguishing them from intracellular proteins is of high interest to biologists. Nonetheless, the small...

chapter

Clustering Events on Streams Using Complex Context Information

YongChul Kwon, Wing Yee Lee, M. Balazinska, Guiping Xu

2008 IEEE International Conference on Data Mining Workshops > 238 - 247

Monitoring applications play an increasingly important role in many domains. They detect events in monitored systems and take actions such as invoking a program or notifying an administrator. Often, administrators must then manually investigate events to determine the source of a problem. Stream processing engines (SPEs) are general-purpose data management systems for monitoring applications. They provide...

chapter

Discovering Triggering Events from Longitudinal Data

C. Loglisci, D. Malerba

2008 IEEE International Conference on Data Mining Workshops > 248 - 256

Longitudinal data consist of repeated measurements of variables that describe the dynamics of a domain (process or phenomenon) over time. They can be analyzed in order to explain what event may cause the transition from one state to the next during the evolution of the domain. Generally, approaches to this explanation problem rely on the exclusive use of domain knowledge, while an analysis...

chapter

Extension of Partitional Clustering Methods for Handling Mixed Data

Y. Naija, S. Chakhar, K. Blibech, R. Robbana

2008 IEEE International Conference on Data Mining Workshops > 257 - 266

Clustering is an active research topic in data mining and different methods have been proposed in the literature. Most of these methods are based on the use of a distance measure defined either on numerical attributes or on categorical attributes. However, in fields such as road traffic and medicine, datasets are composed of numerical and categorical attributes. Recently, there have been several proposals...

chapter

Word Sense Discovery for Web Information Retrieval

T. Nykiel, H. Rybinski

2008 IEEE International Conference on Data Mining Workshops > 267 - 274

Word meaning disambiguation has always been an important problem in many computer science tasks, such as information retrieval and extraction. One of the problems faced in automatic word sense discovery is the number of different senses a word can have. Often, some senses are dominated by other, more frequent ones. Discovering such dominated meanings can significantly improve the quality of many text-related...

chapter

Mining Correlated Pairs of Patterns in Multidimensional Structured Databases

T. Ozaki, T. Ohkawa

2008 IEEE International Conference on Data Mining Workshops > 275 - 282

Structured data has recently become increasingly abundant in many application domains. In this paper, as a form of correlation mining, we propose new data mining problems of finding frequent and correlated pairs of patterns in structured databases. First, we consider the problem of finding all frequent and correlated pattern pairs in two-dimensional structured databases. Then, two kinds of top-k...

chapter

Association Action Rules

Z.W. Ras, A. Dardzinska, L.-S. Tsay, H. Wasyluk

2008 IEEE International Conference on Data Mining Workshops > 283 - 290

Action rules describe possible transitions of objects from one state to another with respect to a distinguished attribute. Previous research on action rule discovery usually required the extraction of classification rules before constructing any action rule. This paper gives a new approach for generating association-type action rules. The notion of frequent action sets and Apriori-like strategy generating...

chapter

Multiple-Instance Regression with Structured Data

K.L. Wagstaff, T. Lane, A. Roper

2008 IEEE International Conference on Data Mining Workshops > 291 - 300

We present a multiple-instance regression algorithm that models internal bag structure to identify the items most relevant to the bag labels. Multiple-instance regression (MIR) operates on a set of bags with real-valued labels, each containing a set of unlabeled items, in which the relevance of each item to its bag label is unknown. The goal is to predict the labels of new bags from their contents...

chapter

Discovery of Internal and External Hyperclique Patterns in Complex Graph Databases

T. Yamamoto, T. Ozaki, T. Ohkawa

2008 IEEE International Conference on Data Mining Workshops > 301 - 309

In some applications, the whole structure of the target data can be represented naturally as "multi-structured graphs", which are complex graphs whose vertices consist of a set of structured data such as itemsets, sequences, and so on. To capture the strong affinity relationships in multi-structured graphs, in this paper we propose an algorithm named HFMG to discover novel and meaningful frequent...

chapter

Harmonic Blind Sound Source Isolation Enhanced by Spectrum Clustering

Xin Zhang, Wenxin Jiang, Z.W. Ras

2008 IEEE International Conference on Data Mining Workshops > 310 - 319

Automatic indexing of music by instruments and their types is a challenging problem, especially when multiple instruments are playing at the same time. We have built a database containing more than one million musical instrument sounds, each described by a large number of features including standard MPEG7 audio descriptors, features for speech recognition, and many new audio features developed by...

chapter

A Spatio-temporal Simulation Model for Movement Data Generation

D. Alberg, M. Last, S. Elnekave

2008 IEEE International Conference on Data Mining Workshops > 320 - 325

The real-world process of generating a large spatio-temporal data collection presents a very difficult technical problem. First, this process is very expensive, requiring many high-technology software tools and installations of modern hardware infrastructure (sensors, servers, GPS infrastructure, etc.); second, the recorded trajectories sometimes cannot represent any special traffic or movement...

chapter

Geographic Knowledge Discovery in INGENS: An Inductive Database Perspective

A. Appice, A. Ciampi, A. Lanza, D. Malerba, more

2008 IEEE International Conference on Data Mining Workshops > 326 - 331

INGENS is a prototype GIS that integrates a geographic knowledge discovery engine to mine several kinds of spatial KDD objects from the topographic maps stored in a spatial database. In this paper we describe the main principles of an inductive spatial database in INGENS. An inductive database makes it possible to keep permanent KDD objects and to integrate database technology with systems for the geographic knowledge...

chapter

Enriching Spatial OLAP with Map Generalization: a Conceptual Multidimensional Model

S. Bimonte, J. Gensel, M. Bertolotto

2008 IEEE International Conference on Data Mining Workshops > 332 - 341

Map generalization is used to derive maps for secondary scales and/or specific goals. This operation greatly benefits spatial decision support systems, as it can provide a global and simplified representation of a phenomenon while discarding irrelevant information. The recent popularity of OLAP systems in various application domains has generated much interest in the development of spatial OLAP (SOLAP)...

chapter

Risk Assessment of Atmospheric Hazard Releases Using K-Means Clustering

G. Cervone, P. Franzese, Y. Ezber, Z. Boybeyi

2008 IEEE International Conference on Data Mining Workshops > 342 - 348

Unsupervised machine learning algorithms are used to perform statistical analysis of several transport and dispersion model runs which simulate emissions from a fixed source under different atmospheric conditions. A clustering algorithm is used to automatically group the results of the transport and dispersion simulations according to their respective cloud characteristics. Each cluster of clouds...

chapter

A Robust Graph-Based Algorithm for Detection and Characterization of Anomalies in Noisy Multivariate Time Series

Haibin Cheng, Pang-Ning Tan, C. Potter, S. Klooster

2008 IEEE International Conference on Data Mining Workshops > 349 - 358

Detection of anomalies in multivariate time series is an important data mining task with potential applications in medical diagnosis, ecosystem modeling, and network traffic monitoring. In this paper, we present a robust graph-based algorithm for detecting anomalies in noisy multivariate time series data. A key feature of the algorithm is the alignment of kernel matrices constructed from the time...

chapter

Kernels for the Investigation of Localized Spatiotemporal Transitions of Drought with Support Vector Machines

M.W. Collier, A. McGovern

2008 IEEE International Conference on Data Mining Workshops > 359 - 368

We present and discuss several spatiotemporal kernels designed to mine real-life and simulated data in support of drought prediction. We implement and empirically validate these kernels for support vector machines. Issues related to the nature of geographic data such as autocorrelation and directionality are investigated.

chapter

Standards-Based Coastal Sensor Web

S.S. Durbha, R.L. King, N.H. Younan, S.A. Rajender, more

2008 IEEE International Conference on Data Mining Workshops > 369 - 374

Coastal buoys and stations provide frequent, high quality marine observations for oceanographic study, weather service, atmospheric and public safety. Sharing of the generated data sets requires tremendous efforts and coordination among the different sensor network agencies to come to a shared understanding and for dissemination in a uniform way. Syntactic standardization provides data description...
