2008 IEEE International Conference on Data Mining Workshops

Items from 1 to 20 out of 86 results

chapter

Using Contextual Information in Transactional Segmentation: An Empirical Study in E-Commerce

M.F. Faraone, M. Gorgoglione, C. Palmisano

2008 IEEE International Conference on Data Mining Workshops > 796 - 805

2008 IEEE International Conference on Data Mining Workshops

The growing complexity and variability characterizing markets have induced scholars and marketers to propose new segmentation approaches. Recent research has shown that including the context in which a transaction occurs in customer behavior models, improves the ability of predicting their behavior. However, no systematic research has studied whether contextual information really matters in market...

chapter

Semantic Analysis Method for Unstructured Data in Telecom Services

M. Iwashita, K. Nishimatsu, S. Shimogawa

2008 IEEE International Conference on Data Mining Workshops > 789 - 795

2008 IEEE International Conference on Data Mining Workshops

A variety of services have recently been provided depending on highly developed networks and personal equipment. With these advances, connecting this equipment has become increasingly more complicated. Problems such as an increase in no-connection and determining the cause have become difficult in some cases because software is often updated to keep up with advancements in services or security. Telecom...

chapter

Keyword Extraction Based on Lexical Chains and Word Co-occurrence for Chinese News Web Pages

Xinghua Li, Xindong Wu, Xuegang Hu, Fei Xie, more

2008 IEEE International Conference on Data Mining Workshops > 744 - 751

2008 IEEE International Conference on Data Mining Workshops

This paper presents a new keyword extraction algorithm for Chinese news Web pages using lexical chains and word co-occurrence combined with frequency features, cohesion features, and corelation features. A lexical chain is an external performance consistency by semantically related words of a text, and is the representation of the semantic content of a portion of the text. Word co-occurrence distribution...

chapter

Semantic Features for Multi-view Semi-supervised and Active Learning of Text Classification

Shiliang Sun

2008 IEEE International Conference on Data Mining Workshops > 731 - 735

2008 IEEE International Conference on Data Mining Workshops

For multi-view learning, existing methods usually exploit originally provided features for classifier training, which ignore the latent correlation between different views. In this paper, semantic features integrating information from multiple views are extracted for pattern representation. Canonical correlation analysis is used to learn the representation of semantic spaces where semantic features...

chapter

Mining Allocating Patterns in One-Sum Weighted Items

Y.J. Wang, Xinwei Zheng, F. Coenen, C.Y. Li

2008 IEEE International Conference on Data Mining Workshops > 592 - 598

2008 IEEE International Conference on Data Mining Workshops

An association rule (AR) is a common knowledge model in data mining that describes an implicative co-occurring relationship between two disjoint sets of binary-valued transaction database attributes (items), expressed in the form of an "antecedent rArr consequent" rule. A variant of the AR is the weighted association rule (WAR). With regard to a marketing context, this paper introduces a...

chapter

Efficient Distance Computation Using SQL Queries and UDFs

S.K. Pitchaimalai, C. Ordonez, C. Garcia-Alvarado

2008 IEEE International Conference on Data Mining Workshops > 533 - 542

2008 IEEE International Conference on Data Mining Workshops

Distance computation is one of the most computationally intensive operations employed by many data mining algorithms. Performing such matrix computations within a DBMS creates many optimization challenges. We propose techniques to efficiently compute Euclidean distance using SQL queries and user-defined functions (UDFs). We concentrate on efficient Euclidean distance computation for the well-known...

chapter

Co-training by Committee: A New Semi-supervised Learning Framework

M. Hady, F. Schwenker

2008 IEEE International Conference on Data Mining Workshops > 563 - 572

2008 IEEE International Conference on Data Mining Workshops

For many data mining applications, it is necessary to develop algorithms that use unlabeled data to improve the accuracy of the supervised learning. Co-Training is a popular semi-supervised learning algorithm. It assumes that each example is represented by two or more redundantly sufficient sets of features (views) and these views are independent given the class. However, these assumptions are not...

chapter

Stream-Close: Fast Mining of Closed Frequent Itemsets in High Speed Data Streams

B.N. Ranganath, M.N. Murty

2008 IEEE International Conference on Data Mining Workshops > 516 - 525

2008 IEEE International Conference on Data Mining Workshops

With the emergence of large-volume and high-speed streaming data, the recent techniques for stream mining of CFIpsilas (closed frequent itemsets) will become inefficient. When concept drift occurs at a slow rate in high speed data streams, the rate of change of information across different sliding windows will be negligible. So, the user wonpsilat be devoid of change in information if we slide window...

chapter

Service Oriented KDD: A Framework for Grid Data Mining Workflows

M. Lackovic, D. Talia, P. Trunfio

2008 IEEE International Conference on Data Mining Workshops > 496 - 505

2008 IEEE International Conference on Data Mining Workshops

Weka4WS is an extension of the Weka toolkit to support remote execution of data mining tasks as grid services. A first version of Weka4WS supporting concurrent execution of multiple data mining tasks on remote grid nodes has been presented in a previous work. In this paper we present a new version supporting also the composition and execution of data mining workflows on a grid. This new version of...

chapter

Behavior Informatics and Analytics: Let Behavior Talk

Longbing Cao

2008 IEEE International Conference on Data Mining Workshops > 87 - 96

2008 IEEE International Conference on Data Mining Workshops

Behavior is increasingly recognized as a key component in business intelligence and problem-solving. Different from traditional behavior analysis, which mainly focus on implicit behavior and explicit business appearance as a result of business usage and customer demographics, this paper proposes the field of Behavior Informatics and Analytics (BIA), to support explicit behavior involvement through...

chapter

Mining Temporal Patterns with Quantitative Intervals

T. Guyet, R. Quiniou

2008 IEEE International Conference on Data Mining Workshops > 218 - 227

2008 IEEE International Conference on Data Mining Workshops

In this paper we consider the problem of discovering frequent temporal patterns in a database of temporal sequences, where a temporal sequence is a set of items with associated dates and durations. Since the quantitative temporal information appears to be fundamental in many contexts, it is taken into account in the mining processes and returned as part of the extracted knowledge. To this end, we...

chapter

Actionable Knowledge Discovery for Threats Intelligence Support Using a Multi-dimensional Data Mining Methodology

O. Thonnard, M. Dacier

2008 IEEE International Conference on Data Mining Workshops > 154 - 163

2008 IEEE International Conference on Data Mining Workshops

This paper describes a multi-dimensional knowledge discovery and data mining (KDD) methodology that aims at discovering actionable knowledge related to Internet threats, taking into account domain expert guidance and the integration of domain-specific intelligence during the data mining process. The objectives are twofold: i) to develop global indicators for assessing the prevalence of certain malicious...

chapter

A Vector-Geometry Based Spatial kNN-Algorithm for Traffic Frequency Predictions

M. May, D. Hecker, C. Korner, S. Scheider, more

2008 IEEE International Conference on Data Mining Workshops > 442 - 447

2008 IEEE International Conference on Data Mining Workshops

We introduce s-kNN, a nearest neighbor based spatial data mining algorithm. It belongs to the class of vector-geometry based algorithms that reason on complex spatial objects instead of point measurements. In contrast to most methods in this class, it does on the fly spatial computations that cannot be replaced by a pre-processing step without sacrificing efficiency. The key is a partial evaluation...

chapter

Discovering Triggering Events from Longitudinal Data

C. Loglisci, D. Malerba

2008 IEEE International Conference on Data Mining Workshops > 248 - 256

2008 IEEE International Conference on Data Mining Workshops

Longitudinal data consist of the repeated measurements of some variables which describe the dynamics of a domain(process or phenomenon) over time. They can be analyzed in order to explain what event may cause the transition from a state into the next one during the evolution of the domain. Generally, approaches to this explanation problem rely on the exclusive usage of domain knowledge, while an analysis...

chapter

A Spatio-temporal Simulation Model for Movement Data Generation

D. Alberg, M. Last, S. Elnekave

2008 IEEE International Conference on Data Mining Workshops > 320 - 325

2008 IEEE International Conference on Data Mining Workshops

The real-world process of generating a large spatio-temporal data collection presents a very difficult technical problem. First, this process is very expensive, requiring a lot of various high-technology software tools and modern hardware infrastructure (sensors, servers, GPS infrastructure etc.) installations; second, the recorded trajectories sometimes cannot represent any special traffic or movement...

chapter

Discovery of Internal and External Hyperclique Patterns in Complex Graph Databases

T. Yamamoto, T. Ozaki, T. Ohkawa

2008 IEEE International Conference on Data Mining Workshops > 301 - 309

2008 IEEE International Conference on Data Mining Workshops

In some applications, the whole structure of the target data can be represented naturally in "multi-structured graphs" that are complex graphs whose vertices consist of aset of structured data such as itemsets, sequences and so on. To catch the strong affinity relationship in multi-structured graphs, in this paper, we propose an algorithm named HFMG to discover novel and meaningful frequent...

chapter

Association Action Rules

Z.W. Ras, A. Dardzinska, L.-S. Tsay, H. Wasyluk

2008 IEEE International Conference on Data Mining Workshops > 283 - 290

2008 IEEE International Conference on Data Mining Workshops

Action rules describe possible transitions of objects from one state to another with respect to a distinguished attribute. Previous research on action rule discovery usually required the extraction of classification rules before constructing any action rule. This paper gives anew approach for generating association-type action rules. The notion of frequent action sets and Apriori-like strategy generating...

chapter

Extension of Partitional Clustering Methods for Handling Mixed Data

Y. Naija, S. Chakhar, K. Blibech, R. Robbana

2008 IEEE International Conference on Data Mining Workshops > 257 - 266

2008 IEEE International Conference on Data Mining Workshops

Clustering is an active research topic in data mining and different methods have been proposed in the literature. Most of these methods are based on the use of a distance measure defined either on numerical attributes or on categorical attributes. However, in fields such as road traffic and medicine, datasets are composed of numerical and categorical attributes. Recently, there have been several proposals...

chapter

Word Sense Discovery for Web Information Retrieval

T. Nykiel, H. Rybinski

2008 IEEE International Conference on Data Mining Workshops > 267 - 274

2008 IEEE International Conference on Data Mining Workshops

Word meaning disambiguation has always been an important problem in many computer science tasks, such as information retrieval and extraction. One of the problems,faced in automatic word sense discovery, is the number of different senses a word can have. Often, senses are dominated by some other, more frequent ones. Discovering such dominated meanings can significantly improve quality of many text-related...

chapter

Geographic Knowledge Discovery in INGENS: An Inductive Database Perspective

A. Appice, A. Ciampi, A. Lanza, D. Malerba, more

2008 IEEE International Conference on Data Mining Workshops > 326 - 331

2008 IEEE International Conference on Data Mining Workshops

INGENS is a prototype of GIS which integrates a geographic knowledge discovery engine to mine several kinds of spatial KDD objects from the topographic maps stored in a spatial database. In this paper we describe the main principles of an inductive spatial database in INGENS. Inductive database allows to keep permanent KDD objects and integrate database technology with systems for the geographic knowledge...

Keywords:
DATA MINING

Publication date

Set your own date range

Keywords

DATABASES (19)
CLASSIFICATION ALGORITHMS (18)
DATA MODELS (16)
ASSOCIATION RULES (13)
ALGORITHM DESIGN AND ANALYSIS (12)
CONFERENCES (12)
ACCURACY (10)
CLUSTERING ALGORITHMS (10)
PATTERN CLUSTERING (10)
ITEMSETS (9)
PATTERN CLASSIFICATION (9)
TRAINING (9)
KNOWLEDGE DISCOVERY (8)
LEARNING (ARTIFICIAL INTELLIGENCE) (8)
DECISION TREES (7)
FEATURE EXTRACTION (7)
STATISTICAL ANALYSIS (7)
CORRELATION (6)
INDEXES (6)
INTERNET (6)
BIOLOGICAL SYSTEM MODELING (5)
COMPUTATIONAL MODELING (5)
DATABASE MANAGEMENT SYSTEMS (5)
DISTANCE MEASUREMENT (5)
GRAPH MINING (5)
GRAPH THEORY (5)
KERNEL (5)
ONTOLOGIES (ARTIFICIAL INTELLIGENCE) (5)
QUERY PROCESSING (5)
RELIABILITY (5)
SPATIAL DATABASES (5)
TEXT ANALYSIS (5)
TRAINING DATA (5)
VISUAL DATABASES (5)
WEB SERVICES (5)
BUSINESS (4)
EQUATIONS (4)
LABELING (4)
MARKETING (4)
MATHEMATICAL MODEL (4)
MATRIX ALGEBRA (4)
MERGING (4)
ONTOLOGIES (4)
SOFTWARE (4)
WEB PAGES (4)
ANALYTICAL MODELS (3)
ATMOSPHERIC MEASUREMENTS (3)
BIOLOGY (3)
BUILDINGS (3)
CITIES AND TOWNS (3)
CLASSIFICATION (3)
DATA ANALYSIS (3)
DATA VISUALISATION (3)
DATA VISUALIZATION (3)
DISTRIBUTED DATABASES (3)
ENGINES (3)
ESTIMATION (3)
EVOLUTION (BIOLOGY) (3)
GEOGRAPHIC INFORMATION SYSTEMS (3)
GRAPHICS (3)
HUMANS (3)
INFORMATION EXTRACTION (3)
KNOWLEDGE ENGINEERING (3)
MACHINE LEARNING (3)
MAINTENANCE ENGINEERING (3)
MANGANESE (3)
MEDICAL COMPUTING (3)
OPTIMIZATION (3)
PREDICTIVE MODELS (3)
PROBABILITY (3)
PROPOSALS (3)
PROTEINS (3)
SECURITY (3)
SERVERS (3)
SOCIAL NETWORK SERVICES (3)
SOCIAL NETWORKING (ONLINE) (3)
SPATIAL DATA MINING (3)
TEXT MINING (3)
WAVELET TRANSFORMS (3)
AMINO ACIDS (2)
ANOMALY DETECTION (2)
APPROXIMATION METHODS (2)
APRIORI ALGORITHM (2)
ARTIFICIAL INTELLIGENCE (2)
ASSOCIATION RULE MINING (2)
BEHAVIOURAL SCIENCES COMPUTING (2)
BLOGS (2)
CAD (2)
CARTOGRAPHY (2)
CLASSIFICATION TREE ANALYSIS (2)
CLUSTERING (2)
CLUSTERING METHODS (2)
CO-TRAINING (2)
COMPLEXITY THEORY (2)
COMPUTATIONAL COMPLEXITY (2)
COMPUTER SCIENCE (2)
CORRELATION MINING (2)
DATA CLUSTERING (2)
DATA HANDLING (2)
more

INFONA - science communication portal

2008 IEEE International Conference on Data Mining Workshops

Using Contextual Information in Transactional Segmentation: An Empirical Study in E-Commerce

Semantic Analysis Method for Unstructured Data in Telecom Services

Keyword Extraction Based on Lexical Chains and Word Co-occurrence for Chinese News Web Pages

Semantic Features for Multi-view Semi-supervised and Active Learning of Text Classification

Mining Allocating Patterns in One-Sum Weighted Items

Efficient Distance Computation Using SQL Queries and UDFs

Co-training by Committee: A New Semi-supervised Learning Framework

Stream-Close: Fast Mining of Closed Frequent Itemsets in High Speed Data Streams

Service Oriented KDD: A Framework for Grid Data Mining Workflows

Behavior Informatics and Analytics: Let Behavior Talk

Mining Temporal Patterns with Quantitative Intervals

Actionable Knowledge Discovery for Threats Intelligence Support Using a Multi-dimensional Data Mining Methodology

A Vector-Geometry Based Spatial kNN-Algorithm for Traffic Frequency Predictions

Discovering Triggering Events from Longitudinal Data

A Spatio-temporal Simulation Model for Movement Data Generation

Discovery of Internal and External Hyperclique Patterns in Complex Graph Databases

Association Action Rules

Extension of Partitional Clustering Methods for Handling Mixed Data

Word Sense Discovery for Web Information Retrieval

Geographic Knowledge Discovery in INGENS: An Inductive Database Perspective

Filter options

Publication date

Keywords

INFONA - science communication portal

2008 IEEE International Conference on Data Mining Workshops $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2008 IEEE International Conference on Data Mining Workshops