2008 IEEE International Conference on Data Mining Workshops

Items from 61 to 80 out of 139 results

chapter

Detection and Exploration of Outlier Regions in Sensor Data Streams

C. Franke, M. Gertz

2008 IEEE International Conference on Data Mining Workshops > 375 - 384

2008 IEEE International Conference on Data Mining Workshops

Sensor networks play an important role in applications concerned with environmental monitoring, disaster management, and policy making. Effective and flexible techniques are needed to explore unusual environmental phenomena in sensor readings that are continuously streamed to applications. In this paper, we propose a framework that allows to detect outlier sensors and to efficiently construct outlier...

chapter

Data Mining for Climate Change and Impacts

A.R. Ganguly, K. Steinhaeuser

2008 IEEE International Conference on Data Mining Workshops > 385 - 394

2008 IEEE International Conference on Data Mining Workshops

Knowledge discovery from temporal, spatial and spatiotemporal data is critical for climate change science and climate impacts. Climate statistics is a mature area. However, recent growth in observations and model outputs, combined with the increased availability of geographical data, presents new opportunities for data miners. This paper maps climate requirements to solutions available in temporal,...

chapter

RE-SPaM: Using Regular Expressions for Sequential Pattern Mining in Trajectory Databases

L.I. Gomez, A.A. Vaisman

2008 IEEE International Conference on Data Mining Workshops > 395 - 398

2008 IEEE International Conference on Data Mining Workshops

In sequential pattern mining, languages based on regular expressions (RE) were proposed to restrict frequent sequences to the ones that satisfy user-specified constraints. In these languages, REs are applied over items. We propose a much powerful language, based on regular expressions, denoted RE-SPaM, where the basic elements are constraints over the attributes of the items. Expressions in this language...

chapter

Incremental Maintenance of Discovered Spatial Colocation Patterns

Jiangfeng He, Qinming He, Feng Qian, Qi Chen

2008 IEEE International Conference on Data Mining Workshops > 399 - 407

2008 IEEE International Conference on Data Mining Workshops

Unlike the traditional incremental updating problem for discrete data, the appended data to spatial dataset may introduce lots of new relations between the added events and the existing events. Moreover, as the measure in mining of colocation patterns, participation index is complicated to handle compared with simply support counter. Thus, the incremental maintenance of colocation patterns for dynamic...

chapter

Speeding up Array Query Processing by Just-In-Time Compilation

C. Jucovschi, P. Baumann, S. Stancu-Mara

2008 IEEE International Conference on Data Mining Workshops > 408 - 413

2008 IEEE International Conference on Data Mining Workshops

Interpreted languages frequently suffer from higher processing times as compared to compiled approaches. Typically this happens when complex computations are performed. Array DBMSs, which extend database functionality with multidimensional array modeling and query support, find themselves in exactly this situation: queries often involve a large number of operations, and each such operation is applied...

chapter

Extraction of Discriminative Features from Hyperspectral Data

H. Kalkan, Y. Yardimci

2008 IEEE International Conference on Data Mining Workshops > 414 - 419

2008 IEEE International Conference on Data Mining Workshops

This paper presents a method to discover the discriminative patterns or features in hyperspectral data for classification. The proposed method searches the data space along both spectral and spatial frequency axis and combines the adjacent spectral and spatial frequency bands so that a simpler but more effective feature set is achieved. The algorithm is tested on hyperspectral images of hazelnut kernels...

chapter

Scalable Sparse Bayesian Network Learning for Spatial Applications

T. Liebig, C. Korner, M. May

2008 IEEE International Conference on Data Mining Workshops > 420 - 425

2008 IEEE International Conference on Data Mining Workshops

Traffic routes through a street network contain patterns and are no random walks. Such patterns exist for instance along streets or between neighbouring street segments. The extraction of these patterns is a challenging task due to the enormous size of city street networks, the large number of required training data and the unknown distribution of the latter. We apply Bayesian Networks to model the...

chapter

High Granularity Remote Sensing and Crop Production over Space and Time: NDVI over the Growing Season and Prediction of Cotton Yields at the Farm Field Level in Texas

B. Little, M. Schucking, B. Gartrell, Bing Chen, more

2008 IEEE International Conference on Data Mining Workshops > 426 - 435

2008 IEEE International Conference on Data Mining Workshops

Remote sensing has been applied to agriculture at very coarse levels of granularity (i.e., national levels) but few investigations have focused on yield prediction at the farm unit level. Specific aims of the present investigation are to analyze the ability of Moderate Resolution Imaging Spectroradiometer (MODIS) data to predict cotton yields in two highly homogeneous counties in west Texas. In one...

chapter

Web Query Prediction by Unifying Model

Ning Liu, J. Yan, Shuicheng Yan, Weiguo Fan, more

2008 IEEE International Conference on Data Mining Workshops > 436 - 441

2008 IEEE International Conference on Data Mining Workshops

Recently, many commercial products, such as Google Trends and Yahoo! Buzz, are released to monitor the past search engine query frequency trend. However, little research has been devoted for predicting the upcoming query trend, which is of great importance in providing guidelines for future business planning. In this paper, a unified solution is presented for such a purpose. Besides the classical...

chapter

A Vector-Geometry Based Spatial kNN-Algorithm for Traffic Frequency Predictions

M. May, D. Hecker, C. Korner, S. Scheider, more

2008 IEEE International Conference on Data Mining Workshops > 442 - 447

2008 IEEE International Conference on Data Mining Workshops

We introduce s-kNN, a nearest neighbor based spatial data mining algorithm. It belongs to the class of vector-geometry based algorithms that reason on complex spatial objects instead of point measurements. In contrast to most methods in this class, it does on the fly spatial computations that cannot be replaced by a pre-processing step without sacrificing efficiency. The key is a partial evaluation...

chapter

Detecting and Tracking Spatio-temporal Clusters with Adaptive History Filtering

J. Rosswog, K. Ghose

2008 IEEE International Conference on Data Mining Workshops > 448 - 457

2008 IEEE International Conference on Data Mining Workshops

This paper addresses the problem of detecting and tracking moving clusters in spatio-temporal data sets. Spatio-temporal data sets contain data elements that move in space over time. Traditional data clustering algorithms work well on static data sets that contain well separated clusters. When traditional techniques are applied to spatio-temporal data they breakdown when the moving data elements intersect...

chapter

A Semi-supervised Learning Algorithm for Recognizing Sub-classes

R.R. Vatsavai, S. Shekhar, B. Bhaduri

2008 IEEE International Conference on Data Mining Workshops > 458 - 467

2008 IEEE International Conference on Data Mining Workshops

In many practical situations it is not feasible to collect labeled samples for all available classes in a domain. Especially in supervised classification of remotely sensed images it is impossible to collect ground truth information over large geographic regions for all thematic classes. As a result often analysts collect labels for aggregate classes (e.g., Forest, Agriculture, Urban). In this paper...

chapter

Mining Unstructured Text at Gigabyte per Second Speeds

A. Ratner

2008 IEEE International Conference on Data Mining Workshops > 468 - 476

2008 IEEE International Conference on Data Mining Workshops

Humans communicate with text in thousands of languages, in dozens of scripts, in a variety of binary codes, on millions of topics. There is a need, for both government and commercial applications, to identify these text characteristics to enable follow-on processing such as transcoding, translation, transliteration, routing and prioritization. This paper deals with the implementation of real-time...

chapter

Chi-Square Test Based Decision Trees Induction in Distributed Environment

Jie Ouyang, N. Patel, I.K. Sethi

2008 IEEE International Conference on Data Mining Workshops > 477 - 485

2008 IEEE International Conference on Data Mining Workshops

The decision tree-based classification is a popular approach for pattern recognition and data mining. Most decision tree induction methods assume training data being present at one central location. Given the growth in distributed databases at geographically dispersed locations, the methods for decision tree induction in distributed settings are gaining importance. This paper describes one distributed...

chapter

Distributed Data Mining Models as Services on the Grid

E. Cesario, D. Talia

2008 IEEE International Conference on Data Mining Workshops > 486 - 495

2008 IEEE International Conference on Data Mining Workshops

This paper describes how distributed data mining models, such as collective learning, ensemble learning, and meta-learning models, can be implemented as WSRF mining services by exploiting the Grid infrastructure. Our goal is to design a general distributed architectural model that can be exploited for different distributed mining algorithms deployed as Grid services for the analysis of dispersed data...

chapter

Service Oriented KDD: A Framework for Grid Data Mining Workflows

M. Lackovic, D. Talia, P. Trunfio

2008 IEEE International Conference on Data Mining Workshops > 496 - 505

2008 IEEE International Conference on Data Mining Workshops

Weka4WS is an extension of the Weka toolkit to support remote execution of data mining tasks as grid services. A first version of Weka4WS supporting concurrent execution of multiple data mining tasks on remote grid nodes has been presented in a previous work. In this paper we present a new version supporting also the composition and execution of data mining workflows on a grid. This new version of...

chapter

Exploiting Graphic Card Processor Technology to Accelerate Data Mining Queries in SAP NetWeaver BIA

C. Weyerhaeuser, T. Mindnich, F. Faerber, W. Lehner

2008 IEEE International Conference on Data Mining Workshops > 506 - 515

2008 IEEE International Conference on Data Mining Workshops

Within business Intelligence contexts, the importance of data mining algorithms is continuously increasing, particularly from the perspective of applications and users that demand novel algorithms on the one hand and an efficient implementation exploiting novel system architectures on the other hand. Within this paper, we focus on the latter issue and report our experience with the exploitation of...

chapter

Stream-Close: Fast Mining of Closed Frequent Itemsets in High Speed Data Streams

B.N. Ranganath, M.N. Murty

2008 IEEE International Conference on Data Mining Workshops > 516 - 525

2008 IEEE International Conference on Data Mining Workshops

With the emergence of large-volume and high-speed streaming data, the recent techniques for stream mining of CFIpsilas (closed frequent itemsets) will become inefficient. When concept drift occurs at a slow rate in high speed data streams, the rate of change of information across different sliding windows will be negligible. So, the user wonpsilat be devoid of change in information if we slide window...

chapter

Parallel Hierarchical Clustering on Market Basket Data

Baoying Wang, Qin Ding, I. Rahal

2008 IEEE International Conference on Data Mining Workshops > 526 - 532

2008 IEEE International Conference on Data Mining Workshops

Data clustering has been proven to be a promising data mining technique. Recently, there have been many attempts for clustering market-basket data. In this paper, we propose a parallelized hierarchical clustering approach on market-basket data (PH-Clustering), which is implemented using MPI. Based on the analysis of the major clustering steps, we adopt a partial local and partial global approach to...

chapter

Efficient Distance Computation Using SQL Queries and UDFs

S.K. Pitchaimalai, C. Ordonez, C. Garcia-Alvarado

2008 IEEE International Conference on Data Mining Workshops > 533 - 542

2008 IEEE International Conference on Data Mining Workshops

Distance computation is one of the most computationally intensive operations employed by many data mining algorithms. Performing such matrix computations within a DBMS creates many optimization challenges. We propose techniques to efficiently compute Euclidean distance using SQL queries and user-defined functions (UDFs). We concentrate on efficient Euclidean distance computation for the well-known...

Publication date

Set your own date range

Keywords

DATA MINING (86)
CLASSIFICATION ALGORITHMS (29)
DATABASES (23)
DATA MODELS (19)
LEARNING (ARTIFICIAL INTELLIGENCE) (19)
CLUSTERING ALGORITHMS (18)
TRAINING (18)
DISTANCE MEASUREMENT (17)
FEATURE EXTRACTION (16)
ACCURACY (15)
PATTERN CLUSTERING (15)
ALGORITHM DESIGN AND ANALYSIS (14)
CONFERENCES (14)
PATTERN CLASSIFICATION (14)
ASSOCIATION RULES (13)
QUERY PROCESSING (11)
INTERNET (10)
ITEMSETS (10)
INDEXES (8)
KNOWLEDGE DISCOVERY (8)
PREDICTIVE MODELS (8)
STATISTICAL ANALYSIS (8)
COMPUTATIONAL MODELING (7)
CORRELATION (7)
DATABASE MANAGEMENT SYSTEMS (7)
DECISION TREES (7)
ESTIMATION (7)
GRAPH THEORY (7)
KERNEL (7)
MATHEMATICAL MODEL (7)
ONTOLOGIES (ARTIFICIAL INTELLIGENCE) (7)
TEXT ANALYSIS (7)
TRAINING DATA (7)
OPTIMIZATION (6)
RELIABILITY (6)
VISUAL DATABASES (6)
WEB SERVICES (6)
BIOLOGICAL SYSTEM MODELING (5)
CLASSIFICATION (5)
CLUSTERING (5)
FILTERING (5)
GRAPH MINING (5)
MACHINE LEARNING (5)
MARKETING (5)
MATRIX ALGEBRA (5)
MERGING (5)
ONTOLOGIES (5)
PROTEINS (5)
SPATIAL DATABASES (5)
APPROXIMATION METHODS (4)
BIOLOGY (4)
BUILDINGS (4)
BUSINESS (4)
CITIES AND TOWNS (4)
DATA ANALYSIS (4)
ENGINES (4)
EQUATIONS (4)
HIDDEN MARKOV MODELS (4)
HUMANS (4)
IMAGE CLASSIFICATION (4)
LABELING (4)
LEARNING SYSTEMS (4)
METEOROLOGY (4)
NOISE (4)
PEDIATRICS (4)
PROBABILITY (4)
PROPOSALS (4)
REDUNDANCY (4)
REGRESSION ANALYSIS (4)
REMOTE SENSING (4)
SET THEORY (4)
SOCIAL NETWORK SERVICES (4)
SOFTWARE (4)
SUPPORT VECTOR MACHINES (4)
TIME SERIES ANALYSIS (4)
WEB PAGES (4)
AGRICULTURE (3)
AMINO ACIDS (3)
ANALYTICAL MODELS (3)
ANOMALY DETECTION (3)
ATMOSPHERIC MEASUREMENTS (3)
BENCHMARK TESTING (3)
CLASSIFICATION TREE ANALYSIS (3)
COMPANIES (3)
COMPLEXITY THEORY (3)
COMPUTER SCIENCE (3)
CONSUMER BEHAVIOUR (3)
DATA HANDLING (3)
DATA VISUALISATION (3)
DATA VISUALIZATION (3)
DELAY (3)
DISTRIBUTED DATABASES (3)
EVOLUTION (BIOLOGY) (3)
GEOGRAPHIC INFORMATION SYSTEMS (3)
GRAPHICS (3)
IMAGE COLOR ANALYSIS (3)
IMAGE SEQUENCES (3)
INFORMATION EXTRACTION (3)
IP NETWORKS (3)
KNOWLEDGE ENGINEERING (3)
more

INFONA - science communication portal

2008 IEEE International Conference on Data Mining Workshops

Detection and Exploration of Outlier Regions in Sensor Data Streams

Data Mining for Climate Change and Impacts

RE-SPaM: Using Regular Expressions for Sequential Pattern Mining in Trajectory Databases

Incremental Maintenance of Discovered Spatial Colocation Patterns

Speeding up Array Query Processing by Just-In-Time Compilation

Extraction of Discriminative Features from Hyperspectral Data

Scalable Sparse Bayesian Network Learning for Spatial Applications

High Granularity Remote Sensing and Crop Production over Space and Time: NDVI over the Growing Season and Prediction of Cotton Yields at the Farm Field Level in Texas

Web Query Prediction by Unifying Model

A Vector-Geometry Based Spatial kNN-Algorithm for Traffic Frequency Predictions

Detecting and Tracking Spatio-temporal Clusters with Adaptive History Filtering

A Semi-supervised Learning Algorithm for Recognizing Sub-classes

Mining Unstructured Text at Gigabyte per Second Speeds

Chi-Square Test Based Decision Trees Induction in Distributed Environment

Distributed Data Mining Models as Services on the Grid

Service Oriented KDD: A Framework for Grid Data Mining Workflows

Exploiting Graphic Card Processor Technology to Accelerate Data Mining Queries in SAP NetWeaver BIA

Stream-Close: Fast Mining of Closed Frequent Itemsets in High Speed Data Streams

Parallel Hierarchical Clustering on Market Basket Data

Efficient Distance Computation Using SQL Queries and UDFs

Filter options

Publication date

Keywords

INFONA - science communication portal

2008 IEEE International Conference on Data Mining Workshops $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2008 IEEE International Conference on Data Mining Workshops