Data Mining (ICDM), 2010 IEEE 10th International Conference on

chapter

Cover Art

2010 IEEE International Conference on Data Mining > C1

2010 10th IEEE International Conference on Data Mining (ICDM 2010)

chapter

Title Page i

2010 IEEE International Conference on Data Mining > i

The following topics are dealt with: data mining; local clustering; spatiotemporal event detection; time series; Markov models; email classification; data stream; parallel mining; Bayesian network; unsupervised learning; missing values prediction; anomaly detection; decision tree; binary classifier; data similarity matrix; data mapping; support vector machine; MapReduce; document similarity; social...

chapter

Title Page iii

2010 IEEE International Conference on Data Mining > iii

chapter

Copyright Page

2010 IEEE International Conference on Data Mining > iv

chapter

Organizing Committee

2010 IEEE International Conference on Data Mining > xix - xxi

chapter

Message from the Program Committee Co-Chairs

2010 IEEE International Conference on Data Mining > xvii - xviii

chapter

Welcome Message from the Conference Chairs

2010 IEEE International Conference on Data Mining > xv - xvi

chapter

Program Committee

2010 IEEE International Conference on Data Mining > xxii - xxviii

chapter

Mining Billion-node Graphs: Patterns, Generators and Tools

C Faloutsos

2010 IEEE International Conference on Data Mining > 5

What do graphs look like? How do they evolve over time? How to handle a graph with a billion nodes? We present a comprehensive list of static and temporal laws, and some recent observations on real graphs (e.g., "eigenSpokes"). For generators, we describe some recent ones, which naturally match all of the known properties of real graphs. Finally, for tools, we present "oddball"...

chapter

Assessing the Significance of Groups in High-Dimensional Data

G McLachlan

2010 IEEE International Conference on Data Mining > 6

Summary form only given. We consider the problem of assessing the significance of groups in high-dimensional data. In the case of supervised classification where there are data of known origin with respect to the groups under consideration, a guide to the degree of separation among the groups can be given in terms of the estimated error rate of a classifier formed to allocate a new observation...

chapter

10 Years of Data Mining Research: Retrospect and Prospect

Xindong Wu

2010 IEEE International Conference on Data Mining > 7

chapter

Detecting Novel Discrepancies in Communication Networks

J Abello, T Eliassi-Rad, N Devanur

2010 IEEE International Conference on Data Mining > 8 - 17

We address the problem of detecting characteristic patterns in communication networks. We introduce a scalable approach based on set-system discrepancy. By implicitly labeling each network edge with the sequence of times in which its two endpoints communicate, we view an entire communication network as a set-system. This view allows us to use combinatorial discrepancy as a mechanism to "observe"...

chapter

Multi-agent Random Walks for Local Clustering on Graphs

M Alamgir, U von Luxburg

2010 IEEE International Conference on Data Mining > 18 - 27

We consider the problem of local graph clustering where the aim is to discover the local cluster corresponding to a point of interest. The most popular algorithms to solve this problem start a random walk at the point of interest and let it run until some stopping criterion is met. The vertices visited are then considered the local cluster. We suggest a more powerful alternative, the multi-agent random...
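The single-walk baseline that this abstract describes can be illustrated with a minimal sketch. It is not the authors' multi-agent method, which the truncated abstract does not specify. The function name `random_walk_local_cluster`, the adjacency-dictionary graph format, and the fixed step budget `max_steps` used as the stopping criterion are all assumptions made for illustration.

```python
import random


def random_walk_local_cluster(adjacency, seed_vertex, max_steps=100, rng=None):
    """Return the set of vertices visited by one random walk from seed_vertex.

    adjacency: dict mapping each vertex to a list of its neighbours.
    max_steps: hypothetical stopping criterion (a fixed step budget).
    rng: optional random.Random instance, so runs can be made repeatable.
    """
    rng = rng or random.Random()
    visited = {seed_vertex}
    current = seed_vertex
    for _ in range(max_steps):
        neighbours = adjacency.get(current, [])
        if not neighbours:  # dead end: the walk cannot continue
            break
        current = rng.choice(neighbours)  # uniform step to a neighbour
        visited.add(current)
    return visited


# Toy graph (hypothetical): two disconnected triangles.
graph = {
    "a": ["b", "c"], "b": ["a", "c"], "c": ["a", "b"],
    "x": ["y", "z"], "y": ["x", "z"], "z": ["x", "y"],
}
cluster = random_walk_local_cluster(graph, "a", max_steps=50, rng=random.Random(0))
```

In this toy graph the walk started at `"a"` can never reach the second triangle, so the visited set stays inside the component of the point of interest.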

chapter

Spatiotemporal Event Detection in Mobility Network

T S Au, Rong Duan, Heeyoung Kim, Guang-Qin Ma

2010 IEEE International Conference on Data Mining > 28 - 37

Learning and identifying events in network traffic is crucial for service providers to improve their mobility network performance. In fact, large special events attract cell phone users to relatively small areas, which causes a sudden surge in network traffic. To handle such increased load, it is necessary to measure the increased network traffic and quantify the impact of the events, so that relevant...

chapter

An Unsupervised Approach to Modeling Personalized Contexts of Mobile Users

Tengfei Bao, Happia Cao, Enhong Chen, Jilei Tian, et al.

2010 IEEE International Conference on Data Mining > 38 - 47

Mobile context modeling is a process of recognizing and reasoning about contexts and situations in a mobile environment, which is critical for the success of context-aware mobile services. While there is prior work on mobile context modeling, the use of unsupervised learning techniques for mobile context modeling is still under-explored. Indeed, unsupervised techniques have the ability to learn personalized...

chapter

Fast and Flexible Multivariate Time Series Subsequence Search

K Bhaduri, Qiang Zhu, N C Oza, A N Srivastava

2010 IEEE International Conference on Data Mining > 48 - 57

Multivariate Time-Series (MTS) are ubiquitous, and are generated in areas as disparate as sensor recordings in aerospace systems, music and video streams, medical monitoring, and financial systems. Domain experts are often interested in searching for interesting multivariate patterns from these MTS databases which can contain up to several gigabytes of data. Surprisingly, research on MTS search is...

chapter

iSAX 2.0: Indexing and Mining One Billion Time Series

A Camerra, T Palpanas, J Shieh, E Keogh

2010 IEEE International Conference on Data Mining > 58 - 67

There is an increasingly pressing need, by several applications in diverse domains, for developing techniques able to index and mine very large collections of time series. Examples of such applications come from astronomy, biology, the web, and other domains. It is not unusual for these applications to involve numbers of time series in the order of hundreds of millions to billions. However, all relevant...

chapter

Abstraction Augmented Markov Models

C Caragea, A Silvescu, D Caragea, V Honavar

2010 IEEE International Conference on Data Mining > 68 - 77

High accuracy sequence classification often requires the use of higher order Markov models (MMs). However, the number of MM parameters increases exponentially with the range of direct dependencies between sequence elements, thereby increasing the risk of overfitting when the data set is limited in size. We present abstraction augmented Markov models (AAMMs) that effectively reduce the number of numeric...
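The exponential growth mentioned in this abstract can be made concrete with a small worked count. This is a sketch of the standard parameter count for a plain order-k Markov model, not of the AAMM construction. The function name `markov_free_parameters` and the 4-symbol alphabet are assumptions chosen for illustration, and the count ignores the distribution over the first k symbols.

```python
def markov_free_parameters(alphabet_size, order):
    """Count free transition parameters of an order-k Markov model.

    There are alphabet_size ** order possible contexts, and each context
    carries a distribution over alphabet_size symbols, which has
    alphabet_size - 1 free parameters because probabilities sum to 1.
    """
    if alphabet_size < 1 or order < 0:
        raise ValueError("alphabet_size must be >= 1 and order must be >= 0")
    return alphabet_size ** order * (alphabet_size - 1)


# Hypothetical 4-symbol alphabet: the count multiplies by 4 with each extra order.
counts = [markov_free_parameters(4, k) for k in range(1, 4)]
```

Each additional order multiplies the count by the alphabet size, which is why limited training data quickly becomes a problem for higher-order models.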

chapter

A Graph-Based Approach for Multi-folder Email Classification

S Chakravarthy, A Venkatachalam, A Telang

2010 IEEE International Conference on Data Mining > 78 - 87

This paper presents a novel framework for multi-folder email classification using graph mining as the underlying technique. Although several techniques exist (e.g., SVM, TF-IDF, n-gram) for addressing this problem in a delimited context, they heavily rely on extracting high-frequency keywords, thus ignoring the inherent structural aspects of an email (or document in general) which can play a critical...
