Data Mining, 2009. ICDM '09. Ninth IEEE International Conference on

We propose novel multi-armed bandit (explore/exploit) schemes to maximize total clicks on a content module published regularly on Yahoo! Intuitively, one can "explore" each candidate item by displaying it to a small fraction of user visits to estimate the item's click-through rate (CTR), and then "exploit" high CTR items in order to maximize clicks. While bandit methods that seek to find...

chapter

Connecting Sparsely Distributed Similar Bloggers

N. Agarwal, Huan Liu, S. Subramanya, J.J. Salerno, et al.

2009 Ninth IEEE International Conference on Data Mining > 11 - 20

2009 Ninth IEEE International Conference on Data Mining (ICDM 2009)

The nature of the Blogosphere determines that the majority of bloggers are only connected with a small number of fellow bloggers, and similar bloggers can be largely disconnected from each other. Aggregating them allows for cost-effective personalized services, targeted marketing, and exploration of new business opportunities. As most bloggers have only a small number of adjacent bloggers, the problem...

chapter

Rule Ensembles for Multi-target Regression

T. Aho, B. Zenko, S. Dzeroski

2009 Ninth IEEE International Conference on Data Mining > 21 - 30

2009 Ninth IEEE International Conference on Data Mining (ICDM 2009)

Methods for learning decision rules are being successfully applied to many problem domains, especially where understanding and interpretation of the learned model is necessary. In many real life problems, we would like to predict multiple related (nominal or numeric) target attributes simultaneously. Methods for learning rules that predict multiple targets at once already exist, but are unfortunately...

chapter

A Local Scalable Distributed Expectation Maximization Algorithm for Large Peer-to-Peer Networks

K. Bhaduri, A.N. Srivastava

2009 Ninth IEEE International Conference on Data Mining > 31 - 40

2009 Ninth IEEE International Conference on Data Mining (ICDM 2009)

This paper describes a local and distributed expectation maximization algorithm for learning parameters of Gaussian mixture models (GMM) in large peer-to-peer (P2P) environments. The algorithm can be used for a variety of well-known data mining tasks in distributed environments such as clustering, anomaly detection, target tracking, and density estimation to name a few, necessary for many emerging...

chapter

Cross-Guided Clustering: Transfer of Relevant Supervision across Domains for Improved Clustering

I. Bhattacharya, S. Godbole, S. Joshi, A. Verma

2009 Ninth IEEE International Conference on Data Mining > 41 - 50

2009 Ninth IEEE International Conference on Data Mining (ICDM 2009)

Lack of supervision in clustering algorithms often leads to clusters that are not useful or interesting to human reviewers. We investigate if supervision can be automatically transferred to a clustering task in a target domain, by providing a relevant supervised partitioning of a dataset from a different source domain. The target clustering is made more meaningful for the human user by trading off...

chapter

Audio Classification of Bird Species: A Statistical Manifold Approach

F. Briggs, R. Raich, X.Z. Fern

2009 Ninth IEEE International Conference on Data Mining > 51 - 60

2009 Ninth IEEE International Conference on Data Mining (ICDM 2009)

Our goal is to automatically identify which species of bird is present in an audio recording using supervised learning. Devising effective algorithms for bird species classification is a preliminary step toward extracting useful ecological data from recordings collected in the field. We propose a probabilistic model for audio features within a short interval of time, then derive its Bayes risk-minimizing...

chapter

Finding Associations and Computing Similarity via Biased Pair Sampling

A. Campagna, R. Pagh

2009 Ninth IEEE International Conference on Data Mining > 61 - 70

2009 Ninth IEEE International Conference on Data Mining (ICDM 2009)

Sampling-based methods have previously been proposed for the problem of finding interesting associations in data, even for low-support items. While these methods do not guarantee precise results, they can be vastly more efficient than approaches that rely on exact counting. However, for many similarity measures no such methods have been known. In this paper we show how a wide variety of measures can...

chapter

Beyond Banditron: A Conservative and Efficient Reduction for Online Multiclass Prediction with Bandit Setting Model

Guangyun Chen, Gang Chen, Jianwen Zhang, Shuo Chen, et al.

2009 Ninth IEEE International Conference on Data Mining > 71 - 80

2009 Ninth IEEE International Conference on Data Mining (ICDM 2009)

In this paper, we consider a recently proposed supervised learning problem, called online multiclass prediction with bandit setting model. Aiming at learning from partial feedback on online classification results, i.e., "true" when the predicted label is right or "false" when the predicted label is wrong, this new kind of problem has aroused much interest among researchers due to its close relations...

chapter

Probabilistic Similarity Query on Dimension Incomplete Data

Wei Cheng, Xiaoming Jin, Jian-Tao Sun

2009 Ninth IEEE International Conference on Data Mining > 81 - 90

2009 Ninth IEEE International Conference on Data Mining (ICDM 2009)

Retrieving similar data has drawn many research efforts in the literature due to its importance in data mining, database and information retrieval. This problem is challenging when the data is incomplete. In previous research, data incompleteness refers to the fact that data values for some dimensions are unknown. However, in many practical applications (e.g., data collection by sensor network under...


Cover Art

Title Page i

Title Page iii

Copyright Page

Table of Contents

Message from the General Co-Chairs

Message from the Program Committee Co-Chairs

Program Committee

Organizing Committee

Steering Committee

ICDM 2009 Program

Explore/Exploit Schemes for Web Content Optimization
