2017 IEEE International Conference on Data Mining (ICDM)

chapter

Spatio-Temporal Neural Networks for Space-Time Series Forecasting and Relations Discovery

Ali Ziat, Edouard Delasalles, Ludovic Denoyer, Patrick Gallinari

2017 IEEE International Conference on Data Mining (ICDM) > 705 - 714

We introduce a dynamical spatio-temporal model formalized as a recurrent neural network for forecasting time series of spatial processes, i.e. series of observations sharing temporal and spatial dependencies. The model learns these dependencies through a structured latent dynamical component, while a decoder predicts the observations from the latent representations. We consider several variants of...

chapter

Differentially Private Mixture of Generative Neural Networks

Gergely Acs, Luca Melis, Claude Castelluccia, Emiliano De Cristofaro

2017 IEEE International Conference on Data Mining (ICDM) > 715 - 720

2017 IEEE International Conference on Data Mining (ICDM)

Generative models are used in an increasing number of applications that rely on large amounts of contextually rich information about individuals. Owing to possible privacy violations, however, publishing or sharing generative models is not always viable. In this paper, we introduce a novel solution for privately releasing generative models and entire high-dimensional datasets produced by these models...

chapter

A Probabilistic Geographical Aspect-Opinion Model for Geo-Tagged Microblogs

Aman Ahuja, Wei Wei, Wei Lu, Kathleen M. Carley, more

2017 IEEE International Conference on Data Mining (ICDM) > 721 - 726

2017 IEEE International Conference on Data Mining (ICDM)

Due to the rapid increase in the number of users owning location-based devices, there is a considerable amount of geo-tagged data available on social media websites, such as Twitter and Facebook. This geo-tagged data can be useful in a variety of ways to extract location-specific information, as well as to comprehend the variation of information across different geographical regions. A lot of techniques...

chapter

Aspect Sentiment Model for Micro Reviews

Reinald Kim Amplayo, Seung-won Hwang

2017 IEEE International Conference on Data Mining (ICDM) > 727 - 732

2017 IEEE International Conference on Data Mining (ICDM)

This paper aims at an aspect sentiment model for aspect-based sentiment analysis (ABSA) focused on micro reviews. This task is important in order to understand short reviews majority of the users write, while existing topic models are targeted for expert-level long reviews with sufficient co-occurrence patterns to observe. Current methods on aggregating micro reviews using metadata information may...

chapter

Mining the Demographics of Political Sentiment from Twitter Using Learning from Label Proportions

Ehsan Mohammady Ardehaly, Aron Culotta

2017 IEEE International Conference on Data Mining (ICDM) > 733 - 738

2017 IEEE International Conference on Data Mining (ICDM)

Opinion mining and demographic attribute inference have many applications in social science. In this paper, we propose models to infer daily joint probabilities of multiple latent attributes from Twitter data, such as political sentiment and demographic attributes. Since it is costly and time-consuming to annotate data for traditional supervised classification, we instead propose scalable Learning...

chapter

Hierarchical Multinomial-Dirichlet Model for the Estimation of Conditional Probability Tables

Laura Azzimonti, Giorgio Corani, Marco Zaffalon

2017 IEEE International Conference on Data Mining (ICDM) > 739 - 744

2017 IEEE International Conference on Data Mining (ICDM)

We present a novel approach for estimating conditional probability tables, based on a joint, rather than independent, estimate of the conditional distributions belonging to the same table. We derive exact analytical expressions for the estimators and we analyse their properties both analytically and via simulation. We then apply this method to the estimation of parameters in a Bayesian network. Given...

chapter

Multi-party Sparse Discriminant Learning

Jiang Bian, Haoyi Xiong, Wei Cheng, Wenqing Hu, more

2017 IEEE International Conference on Data Mining (ICDM) > 745 - 750

2017 IEEE International Conference on Data Mining (ICDM)

Sparse Discriminant Analysis (SDA) has been widely used to improve the performance of classical Fisher's Linear Discriminant Analysis in supervised metric learning, feature selection and classification. With the increasing needs of distributed data collection, storage and processing, enabling the Sparse Discriminant Learning to embrace the Multi-Party distributed computing environments becomes an...

chapter

MDL for Causal Inference on Discrete Data

Kailash Budhathoki, Jilles Vreeken

2017 IEEE International Conference on Data Mining (ICDM) > 751 - 756

2017 IEEE International Conference on Data Mining (ICDM)

The algorithmic Markov condition states that the most likely causal direction between two random variables X and Y can be identified as the direction with the lowest Kolmogorov complexity. This notion is very powerful as it can detect any causal dependency that can be explained by a physical process. However, due to the halting problem, it is also not computable. In this paper we propose an computable...

chapter

Efficient Mining of Subsample-Stable Graph Patterns

Aleksey Buzmakov, Sergei O. Kuznetsov, Amedeo Napoli

2017 IEEE International Conference on Data Mining (ICDM) > 757 - 762

2017 IEEE International Conference on Data Mining (ICDM)

A scalable method for mining graph patterns stable under subsampling is proposed. The existing subsample stability and robustness measures are not antimonotonic according to definitions known so far. We study a broader notion of antimonotonicity for graph patterns, so that measures of subsample stability become antimonotonic. Then we propose gSOFIA for mining the most subsample-stable graph patterns...

chapter

Multi-level Feedback Web Links Selection Problem: Learning and Optimization

Kechao Cai, Kun Chen, Longbo Huang, John C.S. Lui

2017 IEEE International Conference on Data Mining (ICDM) > 763 - 768

2017 IEEE International Conference on Data Mining (ICDM)

Selecting the right web links for a website is important because appropriate links not only can provide high attractiveness but can also increase the website's revenue. In this work, we first show that web links have an intrinsic multi-level feedback structure. For example, consider a 2-level feedback web link: the 1st level feedback provides the Click-Through Rate (CTR) and the 2nd level feedback...

chapter

HitFraud: A Broad Learning Approach for Collective Fraud Detection in Heterogeneous Information Networks

Bokai Cao, Mia Mao, Siim Viidu, Philip S. Yu

2017 IEEE International Conference on Data Mining (ICDM) > 769 - 774

2017 IEEE International Conference on Data Mining (ICDM)

On electronic game platforms, different payment transactions have different levels of risk. Risk is generally higher for digital goods in e-commerce. However, it differs based on product and its popularity, the offer type (packaged game, virtual currency to a game or subscription service), storefront and geography. Existing fraud policies and models make decisions independently for each transaction...

chapter

Domain Adaptation for Online ECG Monitoring

Diego Carrera, Beatrice Rossi, Pasqualina Fragneto, Giacomo Boracchi

2017 IEEE International Conference on Data Mining (ICDM) > 775 - 780

2017 IEEE International Conference on Data Mining (ICDM)

Successful ECG monitoring algorithms often rely on learned models to describe the heartbeats morphology. Unfortunately, when the heart rate increases the heartbeats get transformed, and a model that can properly describe the heartbeats of a specific user in resting conditions might not be appropriate for monitoring the same user during everyday activities. We model heartbeats by dictionaries yielding...

chapter

EC3: Combining Clustering and Classification for Ensemble Learning

Tanmoy Chakraborty

2017 IEEE International Conference on Data Mining (ICDM) > 781 - 786

2017 IEEE International Conference on Data Mining (ICDM)

We propose EC3, a novel algorithm that merges classification and clustering together in order to support both binary and multi-class classification. EC3 is based on a principled combination of multiple classification and multiple clustering methods using a convex optimization function. We additionally propose iEC3, a variant of EC3 that handles imbalanced training data. We perform an extensive experimental...

chapter

Boosting Deep Learning Risk Prediction with Generative Adversarial Networks for Electronic Health Records

Zhengping Che, Yu Cheng, Shuangfei Zhai, Zhaonan Sun, more

2017 IEEE International Conference on Data Mining (ICDM) > 787 - 792

2017 IEEE International Conference on Data Mining (ICDM)

The rapid growth of Electronic Health Records (EHRs), as well as the accompanied opportunities in Data-Driven Healthcare (DDH), has been attracting widespread interests and attentions. Recent progress in the design and applications of deep learning methods has shown promising results and is forcing massive changes in healthcare academia and industry, but most of these methods rely on massive labeled...

chapter

Clustering by Shift

Morteza Haghir Chehreghani

2017 IEEE International Conference on Data Mining (ICDM) > 793 - 798

2017 IEEE International Conference on Data Mining (ICDM)

In order to yield a more balanced partitioning, we investigate the use of additive regularizations for the Min Cut cost function, instead of normalization. In particular, we study the case where the regularization term is the sum of the squared size of the clusters, which then leads to shifting (adaptively) the pairwise similarities. We study the connection of such a model with Correlation Clustering...

chapter

Efficient Computation of Pairwise Minimax Distance Measures

Morteza Haghir Chehreghani

2017 IEEE International Conference on Data Mining (ICDM) > 799 - 804

2017 IEEE International Conference on Data Mining (ICDM)

We study efficient computation of Minimax distances measures, which enable to capture the correct structures via taking the transitive relations into account. We analyze in detail two settings, the dense graphs and the sparse graphs. In particular, we show that an adapted variant of the Kruskal’s algorithm is the most efficient approach for computing pairwise Minimax distances. However, for dense...

chapter

Warehouse Site Selection for Online Retailers in Inter-Connected Warehouse Networks

Can Chen, Junming Liu, Qiao Li, Yijun Wang, more

2017 IEEE International Conference on Data Mining (ICDM) > 805 - 810

2017 IEEE International Conference on Data Mining (ICDM)

Supply chain management aims at delivering goods in the shortest time at the lowest possible price while ensuring the best possible quality and is now vital to the success of the online retail business. Executing effective warehouse site selection has been one of the key challenges in the development of a successful supply chain system. While some effective strategies for warehouse site selection...

chapter

Learning Multiple Similarities of Users and Items in Recommender Systems

Huiyuan Chen, Jing Li

2017 IEEE International Conference on Data Mining (ICDM) > 811 - 816

2017 IEEE International Conference on Data Mining (ICDM)

In addition to the sparse user-item (U-I) matrix, an increasing number of current recommender systems seek to improve performance by exploiting extra heterogeneous data sources (e.g., online social networks). Such rich side sources can provide very useful information about users' personal behaviors and items' properties, therefore can significantly benefit recommender systems. Most existing work can...

chapter

Learning to Fuse Music Genres with Generative Adversarial Dual Learning

Zhiqian Chen, Chih-Wei Wu, Yen-Cheng Lu, Alexander Lerch, more

2017 IEEE International Conference on Data Mining (ICDM) > 817 - 822

2017 IEEE International Conference on Data Mining (ICDM)

FusionGAN is a novel genre fusion framework for music generation that integrates the strengths of generative adversarial networks and dual learning. In particular, the proposed method offers a dual learning extension that can effectively integrate the styles of the given domains. To efficiently quantify the difference among diverse domains and avoid the vanishing gradient issue, FusionGAN provides...

chapter

Data Prefetching for Large Tiered Storage Systems

Giovanni Cherubini, Yusik Kim, Mark Lantz, Vinodh Venkatesan

2017 IEEE International Conference on Data Mining (ICDM) > 823 - 828

2017 IEEE International Conference on Data Mining (ICDM)

In multi-tier storage systems with large amounts of data, most of the data is stored on inexpensive slower tiers such as cloud or tape to achieve cost savings. This also implies that retrieving the data from the slower storage tiers incurs high latency. Therefore, it would be beneficial to proactively prefetch data from slower tiers to faster tiers by predicting future data accesses. State-of-the-art...

INFONA - science communication portal

2017 IEEE International Conference on Data Mining (ICDM)

Spatio-Temporal Neural Networks for Space-Time Series Forecasting and Relations Discovery

Differentially Private Mixture of Generative Neural Networks

A Probabilistic Geographical Aspect-Opinion Model for Geo-Tagged Microblogs

Aspect Sentiment Model for Micro Reviews

Mining the Demographics of Political Sentiment from Twitter Using Learning from Label Proportions

Hierarchical Multinomial-Dirichlet Model for the Estimation of Conditional Probability Tables

Multi-party Sparse Discriminant Learning

MDL for Causal Inference on Discrete Data

Efficient Mining of Subsample-Stable Graph Patterns

Multi-level Feedback Web Links Selection Problem: Learning and Optimization

HitFraud: A Broad Learning Approach for Collective Fraud Detection in Heterogeneous Information Networks

Domain Adaptation for Online ECG Monitoring

EC3: Combining Clustering and Classification for Ensemble Learning

Boosting Deep Learning Risk Prediction with Generative Adversarial Networks for Electronic Health Records

Clustering by Shift

Efficient Computation of Pairwise Minimax Distance Measures

Warehouse Site Selection for Online Retailers in Inter-Connected Warehouse Networks

Learning Multiple Similarities of Users and Items in Recommender Systems

Learning to Fuse Music Genres with Generative Adversarial Dual Learning

Data Prefetching for Large Tiered Storage Systems

Filter options

Publication date

Keywords

INFONA - science communication portal

2017 IEEE International Conference on Data Mining (ICDM) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2017 IEEE International Conference on Data Mining (ICDM)