2015 IEEE International Conference on Big Data (Big Data)

chapter

How to make money from your information and keep your privacy

Divya Rao, Wee Keong Ng

2015 IEEE International Conference on Big Data (Big Data) > 2859 - 2861

Today big data is synonymous with every business and organization, so much so that data brokers have made a business of trading this big data like any other commodity. In turn, the buyers of this big data make massive profits. The only one who loses out on profits and his privacy is the internet user — the generator and owner of this big data. Our work looks at allowing the user to monetize on his...

chapter

City users' classification with mobile phone data

Lorenzo Gabrielli, Barbara Furletti, Roberto Trasarti, Fosca Giannotti, more

2015 IEEE International Conference on Big Data (Big Data) > 1007 - 1012

2015 IEEE International Conference on Big Data (Big Data)

Nowadays mobile phone data are an actual proxy for studying the users' social life and urban dynamics. In this paper we present the Sociometer, and analytical framework aimed at classifying mobile phone users into behavioral categories by means of their call habits. The analytical process starts from spatio-temporal profiles, learns the different behaviors, and returns annotated profiles. After the...

chapter

A novel initialization method for particle swarm optimization-based FCM in big biomedical data

Chanpaul J. Wang, Hua Fang, Chonggang Wang, Mahmoud Daneshmand, more

2015 IEEE International Conference on Big Data (Big Data) > 2942 - 2944

2015 IEEE International Conference on Big Data (Big Data)

Based on empirical studies, the feature of random initialization in Particle Swarm Optimization (PSO) based Fuzzy c-means (FCM) methods affects the computational performance especially in big data. As the data points in high-density areas are more likely near the cluster centroids, we design a new algorithm to guide the initialization according to the data density patterns. Our algorithm is initialized...

chapter

Data veracity estimation with ensembling truth discovery methods

Laure Berti-Equille

2015 IEEE International Conference on Big Data (Big Data) > 2628 - 2636

2015 IEEE International Conference on Big Data (Big Data)

Estimation of data veracity is recognized as one of the grand challenges of big data. Typically, the goal of truth discovery is to determine the veracity of multi-source, conflicting data and return, as outputs, a veracity label and a confidence score for each data value, along with the trustworthiness score of each source claiming it. Although a plethora of methods has been proposed, it is unlikely...

chapter

Performance of graph reconstruction method for large-scale web graph analysis

Ryota Takei, Ayahiko Niimi

2015 IEEE International Conference on Big Data (Big Data) > 2852 - 2854

2015 IEEE International Conference on Big Data (Big Data)

We have already proposed a graph analysis method that could shorten the analysis time by reconstructing a web graph. In our proposed method, a web graph is reconstructed for parallel distributed processing of possible graphs by clustering a web graph and reconstructing the web graph for Compression Graph and Cluster Graphs. Compression Graph represents the relationship between clusters, whereas Cluster...

chapter

Edge importance identification for energy efficient graph processing

S M Faisal, G. Tziantzioulis, A. M. Gok, N. Hardavellas, more

2015 IEEE International Conference on Big Data (Big Data) > 347 - 354

2015 IEEE International Conference on Big Data (Big Data)

Modern graphs are large, often containing billions of nodes and edges that demand huge amount of processing for analysis purposes. The algorithms processing these graphs often run for long time and consume substantial amount of energy. However, not all edges in the graphs are equally important. Some edges play critical role in maintaining the community and other interesting structures in the graph,...

chapter

Directional decision lists

Marc Goessling, Shan Kang

2015 IEEE International Conference on Big Data (Big Data) > 2762 - 2766

2015 IEEE International Conference on Big Data (Big Data)

In this paper we introduce a novel family of decision lists consisting of highly interpretable models which can be learned efficiently in a greedy manner. The defining property is that all rules are oriented in the same direction. Particular examples of this family are decision lists with monotonically decreasing (or increasing) probabilities. On simulated data we empirically confirm that the proposed...

chapter

Probabilistic km-anonymity efficient anonymization of large set-valued datasets

Gergely Acs, Jagdish Prasad Achara, Claude Castelluccia

2015 IEEE International Conference on Big Data (Big Data) > 1164 - 1173

2015 IEEE International Conference on Big Data (Big Data)

Set-valued dataset contains different types of items/values per individual, for example, visited locations, purchased goods, watched movies, or search queries. As it is relatively easy to re-identify individuals in such datasets, their release poses significant privacy threats. Hence, organizations aiming to share such datasets must adhere to personal data regulations. In order to get rid of these...

chapter

BigFUN: A performance study of big data management system functionality

Pouria Pirzadeh, Michael J. Carey, Till Westmann

2015 IEEE International Conference on Big Data (Big Data) > 507 - 514

2015 IEEE International Conference on Big Data (Big Data)

In this paper, we report on an evaluation of four representative Big Data management systems (BDMSs): Mon-goDB, Hive, AsterixDB, and a commercial parallel shared-nothing relational database system. In terms of features, all offer to store and manage large volumes of data, and all provide some degree of query processing capabilities on top of such data. Our evaluation is based on a micro-benchmark...

INFONA - science communication portal

2015 IEEE International Conference on Big Data (Big Data)

How to make money from your information and keep your privacy

City users' classification with mobile phone data

A novel initialization method for particle swarm optimization-based FCM in big biomedical data

Data veracity estimation with ensembling truth discovery methods

Performance of graph reconstruction method for large-scale web graph analysis

Edge importance identification for energy efficient graph processing

Directional decision lists

Probabilistic km-anonymity efficient anonymization of large set-valued datasets

BigFUN: A performance study of big data management system functionality

Filter options

Publication date

Keywords

INFONA - science communication portal

2015 IEEE International Conference on Big Data (Big Data) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2015 IEEE International Conference on Big Data (Big Data)