2015 IEEE International Congress on Big Data

We study online compound decision problems in the context of sequential prediction of real valued sequences. In particular, we consider finite state (FS) predictors that are constructed based on the sequence history, whose length is quite large for applications involving big data. To mitigate over training problems, we define hierarchical equivalence classes and apply the exponentiated gradient (EG)...

chapter

A Parallel Distributed Weka Framework for Big Data Mining Using Spark

Aris-Kyriakos Koliopoulos, Paraskevas Yiapanis, Firat Tekiner, Goran Nenadic, more

2015 IEEE International Congress on Big Data > 9 - 16

2015 IEEE International Congress on Big Data (BigData Congress)

Effective Big Data Mining requires scalable and efficient solutions that are also accessible to users of all levels of expertise. Despite this, many current efforts to provide effective knowledge extraction via large-scale Big Data Mining tools focus more on performance than on use and tuning which are complex problems even for experts. Weka is a popular and comprehensive Data Mining workbench with...

chapter

Geometrical and Topological Modelling: A Fast Computation of Spatial 3D TLS Data Selections

Jose I. Rodrigues, Mauro Figueiredo, Ivo Silvestre, Cristina Veiga-Pires

2015 IEEE International Congress on Big Data > 17 - 24

2015 IEEE International Congress on Big Data (BigData Congress)

Underground caves and their specific structures are important for geomorphological studies. In this paper we present a new tool to identify and map speleothems by surveying cave chambers interiors. One of the research problems that we had to solve was that we were dealing with a great number of points that resulted from the Laser scan. The cave chamber was surveyed using Terrestrial Laser Scanning...

chapter

PaWI: Parallel Weighted Itemset Mining by Means of MapReduce

Elena Baralis, Luca Cagliero, Paolo Garza, Luigi Grimaudo

2015 IEEE International Congress on Big Data > 25 - 32

2015 IEEE International Congress on Big Data (BigData Congress)

Frequent item set mining is an exploratory data mining technique that has fruitfully been exploited to extract recurrent co-occurrences between data items. Since in many application contexts items are enriched with weights denoting their relative importance in the analyzed data, pushing item weights into the item set mining process, i.e., Mining weighted item sets rather than traditional item sets,...

chapter

Distributed SPARQL over Big RDF Data: A Comparative Analysis Using Presto and MapReduce

Mulugeta Mammo, Srividya K. Bansal

2015 IEEE International Congress on Big Data > 33 - 40

2015 IEEE International Congress on Big Data (BigData Congress)

The processing of large volumes of RDF data require an efficient storage and query processing engine that can scale well with the volume of data. The initial attempts to address this issue focused on optimizing native RDF stores as well as conventional relational databases management systems. But as the volume of RDF data grew to exponential proportions, the limitations of these systems became apparent...

chapter

A GPU Based SVM Method with Accelerated Kernel Matrix Calculation

Bo Yan, Yitian Ren, Zijiang Yang

2015 IEEE International Congress on Big Data > 41 - 46

2015 IEEE International Congress on Big Data (BigData Congress)

Support vector machine (SVM) is a popular classifier dealing with small-scale datasets. It has outstanding performance compared to other classifiers. However the execution time is extremely long when training Big Data. The Graphics Processing Unit (GPU) is a massively parallel device which performs very well as a co-processor. NVIDIA proposed a programming platform, CUDA, in 2006, which makes it much...

chapter

A Clustered Approach for Fast Computation of Betweenness Centrality in Social Networks

Paolo Suppa, Eugenio Zimeo

2015 IEEE International Congress on Big Data > 47 - 54

2015 IEEE International Congress on Big Data (BigData Congress)

In the last few years, the data generated by social networking systems have become interesting to analyze local and global social phenomena. A useful metric to identify influential people or opinion leaders is the between ness centrality index. The computation of this index is a very demanding task since its exact calculation exhibits O(nm) time complexity for unweighted graphs. This complexity has...

chapter

A Semantic Recommender for Micro-blog Users

Stefano Faralli, Giovanni Stilo, Paola Velardi

2015 IEEE International Congress on Big Data > 55 - 62

2015 IEEE International Congress on Big Data (BigData Congress)

In this paper we propose a Twitter recommender based on a semantic description of users' interests. To express interests we use friendship information, which is readily available in users' profiles, not only in Twitter but in the majority of Social Networks, thus presenting substantial advantage in terms of computational complexity with respect to methods based on content mining. To obtain a synthetic...

chapter

Incorporating Tie Strength in Robust Social Recommendation

Youliang Zhong, Jian Yang, Robertus Nugroho

2015 IEEE International Congress on Big Data > 63 - 70

2015 IEEE International Congress on Big Data (BigData Congress)

In this paper, we present a novel method in making recommendations by leveraging Tie Strength, an integrated social relationship measurement calculated from various user information gathered from social media. Moreover, the proposed method adopts Least Absolute Errors in factorization scheme to reduce the sensitivity to data outliers. We have conducted comprehensive experiments over the real datasets...

INFONA - science communication portal

2015 IEEE International Congress on Big Data

Cover Art

Title Page i

Title Page iii

Copyright Page

Table of Contents

Organizing Committee

Message from the General Chairs

Message from the Program Committee Chairs

Program Committee

External Reviewers

IEEE Computer Society Technical Committee on Services Computing

A Scalable Approach for Online Hierarchical Big Data Mining

A Parallel Distributed Weka Framework for Big Data Mining Using Spark

Geometrical and Topological Modelling: A Fast Computation of Spatial 3D TLS Data Selections

PaWI: Parallel Weighted Itemset Mining by Means of MapReduce

Distributed SPARQL over Big RDF Data: A Comparative Analysis Using Presto and MapReduce

A GPU Based SVM Method with Accelerated Kernel Matrix Calculation

A Clustered Approach for Fast Computation of Betweenness Centrality in Social Networks

A Semantic Recommender for Micro-blog Users

Incorporating Tie Strength in Robust Social Recommendation

Filter options

Publication date

Keywords

INFONA - science communication portal

2015 IEEE International Congress on Big Data $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2015 IEEE International Congress on Big Data