2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW)

book

2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW)

IEEE

chapter

Air quality assessment from social media and structured data: Pollutants and health impacts in urban planning

Xu Du, Onyeka Emebo, Aparna Varde, Niket Tandon, more

2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW) > 54 - 59

2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW)

This paper describes our work on mining pollutant data to assess air quality in urban areas. Notable aspects of this work are that we mine social media and structured data in a domain-specific context, incorporate commonsense knowledge in mining media opinions and focus on the urban planning domain in a multicity environment. The results of mining are useful for predictive analysis in urbanization...

chapter

Pathology text mining - on Norwegian prostate cancer reports

Anders Dahl, Atilla Ozkan, Hercules Dalianis

2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW) > 84 - 87

2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW)

Pathology reports are written by pathologists, skilled physicians, that know how to interpret disorders in various tissue samples from the human body. To obtain valuable statistics on outcome of disorders, as for example cancer and effect of treatment, statistics are collected. Therefore, cancer pathology reports interpreted and coded into databases at cancer registries. In Norway is this task carried...

chapter

IncReStore: Incremental computation of mapreduce workflows

Ahmed E. Khalifa, Iman Elghandour, Nagwa El-Makky

2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW) > 39 - 46

2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW)

Many applications in various industrial and research areas analyze large continuously evolving data. Big data analytics platforms such as MapReduce focus on distributed batch processing, and therefore, a query needs to be re-executed every time its input data evolve. In this paper, we present IncReStore, a system that incrementally computes queries on fast growing datasets by materializing query outputs...

chapter

Data mining for better healthcare: A path towards automated data analysis?

Tania Cerquitelli, Elena Baralis, Lia Morra, Silvia Chiusano

2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW) > 60 - 63

2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW)

In today's world, large volumes of medical data are being continuously generated, but their value is severely undermined by our inability to translate them into knowledge and, ultimately, actions. Data mining techniques allow the extraction of previously unknown interesting patterns from large datasets, but their complexity limits their practical diffusion. Data-driven analysis is a multi-step process,...

chapter

Mining internet media for monitoring changes of public emotions about infectious diseases

Sungwoon Choi, Jangho Lee, Sangheon Pack, Yoon-Seok Chang, more

2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW) > 68 - 70

2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW)

The Internet encompasses websites, email, social media, and Internet-based television. Given the widespread use of networked computers and mobile devices, it has become possible to monitor the behavior of Internet users by examining their access logs and queries. Based on large-scale web and text mining of Internet media articles and associated user comments, we propose a framework to rapidly monitor...

chapter

Playing LEGO with JSON: Probabilistic joins over attribute-value fragments

Manuel Hoffmann, Evica Milchevski, Sebastian Michel

2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW) > 173 - 180

2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW)

Information about an entity can hardly be assumed to be given in one single document, created in a single instance of time. Rather, it is reasonable to assume that information is spread over multiple documents and created/enriched over time—for instance through crowdsourcing facts or mined from social network streams, one after the other. In this work, we consider the problem of assembling entity-centric...

chapter

Exploiting SIMD for complex numerical predicates

Dongxiao Song, Shimin Chen

2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW) > 143 - 149

2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW)

We study the use of SIMD instructions to support complex conjunctive numerical predicates. Compared to previous studies, we aim to model more realistic use scenarios, where different data types, different comparison operations, and different predicate types can be mixed in a single filtering clause. Moreover, the evaluation of the predicates on a set of columns can take advantage of multiple processor...

chapter

A comparison of Flashcache with IQ-Twemcached

Yazeed Alabdulkarim, Marwan Almaymoni, Ziwen Cao, Shahram Ghandeharizadeh, more

2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW) > 20 - 26

2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW)

Person-to-person cloud service providers such as Facebook use Host-side (HsC) and Application-side (AsC) caches to enhance performance. Using Facebook's Flashcache as the representative of HsC and IQ-Twemcached as the representative of AsC, this study quantifies their tradeoffs using both a read-heavy and a write-heavy workload. Obtained results show Flashcache provides significant benefit for I/O...

chapter

High variety cloud databases

Shrainik Jain, Dominik Moritz, Bill Howe

2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW) > 12 - 19

2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW)

Big Data is colloquially described in terms of the three Vs: Volume, Velocity, and Variety. Volume and velocity receive a disproportionate amount of research attention, however, variety is frequently cited by practitioners as the Big Data problem that “keeps them up at night” — the problem that resists direct attacks in terms of new algorithms, systems, and approaches. We find that the cloud-based...

chapter

About CP

2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW) > 1

2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW)

Provides general information on various non-technical conference events.

chapter

Workshops list

2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW) > 1

2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW)

Provides a schedule of conference events and a listing of which papers were presented in each session.

chapter

Hub page

2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW) > 1

2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW)

Presents the proceedings page that links various sections of the overall electronic record.

chapter

Author index

2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW) > 1 - 4

2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW)

Presents an index of the authors whose articles are published in the conference proceedings record.

chapter

Welcome

2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW) > 1

2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW)

Presents the introductory welcome message from the conference proceedings. May include the conference officers' congratulations to all involved with the conference event and publication of the proceedings record.

chapter

HyPer beyond software: Exploiting modern hardware for main-memory database systems

Alfons Kemper

2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW) > 130

2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW)

In this presentation, we survey the use of advanced hard-ware features for optimizing main-memory database systems in the context of our HyPer project. The access behavior of database objects from simultaneous OLTP transactions is monitored using the virtual memory management component in order to compact the database into hot and cold partitions. The cold partitions are stored in compressed data...

chapter

[7th Data Engineering Meets the Semantic Web - Front matter]

2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW) > 94 - 96

2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW)

Conference proceedings front matter may contain various advertisements, welcome messages, committee or program information, and other miscellaneous conference information. This may in some cases also include the cover art, table of contents, copyright statements, title-page or half title-pages, blank pages, venue maps or other general information relating to the conference that was part of the original...

chapter

From user graph to Topics Graph: Towards twitter followee recommendation based on knowledge graphs

Danae Pla Karidi

2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW) > 121 - 123

2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW)

Twitter is a rapidly growing microblogging platform that allows its users to send and read short messages, called tweets. Because of the fact that a user's timeline consists of the latest tweets of their followees (users that they are following), followee recommendation is a problem of significant importance. In this work we propose a followee recommendation approach, which takes advantage of the...

chapter

Including hierarchical navigation in a Graph Database query language with an OBDA approach

Nicolle Chaves Cysneiros, Ana Carolina Salgado

2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW) > 109 - 114

2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW)

Distributed and replicated systems, such as Big Data applications, deal with conflicting and duplicated data. Therefore, it is needed a database with flexible data model to materialize the data and an end-user oriented interface in order to allow queries in heterogeneous data. Using OBDA approach in a Graph Database is an attempt to solve these problems. However, new problems arise regarding hierarchical...

INFONA - science communication portal

2016 IEEE 32nd International Conference on Data Engineering Workshops (ICDEW)