Big Data, 2013 IEEE International Conference on

chapter

Copyright page

2013 IEEE International Conference on Big Data > 1

2013 IEEE International Conference on Big Data

chapter

Cover page

2013 IEEE International Conference on Big Data > 1

chapter

Re-projection of terabyte-sized images

Peter Bajcsy, Antoine Vandecreme, Mary Brady

2013 IEEE International Conference on Big Data > 1

This work addresses the problem of re-projecting a terabyte-sized 3D data set represented as a set of 2D Deep Zoom pyramids. In general, a re-projection for small 3D data sets is executed directly in RAM. However, RAM becomes a limiting factor for terabyte-sized 3D volumes formed by a stack of hundreds of megapixel to gigapixel 2D frames. We have benchmarked three methods to perform the re-projection...

chapter

Key usage patterns for Apache Hadoop in the enterprise

Amr Awadallah

2013 IEEE International Conference on Big Data > 1

Cloudera provides enterprises with the big data platform for next-generation data management and analytics. This new platform allows companies to perform more flexible analysis on more types of data and in greater volumes. Amr Awadallah, CTO/Founder at Cloudera, will cover the key underlying patterns for how Hadoop is transforming the way organizations manage and derive value from data.

chapter

Welcome message from the organizers

2013 IEEE International Conference on Big Data > 1 - 2

chapter

Organization

2013 IEEE International Conference on Big Data > 1 - 2

chapter

The Microsoft Academic Search challenges at KDD Cup 2013

Martine De Cock, Senjuti Basu Roy, Swapna Savvana, Vani Mandava, et al.

2013 IEEE International Conference on Big Data > 1 - 4

Microsoft Academic Search is a free search engine specific to scholarly material. It currently covers more than 50 million publications and over 19 million authors across a variety of domains. One of the main challenges in correctly indexing this material is author name ambiguity and the resulting noise in author profiles. KDD Cup 2013 invited participants to tackle this problem in 2 ways: (1) by...

chapter

Enterprise pre-sales forums: A preliminary study of metadata and content

Vinay Deolalikar

2013 IEEE International Conference on Big Data > 1 - 4

Asynchronous discussion forums are one of the artifacts of the internet age. They occur in a wide variety of applications from distance learning to technical support. Technical support forums have also proliferated in enterprises, and today form a salient feature of many technical interactions in large enterprises. Two interconnected example applications where such forums may be employed are the following:...

chapter

Author index

2013 IEEE International Conference on Big Data > 1 - 5

chapter

Optimizing a MapReduce module of preprocessing high-throughput DNA sequencing data

Wei-Chun Chung, Yu-Jung Chang, Chien-Chih Chen, Der-Tsai Lee, et al.

2013 IEEE International Conference on Big Data > 1 - 6

The MapReduce framework has become the de facto choice for big data analysis in a variety of applications. In the MapReduce programming model, computation is distributed to a cluster of computing nodes that run in parallel. The performance of a MapReduce application is thus affected by the system and middleware, the characteristics of the data, and the design and implementation of the algorithms. In this study, we focus...

chapter

Assessment of dimensionality reduction based on communication channel model; application to immersive information visualization

Mohammadreza Babaee, Mihai Datcu, Gerhard Rigoll

2013 IEEE International Conference on Big Data > 1 - 6

We are dealing with large-scale high-dimensional image data sets requiring new approaches for data mining where visualization plays the main role. Dimension reduction (DR) techniques are widely used to visualize high-dimensional data. However, the information loss due to reducing the number of dimensions is the drawback of DRs. In this paper, we introduce a novel metric to assess the quality of DRs...

chapter

Fast solution of load shedding problems via a sequence of linear programs

Harish S. Bhat, Garnet J. Vaz, Juan C. Meza

2013 IEEE International Conference on Big Data > 1 - 6

Given a power network consisting of nodes (generators/loads) and edges (lines), there exists a set of constraints that must be satisfied in order for the system to be operational. When one or more power lines are cut, the bus phases and load/generator power values may need to be altered in order to restore the system to operation. The load shedding problem is to find the smallest adjustment to the...

chapter

The Code rebalancing problem for a storage-flexible Data Center Network

Iryna Andriyanova, Alan Jule, Emina Soljanin

2013 IEEE International Conference on Big Data > 1 - 6

The paper considers the impact of changing code parameters on the network load for a given storage-flexible Data Center Network (DCN), i.e., a DCN in which the reliability and the storage volume can be modified during the storage life of the DCN data. Two regimes of the network load are considered: transition (during the migration process) and stationary (at the end of the migration process)...

chapter

Knowledge cubes — A proposal for scalable and semantically-guided management of Big Data

Amgad Madkour, Walid G. Aref, Saleh Basalamah

2013 IEEE International Conference on Big Data > 1 - 7

A Knowledge Cube, or cube for short, is an intelligent and adaptive database instance capable of storing, analyzing, and searching data. Each cube is established based on semantic aspects, e.g., (1) Topical, (2) Contextual, (3) Spatial, or (4) Temporal. A cube specializes in handling data that is only relevant to the cube's semantics. Knowledge cubes are inspired by two prime architectures: (1) Dataspaces...

chapter

Modeling and querying data in NoSQL databases

Karamjit Kaur, Rinkle Rani

2013 IEEE International Conference on Big Data > 1 - 7

Relational databases have been providing storage for several decades now. However, for today's interactive web and mobile applications, the importance of flexibility and scalability in the data model cannot be overstated. The term NoSQL broadly covers all non-relational databases that provide a schema-less and scalable model. NoSQL databases, which are also termed Internet-age databases, are currently being used...

chapter

Lung transplant outcome prediction using UNOS data

Ankit Agrawal, Reda Al-Bahrani, Mark J. Russo, Jaishankar Raman, et al.

2013 IEEE International Conference on Big Data > 1 - 8

We analyze lung transplant data from the United Network for Organ Sharing (UNOS) program with the aim of developing accurate risk prediction models for mortality within 1 year of lung transplant using data mining techniques. The data used in this study is de-identified and consists of 62 predictor attributes and the 1-year post-transplant survival outcome for patients who underwent lung transplant between...

chapter

A big data analytics framework for scientific data management

Sandro Fiore, Cosimo Palazzo, Alessandro D'Anca, Ian Foster, et al.

2013 IEEE International Conference on Big Data > 1 - 8

The Ophidia project is a research effort addressing big data analytics requirements, issues, and challenges for eScience. We present here the Ophidia analytics framework, which is responsible for atomically processing, transforming and manipulating array-based data. This framework provides a common way to run analytics tasks on big datasets across large clusters. The paper highlights the design...

chapter

Dynamic reduction of query result sets for interactive visualization

Leilani Battle, Michael Stonebraker, Remco Chang

2013 IEEE International Conference on Big Data > 1 - 8

Modern database management systems (DBMS) have been designed to efficiently store, manage and perform computations on massive amounts of data. In contrast, many existing visualization systems do not scale seamlessly from small data sets to enormous ones. We have designed a three-tiered visualization system called ScalaR to deal with this issue. ScalaR dynamically performs resolution reduction when...

chapter

Managing massive graphs in relational DBMS

Ruiwen Chen

2013 IEEE International Conference on Big Data > 1 - 8

Massive graphs emerge in many real-world applications. Practitioners often find that relational databases are inefficient for graph data management. In this paper, we investigate the efficiency issue by analyzing both I/O and CPU costs. First, we find the storage of a graph in a relational DBMS violates the locality principle: graph queries will always reference neighbors; however, the data locations of neighbors...

chapter

Robustness of emotion extraction from 20th century English books

Alberto Acerbi, Vasileios Lampos, R. Alexander Bentley

2013 IEEE International Conference on Big Data > 1 - 8

In this paper, we test the robustness of emotion extraction from English language books published in the 20th century. Our analysis is performed on a sample of the 8 million digitized books available in the Google Books Ngram corpus by applying three independent emotion detection tools: WordNet Affect, Linguistic Inquiry and Word Count, and a recently proposed ‘Hedonometer’ method. We also assess...
