High Performance Computing (HiPC), 2013 20th International Conference on

chapter

Front cover

20th Annual International Conference on High Performance Computing > c1

2013 20th International Conference on High Performance Computing (HiPC)

chapter

Message from program chair

20th Annual International Conference on High Performance Computing > 1 - 4

2013 20th International Conference on High Performance Computing (HiPC)

chapter

Analyzing the performance impact of authorization constraints and optimizing the authorization methods for workflows

Nadeem Chaudhary, Ligang He

20th Annual International Conference on High Performance Computing > 1 - 9

2013 20th International Conference on High Performance Computing (HiPC)

Many workflow management systems have been developed to enhance the performance of workflow executions. The authorization policies deployed in the system may restrict the task executions. The common authorization constraints include role constraints, Separation of Duty (SoD), Binding of Duty (BoD) and temporal constraints. This paper presents the methods to check the feasibility of these constraints,...

chapter

Program

20th Annual International Conference on High Performance Computing > 1 - 10

2013 20th International Conference on High Performance Computing (HiPC)

chapter

Author index

20th Annual International Conference on High Performance Computing > 1 - 13

2013 20th International Conference on High Performance Computing (HiPC)

chapter

iFlatLFS: Performance optimization for accessing massive small files

Songling Fu, Chenlin Huang, Ligang He, Nadeem Chaudhary, more

20th Annual International Conference on High Performance Computing > 10 - 19

2013 20th International Conference on High Performance Computing (HiPC)

The processing of massive small files is a challenge in the design of distributed file systems. Currently, the combined-block-storage approach is prevalent. However, the approach employs traditional file systems like ExtFS and may cause inefficiency for random access to small files. This paper focuses on optimizing the performance of data servers in accessing massive small files. We present a Flat...

chapter

Adding data parallelism to streaming pipelines for throughput optimization

Peng Li, Kunal Agrawal, Jeremy Buhler, Roger D. Chamberlain

20th Annual International Conference on High Performance Computing > 20 - 29

2013 20th International Conference on High Performance Computing (HiPC)

The streaming model is a popular model for writing high-throughput parallel applications. A streaming application is represented by a graph of computation stages that communicate with each other via FIFO channels. In this paper, we consider the problem of mapping streaming pipelines — streaming applications where the graph is a linear chain — onto a set of computing resources in order to maximize...

chapter

Algorithms for the relaxed Multiple-Organization Multiple-Machine Scheduling Problem

Anirudh Chakravorty, Neelima Gupta, Neha Lawaria, Pankaj Kumar, more

20th Annual International Conference on High Performance Computing > 30 - 38

2013 20th International Conference on High Performance Computing (HiPC)

In this paper we present the generalization of the relaxed Multi- Organization Scheduling Problem (α MOSP). In our generalized problem, we are given a set of organizations; each organization is comprised of a set of machines. We are interested in minimizing the global makespan while allowing a constant factor, α_O, degradation in the local objective of each organization and a constant factor, α_M, degradation...

chapter

Loop level speculation in a task based programming model

Rahulkumar Gayatri, Rosa. M Badia, Eduard Aygaude

20th Annual International Conference on High Performance Computing > 39 - 48

2013 20th International Conference on High Performance Computing (HiPC)

Uncountable loops (such as while loops in C) and if-conditions are some of the most common constructs in programming. While-loops are widely used to determine the convergence in linear algebra algorithms or goal finding problems from graph algorithms, to name a few. In general while-loops are used whenever the loop iteration space, the number of iterations a loop executes is unknown. Usually in while-loops,...

chapter

LiPS: A cost-efficient data and task co-scheduler for MapReduce

Moussa Ehsan, Yao Chen, Hui Kang, Radu Sion, more

20th Annual International Conference on High Performance Computing > 49 - 58

2013 20th International Conference on High Performance Computing (HiPC)

We introduce LiPS, a new cost-efficient data and task co-scheduler for MapReduce in a cloud environment. By using linear programming to simultaneously co-schedule data and tasks, LiPS helps to achieve minimized dollar cost globally. We evaluated LiPS both analytically and on Amazon EC2 in order to measure actual dollar charges. The results were significant; LiPS saved 62–81% of the dollar costs when...

chapter

Share-o-meter: An empirical analysis of KSM based memory sharing in virtualized systems

Shashank Rachamalla, Debadatta Mishra, Purushottam Kulkarni

20th Annual International Conference on High Performance Computing > 59 - 68

2013 20th International Conference on High Performance Computing (HiPC)

Content based memory sharing in virtualized environments has proven to be a useful technique for over-commitment based placement of virtual machines. Kernel-based Virtual Machine (KVM) on Linux uses Kernel SamePage Merging (KSM) to identify and exploit sharing opportunities. In this paper, we present an analysis of page sharing across virtual machines by comparing page sharing achieved by KSM to total...

chapter

Minimization of cloud task execution length with workload prediction errors

Sheng Di, Cho-Li Wang

20th Annual International Conference on High Performance Computing > 69 - 78

2013 20th International Conference on High Performance Computing (HiPC)

In cloud systems, it is non-trivial to optimize task's execution performance under user's affordable budget, especially with possible workload prediction errors. Based on an optimal algorithm that can minimize cloud task's execution length with predicted workload and budget, we theoretically derive the upper bound of the task execution length by taking into account the possible workload prediction...

chapter

Speculative dynamic vectorization to assist static vectorization in a HW/SW co-designed environment

Rakesh Kumar, Alejandro Martinez, Antonio Gonzalez

20th Annual International Conference on High Performance Computing > 79 - 88

2013 20th International Conference on High Performance Computing (HiPC)

Compiler based static vectorization is used widely to extract data level parallelism from computation intensive applications. Static vectorization is very effective in vectorizing traditional array based applications. However, compilers inability to reorder ambiguous memory references severely limits vectorization opportunities, especially in pointer rich applications. HW/SW co-designed processors...

chapter

A self-tuning system based on application Profiling and Performance Analysis for optimizing Hadoop MapReduce cluster configuration

Dili Wu, Aniruddha Gokhale

20th Annual International Conference on High Performance Computing > 89 - 98

2013 20th International Conference on High Performance Computing (HiPC)

One of the most widely used frameworks for programming MapReduce-based applications is Apache Hadoop. Despite its popularity, however, application developers face numerous challenges in using the Hadoop framework, which stem from them having to effectively manage the resources of a MapReduce cluster, and configuring the framework in a way that will optimize the performance and reliability of MapReduce...

chapter

Web-scale entity annotation using MapReduce

Shashank Gupta, Varun Chandramouli, Soumen Chakrabarti

20th Annual International Conference on High Performance Computing > 99 - 108

2013 20th International Conference on High Performance Computing (HiPC)

Cloud computing frameworks such as map-reduce (MR) are widely used in the context of log mining, inverted indexing, and scientific data analysis. Here we address the new and important task of annotating token spans in billions of Web pages that mention named entities from a large entity catalog such as Wikipedia or Freebase. The key step in annotation is disambiguation: given the token Albert, use...

chapter

X10-based distributed and parallel betweenness centrality and its application to social analytics

Charuwat Houngkaew, Toyotaro Suzumura

20th Annual International Conference on High Performance Computing > 109 - 118

2013 20th International Conference on High Performance Computing (HiPC)

Betweenness centrality is a measure that determines the relative importance of a vertex (or an edge) within a graph based on shortest paths. Recently, large-scale graphs have emerged in many different domains, as social networks, road networks, protein interaction networks, etc., and they are too large to fit into the memory of a single SMP. The algorithm proposed by Edmonds et al. [1] is capable...

chapter

Scheduling associative reductions with homogeneous costs when overlapping communications and computations

Louis-Claude Canon

20th Annual International Conference on High Performance Computing > 119 - 128

2013 20th International Conference on High Performance Computing (HiPC)

Reduction is a core operation in parallel computing that combines distributed elements into a single result. Optimizing its cost may greatly reduce the application execution time, notably in MPI and MapReduce computations. In this paper, we propose an algorithm for scheduling associative reductions. We focus on the case where communications and computations can be overlapped to fully exploit resources...

chapter

A Branch-and-Bound algorithm using multiple GPU-based LP solvers

Xavier Meyer, Bastien Chopard, Paul Albuquerque

20th Annual International Conference on High Performance Computing > 129 - 138

2013 20th International Conference on High Performance Computing (HiPC)

The Branch-and-Bound (B&B) method is a well-known optimization algorithm for solving integer linear programming (ILP) models in the field of operations research. It is part of software often employed by businesses for finding solutions to problems such as airline scheduling problems. It operates according to a divide-and-conquer principle by building a tree-like structure with nodes that represent...

chapter

Accelerating Strassen-Winograd's matrix multiplication algorithm on GPUs

Pai-Wei Lai, Humayun Arafat, Venmugil Elango, P. Sadayappan

20th Annual International Conference on High Performance Computing > 139 - 148

2013 20th International Conference on High Performance Computing (HiPC)

In this paper, we report on the development of an efficient GPU implementation of the Strassen-Winograd matrix multiplication algorithm for matrices of arbitrary sizes. We utilize multi-kernel streaming to exploit concurrency across sub-matrix operations in addition to intra-operation parallelism. We evaluate the performance of the implementation in comparison with CUBLAS-5.0 on Fermi and Kepler GPUs...

chapter

Accelerating inclusion-based pointer analysis on heterogeneous CPU-GPU systems

Yu Su, Ding Ye, Jingling Xue

20th Annual International Conference on High Performance Computing > 149 - 158

2013 20th International Conference on High Performance Computing (HiPC)

This paper describes the first implementation of Andersen's inclusion-based pointer analysis for C programs on a heterogeneous CPU-GPU system, where both its CPU and GPU cores are used. As an important graph algorithm, Andersen's analysis is difficult to parallelise because it makes extensive modifications to the structure of the underlying graph, in a way that is highly input-dependent and statically...

INFONA - science communication portal

20th Annual International Conference on High Performance Computing

Front cover

Message from program chair

Analyzing the performance impact of authorization constraints and optimizing the authorization methods for workflows

Program

Author index

iFlatLFS: Performance optimization for accessing massive small files

Adding data parallelism to streaming pipelines for throughput optimization

Algorithms for the relaxed Multiple-Organization Multiple-Machine Scheduling Problem

Loop level speculation in a task based programming model

LiPS: A cost-efficient data and task co-scheduler for MapReduce

Share-o-meter: An empirical analysis of KSM based memory sharing in virtualized systems

Minimization of cloud task execution length with workload prediction errors

Speculative dynamic vectorization to assist static vectorization in a HW/SW co-designed environment

A self-tuning system based on application Profiling and Performance Analysis for optimizing Hadoop MapReduce cluster configuration

Web-scale entity annotation using MapReduce

X10-based distributed and parallel betweenness centrality and its application to social analytics

Scheduling associative reductions with homogeneous costs when overlapping communications and computations

A Branch-and-Bound algorithm using multiple GPU-based LP solvers

Accelerating Strassen-Winograd's matrix multiplication algorithm on GPUs

Accelerating inclusion-based pointer analysis on heterogeneous CPU-GPU systems

Filter options

Publication date

Keywords

INFONA - science communication portal

20th Annual International Conference on High Performance Computing $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

20th Annual International Conference on High Performance Computing