SC15: International Conference for High Performance Computing, Networking, Storage and Analysis

chapter

Exploring network optimizations for large-scale graph analytics

Xinyu Que, Fabio Checconi, Fabrizio Petrini, Xing Liu, more

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis > 1 - 10

Graph analytics are arguably one of the most demanding workloads for high-performance systems and interconnection networks. Graph applications often display all-to-all, fine-grained, high-rate communication patterns that expose the limits of the network protocol stacks. Load and communication imbalance generate hard-to-predict network hot-spots, and may require computational steering due to unpredictable...

chapter

Parallel distributed memory construction of suffix and longest common prefix arrays

Patrick Flick, Srinivas Aluru

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis > 1 - 10

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis

Suffix arrays and trees are fundamental string data structures of importance to many applications in computational biology. Consequently, their parallel construction is an actively studied problem. To date, algorithms with best practical performance lack efficient worst-case run-time guarantees, and vice versa. In addition, much of the recent work targeted low core count, shared memory parallelization...

chapter

Improving backfilling by using machine learning to predict running times

Eric Gaussier, David Glesser, Valentin Reis, Denis Trystram

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis > 1 - 10

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis

The job management system is the HPC middleware responsible for distributing computing power to applications. While such systems generate an ever increasing amount of data, they are characterized by uncertainties on some parameters like the job running times. The question raised in this work is: To what extent is it possible/useful to take into account predictions on the job running times for improving the global scheduling?...

chapter

Large-scale compute-intensive analysis via a combined in-situ and co-scheduling workflow approach

Christopher Sewell, Katrin Heitmann, Hal Finkel, George Zagaris, more

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis > 1 - 11

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis

Large-scale simulations can produce hundreds of terabytes to petabytes of data, complicating and limiting the efficiency of workflows. Traditionally, outputs are stored on the file system and analyzed in post-processing. With the rapidly increasing size and complexity of simulations, this approach faces an uncertain future. Trending techniques consist of performing the analysis in-situ, utilizing...

chapter

Massively parallel models of the human circulatory system

Amanda Randles, Erik W. Draeger, Tomas Oppelstrup, Liam Krauss, more

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis > 1 - 11

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis

The potential impact of blood flow simulations on the diagnosis and treatment of patients suffering from vascular disease is tremendous. Empowering models of the full arterial tree can provide insight into diseases such as arterial hypertension and enables the study of the influence of local factors on global hemodynamics. We present a new, highly scalable implementation of the lattice Boltzmann method...

chapter

Engineering inhibitory proteins with InSiPS: the in-silico protein synthesizer

Andrew Schoenrock, Daniel Burnside, Houman Moteshareie, Alex Wong, more

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis > 1 - 11

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis

Engineered proteins are synthetic novel proteins (not found in nature) that are designed to fulfill a predetermined biological function. Such proteins can be used as molecular markers, inhibitory agents, or drugs. For example, a synthetic protein could bind to a critical protein of a pathogen, thereby inhibiting the function of the target protein and potentially reducing the impact of the pathogen...

chapter

C²-bound: a capacity and concurrency driven analytical model for many-core design

Yu-Hang Liu, Xian-He Sun

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis > 1 - 11

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis

In this paper, we propose C²-Bound, a data-driven analytical model, that incorporates both memory capacity and data access concurrency factors to optimize many-core design. C²-Bound is characterized by combining the newly proposed latency model, concurrent average memory access time (C-AMAT), with the well-known memory-bounded speedup model (Sun-Ni's law) to facilitate computing tasks. Compared to...

chapter

HydraDB: a resilient RDMA-driven key-value middleware for in-memory cluster computing

Yandong Wang, Li Zhang, Jian Tan, Min Li, more

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis > 1 - 11

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis

In this paper, we describe our experiences and lessons learned from building a general-purpose in-memory key-value middleware, called HydraDB. HydraDB synthesizes a collection of state-of-the-art techniques, including continuous fault-tolerance, Remote Direct Memory Access (RDMA), as well as awareness for multicore systems, etc, to deliver a high-throughput, low-latency access service in a reliable...

chapter

Cost-effective diameter-two topologies: analysis and evaluation

Georgios Kathareios, Cyriel Minkenberg, Bogdan Prisacari, German Rodriguez, more

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis > 1 - 11

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis

HPC network topology design is currently shifting from high-performance, higher-cost Fat-Trees to more cost-effective architectures. Three diameter-two designs, the Slim Fly, Multi-Layer Full-Mesh, and Two-Level Orthogonal Fat-Tree excel in this, exhibiting a cost per endpoint of only 2 links and 3 router ports with lower end-to-end latency and higher scalability than traditional networks of the same...

chapter

Profile-based power shifting in interconnection networks with on/off links

Shinobu Miwa, Hiroshi Nakamura

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis > 1 - 11

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis

Overprovisioning hardware devices and coordinating their power budgets are proposed to improve the application performance of future power-constrained HPC systems. This coordination process is called power shifting. Meanwhile, recent studies have revealed that on/off links can save network power in HPC systems. Future HPC systems will thus adopt on/off links in addition to power shifting. This paper...

chapter

Scalable sparse tensor decompositions in distributed memory systems

Oguz Kaya, Bora Uçar

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis > 1 - 11

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis

We investigate an efficient parallelization of the most common iterative sparse tensor decomposition algorithms on distributed memory systems. A key operation in each iteration of these algorithms is the matricized tensor times Khatri-Rao product (MTTKRP). This operation amounts to element-wise vector multiplication and reduction depending on the sparsity of the tensor. We investigate a fine and a...

chapter

A parallel connectivity algorithm for de Bruijn graphs in metagenomic applications

Patrick Flick, Chirag Jain, Tony Pan, Srinivas Aluru

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis > 1 - 11

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis

Dramatic advances in DNA sequencing technology have made it possible to study microbial environments by direct sequencing of environmental DNA samples. Yet, due to the huge volume and high data complexity, current de novo assemblers cannot handle large metagenomic datasets or fail to perform assembly with acceptable quality. This paper presents the first parallel solution for decomposing the metagenomic...

chapter

Optimal scheduling of in-situ analysis for large-scale scientific simulations

Preeti Malakar, Venkatram Vishwanath, Todd Munson, Christopher Knight, more

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis > 1 - 11

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis

Today's leadership computing facilities have enabled the execution of transformative simulations at unprecedented scales. However, analyzing the huge amount of output from these simulations remains a challenge. Most analyses of this output is performed in post-processing mode at the end of the simulation. The time to read the output for the analysis can be significantly high due to poor I/O bandwidth,...

chapter

Performance of random sampling for computing low-rank approximations of a dense matrix on GPUs

Théo Mary, Ichitaro Yamazaki, Jakub Kurzak, Piotr Luszczek, more

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis > 1 - 11

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis

A low-rank approximation of a dense matrix plays an important role in many applications. To compute such an approximation, a common approach uses the QR factorization with column pivoting (QRCP). Though the reliability and efficiency of QRCP have been demonstrated, this deterministic approach requires costly communication at each step of the factorization. Since such communication is becoming increasingly...

chapter

STS-k: a multilevel sparse triangular solution scheme for NUMA multicores

Humayun Kabir, Joshua Dennis Booth, Guillaume Aupy, Anne Benoit, more

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis > 1 - 11

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis

We consider techniques to improve the performance of parallel sparse triangular solution on non-uniform memory architecture multicores by extending earlier coloring and level set schemes for single-core multiprocessors. We develop STS-k, where k represents a small number of transformations for latency reduction from increased spatial and temporal locality of data accesses. We propose a graph model...

chapter

Particle tracking in open simulation laboratories

Kalin Kanov, Randal Burns

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis > 1 - 11

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis

Particle tracking along streamlines and pathlines is a common scientific analysis technique, which has demanding data, computation and communication requirements. It has been studied in the context of high-performance computing due to the difficulty in its efficient parallelization and its high demands on communication and computational load. In this paper, we study efficient evaluation methods for...

chapter

Node variability in large-scale power measurements: perspectives from the Green500, Top500 and EEHPCWG

Thomas Scogland, Jonathan Azose, David Rohr, Suzanne Rivoire, more

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis > 1 - 11

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis

The last decade has seen power consumption move from an afterthought to the foremost design constraint of new supercomputers. Measuring the power of a supercomputer can be a daunting proposition, and as a result, many published measurements are extrapolated. This paper explores the validity of these extrapolations in the context of inter-node power variability and power variations over time within...

chapter

Dynamic power sharing for higher job throughput

Daniel A. Ellsworth, Allen D. Malony, Barry Rountree, Martin Schulz

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis > 1 - 11

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis

Current trends for high-performance systems are leading towards hardware overprovisioning where it is no longer possible to run all components at peak power without exceeding a system- or facility-wide power bound. The standard practice of static power scheduling is likely to lead to inefficiencies with over- and under-provisioning of power to components at runtime. In this paper we investigate the...

chapter

HipMer: an extreme-scale de novo genome assembler

Evangelos Georganas, Aydın Buluç, Jarrod Chapman, Steven Hofmeyr, more

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis > 1 - 11

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis

De novo whole genome assembly reconstructs genomic sequences from short, overlapping, and potentially erroneous DNA segments and is one of the most important computations in modern genomics. This work presents HipMer, the first high-quality end-to-end de novo assembler designed for extreme scale analysis, via efficient parallelization of the Meraculous code. First, we significantly improve scalability...

chapter

Energy-aware data transfer algorithms

Ismail Alan, Engin Arslan, Tevfik Kosar

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis > 1 - 12

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis

The amount of data moved over the Internet per year has already exceeded the Exabyte scale and soon will hit the Zettabyte range. To support this massive amount of data movement across the globe, the networking infrastructure as well as the source and destination nodes consume immense amount of electric power, with an estimated cost measured in billions of dollars. Although considerable amount of...

INFONA - science communication portal

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis

Exploring network optimizations for large-scale graph analytics

Parallel distributed memory construction of suffix and longest common prefix arrays

Improving backfilling by using machine learning to predict running times

Large-scale compute-intensive analysis via a combined in-situ and co-scheduling workflow approach

Massively parallel models of the human circulatory system

Engineering inhibitory proteins with InSiPS: the in-silico protein synthesizer

C²-bound: a capacity and concurrency driven analytical model for many-core design

HydraDB: a resilient RDMA-driven key-value middleware for in-memory cluster computing

Cost-effective diameter-two topologies: analysis and evaluation

Profile-based power shifting in interconnection networks with on/off links

Scalable sparse tensor decompositions in distributed memory systems

A parallel connectivity algorithm for de Bruijn graphs in metagenomic applications

Optimal scheduling of in-situ analysis for large-scale scientific simulations

Performance of random sampling for computing low-rank approximations of a dense matrix on GPUs

STS-k: a multilevel sparse triangular solution scheme for NUMA multicores

Particle tracking in open simulation laboratories

Node variability in large-scale power measurements: perspectives from the Green500, Top500 and EEHPCWG

Dynamic power sharing for higher job throughput

HipMer: an extreme-scale de novo genome assembler

Energy-aware data transfer algorithms

Filter options

Publication date

Keywords

INFONA - science communication portal

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis