SC14: International Conference for High Performance Computing, Networking, Storage and Analysis

Items from 1 to 20 out of 94 results

book

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis

IEEE

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis

chapter

Title Page iii

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis > iii

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis

chapter

Petascale High Order Dynamic Rupture Earthquake Simulations on Heterogeneous Supercomputers

Alexander Heinecke, Alexander Breuer, Sebastian Rettenberger, Michael Bader, more

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis > 3 - 14

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis

We present an end-to-end optimization of the innovative Arbitrary high-order DERivative Discontinuous Galerkin (ADER-DG) software SeisSol targeting Intel® Xeon Phi coprocessor platforms, achieving unprecedented earthquake model complexity through coupled simulation of full frictional sliding and seismic wave propagation. SeisSol exploits unstructured meshes to flexibly adapt for complicated geometries...

chapter

24.77 Pflops on a Gravitational Tree-Code to Simulate the Milky Way Galaxy with 18600 GPUs

Jeroen Bedorf, Evghenii Gaburov, Michiko S. Fujii, Keigo Nitadori, more

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis > 54 - 65

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis

We have simulated, for the first time, the long term evolution of the Milky Way Galaxy using 51 billion particles on the Swiss Piz Daint supercomputer with our N-body gravitational tree-code Bonsai. Herein, we describe the scientific motivation and numerical algorithms. The Milky Way model was simulated for 6 billion years, during which the bar structure and spiral arms were fully formed. This improves...

chapter

Parallelization of Reordering Algorithms for Bandwidth and Wavefront Reduction

Konstantinos I. Karantasis, Andrew Lenharth, Donald Nguyen, Mara J. Garzaran, more

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis > 921 - 932

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis

Many sparse matrix computations can be speeded up if the matrix is first reordered. Reordering was originally developed for direct methods but it has recently become popular for improving the cache locality of parallel iterative solvers since reordering the matrix to reduce bandwidth and wave front can improve the locality of reference of sparse matrix-vector multiplication (SpMV), the key kernel...

chapter

Scalable Computation of Stream Surfaces on Large Scale Vector Fields

Kewei Lu, Han-Wei Shen, Tom Peterka

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis > 1008 - 1019

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis

Stream surfaces and streamlines are two popular methods for visualizing three-dimensional flow fields. While several parallel streamline computation algorithms exist, relatively little research has been done to parallelize stream surface generation. This is because load-balanced parallel stream surface computation is nontrivial, due to the strong dependency in computing the positions of the particles...

chapter

Reciprocal Resource Fairness: Towards Cooperative Multiple-Resource Fair Sharing in IaaS Clouds

Haikun Liu, Bingsheng He

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis > 970 - 981

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis

Resource sharing in virtualized environments have been demonstrated significant benefits to improve application performance and resource/energy efficiency. However, resource sharing, especially for multiple resource types, poses several severe and challenging problems in pay-as-you-use cloud environments, such as sharing incentive, free-riding, lying and economic fairness. To address those problems,...

chapter

In-Situ Feature Extraction of Large Scale Combustion Simulations Using Segmented Merge Trees

Aaditya G. Landge, Valerio Pascucci, Attila Gyulassy, Janine C. Bennett, more

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis > 1020 - 1031

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis

The ever increasing amount of data generated by scientific simulations coupled with system I/O constraints are fueling a need for in-situ analysis techniques. Of particular interest are approaches that produce reduced data representations while maintaining the ability to redefine, extract, and study features in a post-process to obtain scientific insights. This paper presents two variants of in-situ...

chapter

Faster Parallel Traversal of Scale Free Graphs at Extreme Scale with Vertex Delegates

Roger Pearce, Maya Gokhale, Nancy M. Amato

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis > 549 - 559

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis

At extreme scale, irregularities in the structure of scale-free graphs such as social network graphs limit our ability to analyze these important and growing datasets. A key challenge is the presence of high-degree vertices (hubs), that leads to parallel workload and storage imbalances. The imbalances occur because existing partitioning techniques are not able to effectively partition high-degree...

chapter

Understanding Soft Error Resiliency of Blue Gene/Q Compute Chip through Hardware Proton Irradiation and Software Fault Injection

Chen-Yong Cher, Meeta S. Gupta, Pradip Bose, K. Paul Muller

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis > 587 - 596

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis

Soft Error Resiliency is a major concern for Petascale high performance computing (HPC) systems. Blue Gene/Q (BG/Q) is the third generation of IBM's massively parallel, energy efficient Blue Gene series of supercomputers. The principal goal of this work is to understand the interaction between Blue-Gene/Q's hardware resiliency features and high-performance applications through proton irradiation of...

chapter

Scalable and High Performance Betweenness Centrality on the GPU

Adam McLaughlin, David A. Bader

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis > 572 - 583

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis

Graphs that model social networks, numerical simulations, and the structure of the Internet are enormous and cannot be manually inspected. A popular metric used to analyze these networks is between ness centrality, which has applications in community detection, power grid contingency analysis, and the study of the human brain. However, these analyses come with a high computational cost that prevents...

chapter

Parallel Programming with Migratable Objects: Charm++ in Practice

Bilge Acun, Abhishek Gupta, Nikhil Jain, Akhil Langer, more

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis > 647 - 658

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis

The advent of petascale computing has introduced new challenges (e.g. Heterogeneity, system failure) for programming scalable parallel applications. Increased complexity and dynamism in science and engineering applications of today have further exacerbated the situation. Addressing these challenges requires more emphasis on concepts that were previously of secondary importance, including migratability,...

chapter

IndexFS: Scaling File System Metadata Performance with Stateless Caching and Bulk Insertion

Kai Ren, Qing Zheng, Swapnil Patil, Garth Gibson

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis > 237 - 248

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis

The growing size of modern storage systems is expected to exceed billions of objects, making metadata scalability critical to overall performance. Many existing distributed file systems only focus on providing highly parallel fast access to file data, and lack a scalable metadata service. In this paper, we introduce a middleware design called Index FS that adds support to existing file systems such...

chapter

Dissecting On-Node Memory Access Performance: A Semantic Approach

Alfredo Gimenez, Todd Gamblin, Barry Rountree, Abhinav Bhatele, more

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis > 166 - 176

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis

Optimizing memory access is critical for performance and power efficiency. CPU manufacturers have developed sampling-based performance measurement units (PMUs) that report precise costs of memory accesses at specific addresses. However, this data is too low-level to be meaningfully interpreted and contains an excessive amount of irrelevant or uninteresting information. We have developed a method to...

chapter

Pipelining Computational Stages of the Tomographic Reconstructor for Multi-Object Adaptive Optics on a Multi-GPU System

Ali Charara, Hatem Ltaief, Damien Gratadour, David Keyes, more

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis > 262 - 273

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis

The European Extremely Large Telescope project (E-ELT) is one of Europe's highest priorities in ground-based astronomy. ELTs are built on top of a variety of highly sensitive and critical astronomical instruments. In particular, a new instrument called MOSAIC has been proposed to perform multi-object spectroscopy using the Multi-Object Adaptive Optics (MOAO) technique. The core implementation of the...

chapter

Best Practices and Lessons Learned from Deploying and Operating Large-Scale Data-Centric Parallel File Systems

Sarp Oral, James Simmons, Jason Hill, Dustin Leverman, more

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis > 217 - 228

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis

The Oak Ridge Leadership Computing Facility (OLCF) has deployed multiple large-scale parallel file systems (PFS) to support its operations. During this process, OLCF acquired significant expertise in large-scale storage system design, file system software development, technology evaluation, benchmarking, procurement, deployment, and operational practices. Based on the lessons learned from each new...

chapter

DISC: A Domain-Interaction Based Programming Model with Support for Heterogeneous Execution

Mehmet Can Kurt, Gagan Agrawal

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis > 869 - 880

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis

Several emerging trends are pointing to increasing heterogeneity among nodes and/or cores in HPC systems. Existing programming models, especially for distributed memory execution, typically have been designed to facilitate high performance on homogeneous systems. This paper describes a programming model and an associated runtime system we have developed to address the above need. The main concepts...

chapter

A System Software Approach to Proactive Memory-Error Avoidance

Carlos H.A. Costa, Yoonho Park, Bryan S. Rosenburg, Chen-Yong Cher, more

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis > 707 - 718

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis

Today's HPC systems use two mechanisms to address main-memory errors. Error-correcting codes make correctable errors transparent to software, while checkpoint/restart (CR) enables recovery from uncorrectable errors. Unfortunately, CR overhead will be enormous at exascale due to the high failure rate of memory. We propose a new OS-based approach that proactively avoids memory errors using prediction...

chapter

Fault-Tolerant Dynamic Task Graph Scheduling

Mehmet Can Kurt, Sriram Krishnamoorthy, Kunal Agrawal, Gagan Agrawal

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis > 719 - 730

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis

In this paper, we present an approach to fault tolerant execution of dynamic task graphs scheduled using work stealing. In particular, we focus on selective and localized recovery of tasks in the presence of soft faults. From users, we elicit the basic task graph structure in terms of successor and predecessor relationships. The work-stealing-based algorithm to schedule such a task graph is augmented...

chapter

Optimized Scheduling Strategies for Hybrid Density Functional theory Electronic Structure Calculations

William Dawson, Francois Gygi

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis > 685 - 692

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis

Hybrid Density Functional Theory (DFT) has recently gained popularity as an accurate model of electronic interactions in chemistry and materials science applications. The most computationally expensive part of hybrid DFT simulations is the calculation of exchange integrals between pairs of electrons. We present strategies to achieve improved load balancing and scalability for the parallel computation...

Publication date

Set your own date range

Content availability

Available (93)
None (1)

Keywords

GPU (3)
HIGH PERFORMANCE COMPUTING (3)
MPI (3)
PARALLEL ALGORITHMS (3)
CILK (2)
CLOUD COMPUTING (2)
CUDA (2)
DIVIDE-AND-CONQUER (2)
FAULT TOLERANCE (2)
PREDICTION (2)
ROUTING (2)
SCALABILITY (2)
STORAGE (2)
ACSR (1)
ADER-DG (1)
ALGORITHMS (1)
AMD (1)
AND FAULT-TOLERANCE (1)
AND VERY LA (1)
APPROXIMATE CLUSTERING ALGORITHM (1)
AVAILABILITY (1)
BAYESIAN NETWORKS (1)
BIG DATA (1)
BUG DETECTION (1)
BULK INSERTION (1)
CHEMISTRY (1)
CHIP IRRADIATION (1)
CLOUD STORAGE (1)
CLUSTER COMPUTING (1)
CO-DESIGN (1)
COMMUNICATION-AVOIDING ALGORITHMS (1)
COMPRESSED SPARSE ROW (CSR) (1)
COMPUTATIONAL ASTRONOMY (1)
COMPUTATIONAL GEOMETRY (1)
COMPUTING MODEL (1)
CSR (1)
DAG (1)
DATA ANALYTICS (1)
DATA FLOW ANALSIS (1)
DATA LOCALITY (1)
DELAUNAY TESSELLATION (1)
DENSE LINEAR ALGEBRA (1)
DENSITY BASED CLUSTERING (1)
DENSITY FUNCTIONAL THEORY (1)
DISJOINT-SET DATA STRUCTURE (1)
DISTRIBUTED ARCHITECTURES (1)
DISTRIBUTED FILE SYSTEMS (1)
DOMAIN DECOMPOSITION (1)
DRAGONFLY NETWORKS (1)
DYNAMIC LOAD BALANCING (1)
DYNAMIC RUPTURE (1)
DYNAMIC SCHEDULER (1)
EARTHQUAKE SIMULATION (1)
ENDPOINTS (1)
EXASCALE (1)
FAIL-IN-PLACE (1)
FAIRNESS (1)
FAULT INJECTION (1)
FEATURE EXTRACTION (1)
FENCE INSTRUCTIONS (1)
FIELD TESTING (1)
FILE SYSTEM METADATA (1)
FLOW VISUALIZATION (1)
FORK/JOIN (1)
FORMAL VERIFICATION (1)
G.1.3 [NUMERICAL ANALYSIS]: NUMERICAL LINEAR ALGEBRA SPARSE (1)
GENE NETWORKS (1)
GENERAL PURPOSE COMPUTATION ON GRAPHICS PROCESSING UNITS (GPGPU) (1)
GEODYNAMICS (1)
GPU COMPUTING (1)
GPUS (1)
GRAMMAR (1)
GRAPH (1)
GRAPH ALGORITHMS (1)
HETEROGENEOUS (1)
HETEROGENEOUS SUPERCOMPUTERS (1)
HETEROGENEOUS SUPPORT (1)
HIERARCHICAL SCHEDULING (1)
HIGH-PERFORMANCE APPLICATIONS (1)
HPC (1)
HPC CLUSTER (1)
HYB (1)
HYBRID PARALLEL PROGRAMMING (1)
HYBRID PARALLELIZATION (1)
I/O (1)
IAAS (1)
IN SITU ANALYSIS (1)
INNOVATIVE HARDWARE/SOFTWARE CO-DESIGN (1)
INTEL® XEON PHI COPROCESSOR (1)
INTERCONNECT TESTING (1)
INTERMITTENT ERROR (1)
ITERATIVE COMPUTATION (1)
JOB PLACEMENT (1)
LATENCY PROPAGATION (1)
LATTICE QCD CATEGORIES AND SUBJECT DESCRIPTORS: D.3.4 [PROGRAMMING LANGUAGES]: PROCESSORS OPTIMIZATION (1)
LINEAR AND NONLINEAR SYSTEMS (1)
LINEAR PROGRAMMING (1)
LINPACK (1)
LOAD BALANCING (1)
LOG-STRUCTURED MERGE TREE (1)
more

INFONA - science communication portal

SC14: International Conference for High Performance Computing, Networking, Storage and Analysis