2017 IEEE International Conference on Cluster Computing (CLUSTER)

chapter

Monitoring Infrastructure: The Challenges of Moving Beyond Petascale

Amanda Bonnie, Mike Mason, Daniel Illescas

2017 IEEE International Conference on Cluster Computing (CLUSTER) > 785 - 788

Scaling clusters is no longer the only struggle in moving towards exascale in HPC. While scaling components such as the network and file systems is a widely accepted need, monitoring, on the other hand, is often left behind in the procurement of these large systems. Monitoring is often quite an afterthought that is expected to be incorporated in existing infrastructure. While that often works for...

chapter

PFAnalyzer: A Toolset for Analyzing Application-Aware Dynamic Interconnects

Keichi Takahashi, Susumu Date, Dashdavaa Khureltulga, Yoshiyuki Kido, more

2017 IEEE International Conference on Cluster Computing (CLUSTER) > 789 - 796

2017 IEEE International Conference on Cluster Computing (CLUSTER)

Recent rapid scale out of high performance computing systems has rapidly and continuously increased the scale and complexity of the interconnects. As a result, current static and over-provisioned interconnects are becoming cost-ineffective. Against this background, we have been working on the integration of network programmability into the interconnect control, based on the idea that dynamically controlling...

chapter

Holistic Measurement-Driven System Assessment

Saurabh Jha, Jim Brandt, Ann Gentile, Zbigniew Kalbarczyk, more

2017 IEEE International Conference on Cluster Computing (CLUSTER) > 797 - 800

2017 IEEE International Conference on Cluster Computing (CLUSTER)

In high-performance computing systems, application performance and throughput are dependent on a complex interplay of hardware and software subsystems and variable workloads with competing resource demands. Data-driven insights into the potentially widespread scope and propagationof impact of events, such as faults and contention for shared resources, can be used to drive more effective use of resources,...

chapter

lo2s — Multi-core System and Application Performance Analysis for Linux

Thomas Ilsche, Robert Schone, Mario Bielert, Andreas Gocht, more

2017 IEEE International Conference on Cluster Computing (CLUSTER) > 801 - 804

2017 IEEE International Conference on Cluster Computing (CLUSTER)

In this paper we present lo2s - a lightweight performance monitoring tool to sample applications as well as the executing system. It enables the user to analyze the performance of a parallel application without requiring the time-consuming and error-prone process of application instrumentation. The collected performance data is complemented with various metric data, i.e., perf counters, kernel tracepoints,...

chapter

Measuring Minimum Switch Port Metric Retrieval Time and Impact for Multi-layer InfiniBand Fabrics

Michael Aguilar, Benjamin A. Allan, Sergei Polevitzky

2017 IEEE International Conference on Cluster Computing (CLUSTER) > 805 - 808

2017 IEEE International Conference on Cluster Computing (CLUSTER)

In this work, we seek to gain an understanding of the InfiniBand network processing limitations that might exist in gathering performance metric information from InfiniBand switches using our new LDMS ibfabric sampler. The limitations studied consist of delays in gathering InfiniBand metric information from a specific switch device due to the switch's processor response delays or RDMA contention for...

chapter

Understanding Performance Variability on the Aries Dragonfly Network

Taylor Groves, Yizi Gu, Nicholas J. Wright

2017 IEEE International Conference on Cluster Computing (CLUSTER) > 809 - 813

2017 IEEE International Conference on Cluster Computing (CLUSTER)

This work evaluates performance variability in the Cray Aries dragonfly network and characterizes its impact on MPI Allreduce. The execution time of Allreduce is limited by the performance of the slowest participating process, which can vary by more than an order of magnitude. We utilize counters from the network routers to provide a better understanding of how competing workloads can influence performance...

chapter

YAViT (Yet Another Viz Tool): Raising the Level of Abstraction in End-User HPC Interactions

Omar Aaziz, Ujjwal Panthi, Jonathan Cook

2017 IEEE International Conference on Cluster Computing (CLUSTER) > 814 - 817

2017 IEEE International Conference on Cluster Computing (CLUSTER)

Because data collection in HPC systems happens on the nodes and is easily related to the job running on the node, tools presenting the data and subsequent analyses to the user generally present them at the job level. Our position is that this is the wrong level of abstraction and thus limits the value of the analyses, often dissuading users from using any of the offered tools. In this paper we present...

chapter

Assessing Representativeness of Kernels Using Descriptive Statistics

Youngsung Kim, John M. Dennis, Christopher Kerr

2017 IEEE International Conference on Cluster Computing (CLUSTER) > 818 - 825

2017 IEEE International Conference on Cluster Computing (CLUSTER)

A kernel or mini-app is a self-contained small application that retains certain characteristics of the original application [7]. Working on a kernel or mini-app in the place of the original application can dramatically reduce the resources and effort required for performing software tasks such as performance optimization and porting to new platforms. However, using kernel as a proxy is based on the...

chapter

A Performance Projection of Mini-Applications onto Benchmarks Toward the Performance Projection of Real-Applications

Miwako Tsuji, William T. C. Kramer, Mitsuhisa Sato

2017 IEEE International Conference on Cluster Computing (CLUSTER) > 826 - 833

2017 IEEE International Conference on Cluster Computing (CLUSTER)

Widely used benchmarks, such as High Performance Linpack (HPL), do not always provide direct insights are notoriously poor indicators of into the actual application performance of systems. When real applications are used, and there have been are criticisms indicating that the performance of simplified benchmarks such as HPL no longer strongly correlate to real application performance. In contrast,...

chapter

Achieving Performance Portability for a Heat Conduction Solver Mini-Application on Modern Multi-core Systems

Richard O. Kirk, Gihan R. Mudalige, Istvan Z. Reguly, Steven A. Wright, more

2017 IEEE International Conference on Cluster Computing (CLUSTER) > 834 - 841

2017 IEEE International Conference on Cluster Computing (CLUSTER)

Modernizing production-grade, often legacy applications to take advantage of modern multi-core and many-core architectures can be a difficult and costly undertaking. This is especially true currently, as it is unclear which architectures will dominate future systems. The complexity of these codes can mean that parallelisation for a given architecture requires significant re-engineering. One way to...

chapter

TeaLeaf: A Mini-Application to Enable Design-Space Explorations for Iterative Sparse Linear Solvers

Simon McIntosh-Smith, Matthew Martineau, Tom Deakin, Grzegorz Pawelczak, more

2017 IEEE International Conference on Cluster Computing (CLUSTER) > 842 - 849

2017 IEEE International Conference on Cluster Computing (CLUSTER)

Iterative sparse linear solvers are an important class of algorithm in high performance computing, and form a crucial component of many scientific codes. As intra and inter node parallelism continues to increase rapidly, the design of new, scalable solvers which can target next generation architectures becomes increasingly important. In this work we present TeaLeaf, a recent mini-app constructed to...

chapter

The Arch Project: Physics Mini-Apps for Algorithmic Exploration and Evaluating Programming Environments on HPC Architectures

Matthew Martineau, Simon McIntosh-Smith

2017 IEEE International Conference on Cluster Computing (CLUSTER) > 850 - 857

2017 IEEE International Conference on Cluster Computing (CLUSTER)

The arch project is a suite of mini-apps that have been developed with consistent coding practices, under a common infrastructural layer. Great emphasis has been placed on making the applications concise and easy to manipulate, while capturing the key performance characteristics of their proxied algorithmic classes. The suite is intended for traditional exploration of performance, portability and...

chapter

Thoughtful Precision in Mini-Apps

Shane Fogerty, Siddhartha Bishnu, Yuliana Zamora, Laura Monroe, more

2017 IEEE International Conference on Cluster Computing (CLUSTER) > 858 - 865

2017 IEEE International Conference on Cluster Computing (CLUSTER)

Approximate computing addresses many of the identified challenges for exascale computing, leading to performance improvements that may include changes in fidelity of calculation. In this paper, we examine approximate approaches for a range of DOE-relevant computational problems run on a variety of architectures as a proxy for the wider set of exascaleclass applications.We show anticipated improvements...

chapter

Quicksilver: A Proxy App for the Monte Carlo Transport Code Mercury

David F. Richards, Ryan C. Bleile, Patrick S. Brantley, Shawn A. Dawson, more

2017 IEEE International Conference on Cluster Computing (CLUSTER) > 866 - 873

2017 IEEE International Conference on Cluster Computing (CLUSTER)

Like many other code teams, the developers of the Mercury Monte Carlo Transport code at Lawrence Livermore National Laboratory are being forced by the arrival of GPUbased supercomputers to substantially refactor their application to obtain acceptable performance on new architectures. This paper describes how we have designed, developed, and used Quicksilver, a proxy application for Mercury, to assist...

chapter

Pushing the Limits of Irregular Access Patterns on Emerging Network Architecture: A Case Study

Roberto Gioiosa, Thomas Warfel, Antonino Tumeo, Ryan Friese

2017 IEEE International Conference on Cluster Computing (CLUSTER) > 874 - 881

2017 IEEE International Conference on Cluster Computing (CLUSTER)

Irregular applications pose considerable challenges to modern computer systems, especially in distributed environments, where traditional high-performance networks are optimized for large message transfers. In this work, we analyze performance of an irregular application proxy benchmark running over traditional MPI/Infiniband as well as over the Data Vortex network, an emerging network architecture...

chapter

Author Index

2017 IEEE International Conference on Cluster Computing (CLUSTER) > 882 - 887

2017 IEEE International Conference on Cluster Computing (CLUSTER)

chapter

Publisher's Information

2017 IEEE International Conference on Cluster Computing (CLUSTER) > 888

2017 IEEE International Conference on Cluster Computing (CLUSTER)

INFONA - science communication portal

2017 IEEE International Conference on Cluster Computing (CLUSTER)

Monitoring Infrastructure: The Challenges of Moving Beyond Petascale

PFAnalyzer: A Toolset for Analyzing Application-Aware Dynamic Interconnects

Holistic Measurement-Driven System Assessment

lo2s — Multi-core System and Application Performance Analysis for Linux

Measuring Minimum Switch Port Metric Retrieval Time and Impact for Multi-layer InfiniBand Fabrics

Understanding Performance Variability on the Aries Dragonfly Network

YAViT (Yet Another Viz Tool): Raising the Level of Abstraction in End-User HPC Interactions

Assessing Representativeness of Kernels Using Descriptive Statistics

A Performance Projection of Mini-Applications onto Benchmarks Toward the Performance Projection of Real-Applications

Achieving Performance Portability for a Heat Conduction Solver Mini-Application on Modern Multi-core Systems

TeaLeaf: A Mini-Application to Enable Design-Space Explorations for Iterative Sparse Linear Solvers

The Arch Project: Physics Mini-Apps for Algorithmic Exploration and Evaluating Programming Environments on HPC Architectures

Thoughtful Precision in Mini-Apps

Quicksilver: A Proxy App for the Monte Carlo Transport Code Mercury

Pushing the Limits of Irregular Access Patterns on Emerging Network Architecture: A Case Study

Author Index

Publisher's Information

Filter options

Publication date

Keywords

INFONA - science communication portal

2017 IEEE International Conference on Cluster Computing (CLUSTER) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2017 IEEE International Conference on Cluster Computing (CLUSTER)