It is common for real-world applications to analyze big graphs using distributed graph processing systems. Popular in-memory systems require an enormous amount of resources to handle big graphs. While several out-of-core approaches have been proposed for processing big graphs on disk, the high disk I/O overhead could significantly reduce performance. In this paper, we propose GraphH to enable high-performance...
A proliferation of data from vast networks of remote sensing platforms (satellites, unmanned aircraft systems (UAS), airborne platforms, etc.), observational facilities (meteorological, eddy covariance, etc.), state-of-the-art sensors, and simulation models offers unprecedented opportunities for scientific discovery. Unsupervised classification is a widely applied data mining approach to derive insights from such...
Property graphs are becoming popular for Intrusion Detection Systems (IDSs) because they make it possible to leverage distributed graph processing platforms to identify malicious network traffic patterns. However, a benchmark for studying their performance when operating on big data has not yet been reported. In general, benchmarking a system involves the execution of workloads on datasets, where both...
Platform as a Service (PaaS) clouds provide part of the hardware/software stack and related services to tenant applications. Increased load is handled elastically by scaling, which either modifies the number of instances an application has available on the cloud or increases their available resources. However, because all these instances run inside isolated containers, experience gained by the first...
Nowadays, the Graphics Processing Unit (GPU) is essential for general-purpose high-performance computing because of its dominant parallel-computing performance compared to that of the CPU. There have been many successful efforts to use GPUs in virtualized environments. In particular, NVIDIA Docker provides a practical way to bring the GPU into container-based virtualized environments. However, most...
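For context on the container-based GPU virtualization the abstract refers to: modern Docker (19.03+) exposes NVIDIA GPUs to a container through the `--gpus` flag, which superseded the original `nvidia-docker` wrapper. A minimal illustrative invocation (the image tag is a placeholder, not tied to this paper):

```shell
# Run nvidia-smi inside a CUDA base container with all host GPUs visible.
# Requires the NVIDIA Container Toolkit on the host.
docker run --rm --gpus all nvidia/cuda:12.2.0-base-ubuntu22.04 nvidia-smi
```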
While large-scale simulations have been the hallmark of the High Performance Computing (HPC) community for decades, Large Scale Data Analytics (LSDA) workloads are gaining attention within the scientific community not only as a processing component to large HPC simulations, but also as standalone scientific tools for knowledge discovery. With the path towards Exascale, new HPC runtime systems are...
We present EclipseMR, a novel MapReduce framework prototype that efficiently utilizes a large distributed memory in cluster environments. EclipseMR consists of double-layered consistent hash rings: a decentralized DHT-based file system and an in-memory key-value store that employs consistent hashing. The in-memory key-value store in EclipseMR is designed not only to cache local data but also remote...
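Consistent hashing, the building block this abstract names, can be illustrated with a minimal sketch (hypothetical node names; not EclipseMR's actual implementation). Keys and nodes are hashed onto a ring, and each key belongs to the first node point found clockwise, so adding or removing a node only remaps the keys adjacent to its points:

```python
import bisect
import hashlib

def h(key: str) -> int:
    """Map a string to a point on the hash ring."""
    return int(hashlib.md5(key.encode()).hexdigest(), 16)

class ConsistentHashRing:
    def __init__(self, nodes, vnodes=100):
        # Each physical node gets `vnodes` virtual points on the ring
        # to smooth out the key distribution.
        self.ring = sorted((h(f"{n}#{i}"), n)
                           for n in nodes for i in range(vnodes))
        self.points = [p for p, _ in self.ring]

    def lookup(self, key: str) -> str:
        # Walk clockwise to the first virtual point at or after the key's
        # hash, wrapping around at the end of the ring.
        i = bisect.bisect(self.points, h(key)) % len(self.ring)
        return self.ring[i][1]

ring = ConsistentHashRing(["node-a", "node-b", "node-c"])
owner = ring.lookup("block-42")
```

The wrap-around in `lookup` is what gives the scheme its stability: removing a node leaves every other node's points, and hence every other key's owner, unchanged.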
The last few years saw the emergence of 64-bit ARM SoCs targeted for mobile systems and servers. Mobile-class SoCs rely on the heterogeneous integration of a mix of CPU cores, GPGPU cores, and accelerators, whereas server-class SoCs instead rely on integrating a larger number of CPU cores with no GPGPU support and a number of network accelerators. Previous works, such as the Mont-Blanc project, built...
In high-performance computing (HPC), end-to-end workflows are typically utilized to gain insights from scientific simulations. An end-to-end workflow consists of a scientific simulation and data analysis, and can be executed in situ, in transit, or offline. Existing studies on end-to-end workflows have largely focused on high-performance execution approaches. However, the emerging heterogeneous...
Various extensions of TCP/IP have been proposed to reduce network latency; examples include Explicit Congestion Notification (ECN), Data Center TCP (DCTCP) and several proposals for Active Queue Management (AQM). Combining these techniques requires adjusting various parameters, and recent studies have found that it is difficult to do so while obtaining both high performance and low latency. This is...
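The techniques this abstract names each have a standard Linux knob, and combining them is a matter of setting those knobs consistently. A hypothetical configuration sketch (requires root; `eth0` is a placeholder device name, and the exact AQM discipline and thresholds are exactly the kind of parameters the abstract says are hard to tune):

```shell
# Switch the congestion-control algorithm to DCTCP.
sysctl -w net.ipv4.tcp_congestion_control=dctcp
# Negotiate ECN on outgoing and incoming connections.
sysctl -w net.ipv4.tcp_ecn=1
# Install an AQM queueing discipline on the interface.
tc qdisc replace dev eth0 root fq_codel
```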
Studying the interaction among applications, MPI runtimes, and the fabric they run on is critical to understanding application performance. There exists no high-performance and scalable tool that enables understanding this interplay on modern multi-petaflop systems. Designing such a tool is non-trivial and involves multiple components including 1) data profiling/collection from network/MPI library,...
Scientific data sets, which grow rapidly in volume, are often accompanied by plentiful metadata, such as information about the associated experiment or simulation. When this metadata is not managed well, the data become difficult to utilize and their value is lost over time. Ideally, metadata should be managed along with its corresponding data by a single storage system, and should be directly accessible and updatable. However, existing...
In situ workflows contain tasks that exchange messages composed of several data fields. However, a consumer task may not necessarily need all the data fields from its producer. For example, a molecular dynamics simulation can produce atom positions, velocities, and forces; but some analyses require only atom positions. The user should decide whether to specialize the output of a producer task for...
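The output specialization described here can be sketched in a few lines (hypothetical field names drawn from the molecular-dynamics example; this is an illustration, not the paper's mechanism): the producer emits only the fields a given consumer subscribed to, rather than the full message.

```python
# A producer message with several data fields, as in the MD example.
frame = {
    "positions":  [(0.0, 0.0, 0.0), (1.2, 0.4, 0.9)],
    "velocities": [(0.1, 0.0, 0.0), (0.0, 0.2, 0.0)],
    "forces":     [(0.0, -9.8, 0.0), (0.0, -9.8, 0.0)],
}

def specialize(message: dict, needed: set) -> dict:
    """Keep only the data fields a consumer actually requested."""
    return {field: message[field] for field in needed if field in message}

# An analysis that only needs atom positions receives a smaller message.
analysis_input = specialize(frame, {"positions"})
```

The trade-off the abstract raises is whether this filtering should happen in the producer (smaller messages, but a specialized producer) or in the consumer (generic producer, but full-size transfers).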
Stream processing applications continuously process large amounts of online streaming data in real-time or near real-time. They have strict latency constraints, but they are also vulnerable to failures. Failure recoveries may slow down the entire processing pipeline and break latency constraints. Upstream backup is one of the most widely applied fault-tolerant schemes for stream processing systems...
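Upstream backup, the fault-tolerance scheme this abstract builds on, can be sketched minimally (illustrative only; class and method names are hypothetical): an upstream operator buffers every tuple it emits until the downstream operator acknowledges it, and replays the unacknowledged suffix after a failure.

```python
from collections import deque

class UpstreamBackup:
    def __init__(self):
        self.buffer = deque()  # (seq, tuple) pairs not yet acknowledged
        self.seq = 0

    def emit(self, item):
        # Assign a sequence number and keep a backup copy before sending.
        self.seq += 1
        self.buffer.append((self.seq, item))
        return self.seq, item

    def ack(self, seq):
        # Downstream has durably processed everything up to `seq`;
        # trim the backup buffer accordingly.
        while self.buffer and self.buffer[0][0] <= seq:
            self.buffer.popleft()

    def replay(self):
        # After a downstream failure, resend all unacknowledged tuples.
        return [item for _, item in self.buffer]
```

The latency concern the abstract raises follows directly from this sketch: during recovery the pipeline must reprocess the whole replayed suffix before it can resume real-time progress.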
Relational databases are well suited for vertical scaling; however, specialized hardware can be expensive. Conversely, NewSQL and NoSQL data stores are designed to scale horizontally. NewSQL databases provide ACID transaction support; however, joins are limited to the partition keys, resulting in restricted query expressiveness. On the other hand, NoSQL databases are designed to scale out on commodity...
Efficiently programming shared-memory machines is a difficult challenge because mapping application threads onto the memory hierarchy has a strong impact on performance. However, optimizing such thread placement is difficult: architectures are becoming increasingly complex, and application behavior changes with implementations and input parameters, e.g., problem size and number of threads. In this work,...
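One common interface for the thread placement discussed here is the OpenMP binding environment variables, which control where threads land on the memory hierarchy. A hypothetical invocation (the application name is a placeholder; this is a generic illustration, not this paper's approach):

```shell
# Pin 16 threads to cores. "spread" distributes threads across the
# machine (better aggregate bandwidth on NUMA systems); "close" would
# pack them near the master thread (better sharing through caches).
export OMP_NUM_THREADS=16
export OMP_PLACES=cores
export OMP_PROC_BIND=spread
./my_app
```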
Most applications running on supercomputers achieve only a fraction of a system's peak performance. It has been demonstrated that co-scheduling applications can improve overall system utilization. However, for this approach to work, applications need to fulfill certain criteria so that their mutual slowdown is kept to a minimum. In this paper, we present an HPC scheduler that applies co-scheduling...
Resource usage data, collected using tools such as TACC_Stats, capture the resource utilization by nodes within a high performance computing system. We present methods to analyze the resource usage data to understand the system performance and identify performance anomalies. The core idea is to model the data as a three-way tensor corresponding to the compute nodes, usage metrics, and time. Using...
Almost all performance analysis tools in the HPC space perform some form of aggregation to compute summary information of a series of performance measurements, from summations to more complex operations like histograms. Aggregation not only reduces data volumes and consequently storage space requirements and overheads, but is also crucial to extract insights from recorded measurement data. In current...
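The two ends of the aggregation spectrum mentioned here, a plain summation and a histogram, can be sketched with a toy measurement series (values are made up for illustration):

```python
from collections import Counter

# Hypothetical series of per-call latency measurements (microseconds).
samples = [12, 15, 9, 120, 14, 11, 310, 13, 10, 16]

# Simplest aggregation: a single summary value.
total = sum(samples)

def histogram(values, bin_width):
    """Count samples per fixed-width bin, keyed by the bin's lower edge."""
    counts = Counter(v // bin_width for v in values)
    return {b * bin_width: counts[b] for b in sorted(counts)}

# A richer aggregation that preserves the distribution's shape
# (e.g. the two outliers above 100 us) at a fraction of the raw volume.
hist = histogram(samples, bin_width=100)
```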
Accelerated clusters, which are distributed-memory systems equipped with accelerators, have been used in various fields. For accelerated clusters, programmers often implement their applications using a combination of MPI and CUDA (MPI+CUDA). However, this approach suffers from programming complexity. This paper introduces the XcalableACC (XACC) language, which is a hybrid model of XcalableMP (XMP) and...