2016 9th Workshop on Many-Task Computing on Clouds, Grids, and Supercomputers (MTAGS)

book

2016 9th Workshop on Many-Task Computing on Clouds, Grids, and Supercomputers (MTAGS)

IEEE

chapter

Design of Fault Tolerant Pwrake Workflow System Supported by Gfarm File System

Masahiro Tanaka, Osamu Tatebe

2016 9th Workshop on Many-Task Computing on Clouds, Grids, and Supercomputers (MTAGS) > 7 - 12

2016 9th Workshop on Many-Task Computing on Clouds, Grids, and Supercomputers (MTAGS)

We have been developing a light-weight workflow system called Pwrake to execute data-intensive many-task workflows with the help of high-performance parallel I/O of Gfarm file system. This paper discusses the design of fault tolerance mechanism implemented in Pwrake. To avoid a workflow abort in the occurrence of a worker node failure, Pwrake detects a node failure based on the result of a task retry...

chapter

The AllScale Runtime Interface — Theoretical Foundation and Concept

Arne Hendricks, Thomas Heller, Herbert Jordan, Peter Thoman, more

2016 9th Workshop on Many-Task Computing on Clouds, Grids, and Supercomputers (MTAGS) > 13 - 19

2016 9th Workshop on Many-Task Computing on Clouds, Grids, and Supercomputers (MTAGS)

Extreme scale HPC systems are expected to reach exascale performance around the year 2020. While it is widely known that theses systems pose new challenges regarding energy efficiency of architectures, concurrency and resiliency, they also challenge developers of applications trying to efficiently utilizing resources: Managing parallel control flows, hardware resources and dependencies is a complex...

chapter

Author index

2016 9th Workshop on Many-Task Computing on Clouds, Grids, and Supercomputers (MTAGS) > 26

2016 9th Workshop on Many-Task Computing on Clouds, Grids, and Supercomputers (MTAGS)

Presents an index of the authors whose articles are published in the conference proceedings record.

chapter

[Copyright notice]

2016 9th Workshop on Many-Task Computing on Clouds, Grids, and Supercomputers (MTAGS) > ii

2016 9th Workshop on Many-Task Computing on Clouds, Grids, and Supercomputers (MTAGS)

chapter

Learning to Diagnose Stragglers in Distributed Computing

Cong Li, Huanxing Shen, Tai Huang

2016 9th Workshop on Many-Task Computing on Clouds, Grids, and Supercomputers (MTAGS) > 1 - 6

2016 9th Workshop on Many-Task Computing on Clouds, Grids, and Supercomputers (MTAGS)

In cloud computing and high performance computing, a large job is typically divided into many small tasks for parallel execution in a distributed environment. Due to different reasons, some tasks (so-called ‘stragglers’) are considerably slower than the others, delaying the completion of the job. We propose a new machine learning approach to automatically identify and diagnose the stragglers. To first...

chapter

[Title page]

2016 9th Workshop on Many-Task Computing on Clouds, Grids, and Supercomputers (MTAGS) > i

2016 9th Workshop on Many-Task Computing on Clouds, Grids, and Supercomputers (MTAGS)

Presents the title page of the proceedings record.

chapter

Clustering Based on Task Dependency for Data-Intensive Workflow Scheduling Optimization

Ei Ei Mon, Myint Myint Thein, May Thu Aung

2016 9th Workshop on Many-Task Computing on Clouds, Grids, and Supercomputers (MTAGS) > 20 - 25

2016 9th Workshop on Many-Task Computing on Clouds, Grids, and Supercomputers (MTAGS)

Scientists in each experiment team share their data and use distributed resources for conducting their experiments. These experiments are being accompanied in collaboration with teams that are globally dispersed. Scientific data need to be replicated or cached at distributed locations around the world. Data locality problem and transferred data overhead are important challenges for scheduling such...

INFONA - science communication portal

2016 9th Workshop on Many-Task Computing on Clouds, Grids, and Supercomputers (MTAGS)