Euro-Par 2001 Parallel Processing

chapter

The Anatomy of the Grid: Enabling Scalable Virtual Organizations

Ian Foster

Lecture Notes in Computer Science > Euro-Par 2001 Parallel Processing > Invited Talks > 1-4

The term “the Grid” was coined in the mid-1990s to denote a proposed distributed computing infrastructure for advanced science and engineering [4]. Considerable progress has since been made on the construction of such an infrastructure (e.g., [1,6,7) but the term “Grid” has also been conflated, at least in popular perception, to embrace everything from advanced networking to artificial intelligence...

chapter

Software Component Technology for High Performance Parallel and Grid Computing

Dennis Gannon

Lecture Notes in Computer Science > Euro-Par 2001 Parallel Processing > Invited Talks > 5-5

A software component framework is one where an application designer programs by composing well understood and tested “components”e rather than writing large volumes of not-very-reusable code. The software industry has been using component technology to build desktop applications for about ten years now. More recently this idea has been extended to application in distributed systems with frameworks...

chapter

Macro- and Micro-parallelism in a DBMS

Martin Kersten, Stefan Manegold, Peter Boncz, Niels Nes

Lecture Notes in Computer Science > Euro-Par 2001 Parallel Processing > Invited Talks > 6-15

Large memories have become an affordable storage medium for databases involving hundreds of Gigabytes on multi-processor systems. In this short note, we review our research on building relational engines to exploit this major shift in hardware perspective. It illustrates that key design issues related to parallelism poses architectural problems at all levels of a system architecture and whose impact...

chapter

An Introduction to the Gilgamesh PIM Architecture

Thomas Sterling

Lecture Notes in Computer Science > Euro-Par 2001 Parallel Processing > Invited Talks > 16-32

Throughout the history of computer implementation, the technologies employed for logic to build ALUs and the technologies employed to realize high speed and high-density storage for main memory have been disparate, requiring different fabrication techniques. This was certainly true at the beginning of the era of electronic digital computers where logic was constructed from vacuum tubes and main memory...

chapter

High Performance Computing and Trends: Connecting Computational Requirements with Computing Resources

Jack Dongarra

Lecture Notes in Computer Science > Euro-Par 2001 Parallel Processing > Invited Talks > 33-33

Today networking, distributed computing, and parallel computation research have matured to make it possible for distributed systems to support high-performance applications, but: Resources are dispersed, Connectivity is variable, Dedicated access is not possible. In this talk we advocate the‘Computational Grids’ to support ‘large-scale’ applications. These must provide transparent access to...

chapter

Support Tools and Environments

Michael Gerndt

Lecture Notes in Computer Science > Euro-Par 2001 Parallel Processing > Topic 1 > 34-35

Parallel computing is a key technology for many areas in science and industry. Outstanding examples are the ASCI and Blue Gene programs that target only very few but critical applications. A much broader spectrum of applications can be found on any of the machines of supercomputing centers all over the world.

chapter

Dynamic Performance Tuning Environment

Anna Morajko, Eduardo César, Tomás Margalef, Joan Sorribes, more

Lecture Notes in Computer Science > Euro-Par 2001 Parallel Processing > Topic 1 > 36-45

Performance nalysis nd tuning of parallel/distributed applications re very difficult tasks for non-expert programmers. It is necessary to provide tools that utomatically carry out these tasks. Many pplications have different behavior ccording to the input data set or even change their behavior dynamically during the execution. Therefore, it is necessary that the performance tuning can be done on the...

chapter

Self-Organizing Hierarchical Cluster Timestamps

Paul A. S. Ward, David J. Taylor

Lecture Notes in Computer Science > Euro-Par 2001 Parallel Processing > Topic 1 > 46-56

Distributed-system observation tools require an efficient data structure to store and query the partial-order of execution. Such data structures typically use vector timestamps to efficiently answer precedence queries. Many current vector-timestamp algorithms either have a poor time/space complexity tradeoff or are static. This limits the scalability of such observation tools. One algorithm, centralized...

chapter

A Tool for Binding Threads to Processors

Magnus Broberg, Lars Lundberg, Håakan Grahn

Lecture Notes in Computer Science > Euro-Par 2001 Parallel Processing > Topic 1 > 57-61

Many multiprocessor systems are based on distributed shared memory. It is often important to statically bind threads to processors in order to avoid remote memory access, due to performance. Finding a good allocation takes long time and it is hard to know when to stop searching for a better one. It is sometimes impossible to run the application on the target machine. The developer needs a tool that...

chapter

VizzScheduler - A Framework for the Visualization of Scheduling Algorithms

Welf Löwe, Alex Liebrich

Lecture Notes in Computer Science > Euro-Par 2001 Parallel Processing > Topic 1 > 62-66

Efficient scheduling of task graphs for parallel machines is a major issue in parallel computing. Such algorithms are often hard to understand and hard to evaluate. We present a framework for the visualization of scheduling algorithms. Using the LogP cost model for parallel machines, we simulate the effects of scheduling algorithms for specific target machines and task graphs before performing time...

chapter

A Distributed Object Infrastructure for Interaction and Steering

Rajeev Muralidhar, Manish Parashar

Lecture Notes in Computer Science > Euro-Par 2001 Parallel Processing > Topic 1 > 67-75

This paper presents the design, implementation and experimental evaluation of DIOS, an infrastructure for enabling the runtime monitoring and computational steering of parallel and distributed applications. DIOS enables existing application objects (data structures) to be enhanced with sensors and actuators so that they can be interrogated and controlled at runtime. Application objects can be distributed...

chapter

Checkpointing Facility on a Metasystem

Yudith Cardinale, Emilio Hernández

Lecture Notes in Computer Science > Euro-Par 2001 Parallel Processing > Topic 1 > 75-79

A metasystem allows seamless access to a collection of distributed computational resources. Checkpointing is an important service in high throughput computing, especially for process migration and recovery after system crash. This article describes the experiences on incorporating checkpointing and recovery facilities in a Java-based metasystem. Our case study is suma, a metasystem for execution of...

chapter

Optimising the MPI Library for the T3E

Stephen Booth

Lecture Notes in Computer Science > Euro-Par 2001 Parallel Processing > Topic 1 > 80-83

This paper describes an optimised MPI library for the T3E.¹ Previous versions of MPI for the T3E were built on top of the SHMEM interface. This paper describes an optimised version that also uses additional capabilities of the low-level communication hardware.

chapter

Performance Evaluation and Prediction

Allen D. Malony, Graham D. Riley, Bernd Mohr, Mark Bull, more

Lecture Notes in Computer Science > Euro-Par 2001 Parallel Processing > Topic 2 > 84-85

The performance of parallel and distributed systems and applications — its evaluation, analysis, and optimization — is at once a fundamental topic for research investigation and a technological problem that requires innovations in tools and techniques to keep pace with system and application evolution. This dual view of performance “science” and performance “technology” jointly spans broad fields...

chapter

Optimal Polling for Latency-Throughput Tradeoffs in Queue-Based Network Interfaces for Clusters

Dmitry Ponomarev, Kanad Ghose, Eugeny Saksonov

Lecture Notes in Computer Science > Euro-Par 2001 Parallel Processing > Topic 2 > 86-95

We consider a networking subsystem for message-passing clusters that uses two unidirectional queues for data transfers between the network interface card (NIC) and the lower protocol layers, with polling as the primary mechanism for reading data off these queues. We suggest that for accurate mathematical analysis of such an organization, the values of the system’s states probabilities have to be taken...

chapter

Performance Prediction of Oblivious BSP Programs

Jesús A. González, Coromoto León, Fabiana Piccoli, Marcela Printista, more

Lecture Notes in Computer Science > Euro-Par 2001 Parallel Processing > Topic 2 > 96-105

The BSP model can be extended with a zero cost synchronization mechanism, which can be used when the number of messages due to receives is known. This mechanism, usually known as“oblivious synchronization” implies that different processors can be in different supersteps at the same time. An unwanted consequence of this software improvement is a loss of accuracy in prediction. This paper proposes an...

chapter

Performance Prediction of Data-Dependent Task Parallel Programs

Hasyim Gautama, Arjan J. C. Gemund

Lecture Notes in Computer Science > Euro-Par 2001 Parallel Processing > Topic 2 > 106-116

Current analytic solutions to the execution time prediction Y of binary parallel compositions of tasks with arbitrary execution time distributions X ₁ and X ₂ are either computationally complex or very inaccurate. In this paper we introduce an analytical approach based on the use of lambda distributions to approximate execution...

chapter

The Tuning Problem on Pipelines

Luz Marina Moreno, Francisco Almeida, Daniel González, Casiano Rodríguez

Lecture Notes in Computer Science > Euro-Par 2001 Parallel Processing > Topic 2 > 117-121

Performance analysis and prediction is an important factor determining the efficiency of parallel programs. Considerable efforts have been made both in pure theoretical analysis and in practical automatic profiling. Unfortunately, contributions in one area seem to ignore the results of the other.We introduce a general performance prediction methodology based on the integration of analytical models...

chapter

The Hardware Performance Monitor Toolkit

Luiz A. DeRose

Lecture Notes in Computer Science > Euro-Par 2001 Parallel Processing > Topic 2 > 122-132

In this paper we present the Hardware Performance Monitor (HPM) Toolkit, a language independent performance analysis and visualization system developed for performance measurements of applications running on the IBM Power 3 with AIX and on Intel clusters with Linux. The HPM Toolkit supports analysis of applications written in Fortran, C, and C++. It was designed to collect hardware events with low...

chapter

VIA Communication Performance on a Gigabit Ethernet Cluster

Mark Baker, Paul A. Farrell, Hong Ong, Stephen L Scott

Lecture Notes in Computer Science > Euro-Par 2001 Parallel Processing > Topic 2 > 132-142

As the technology for high-speed networks has evolved over the last decade, the interconnection of commodity computers (e.g., PCs and workstations) at gigabit rates has become a reality. However, the improved performance of high-speed networks has not been matched so far by a proportional improvement in the ability of the TCP/IP protocol stack. As a result the Virtual Interface Architecture (VIA)...

INFONA - science communication portal

Euro-Par 2001 Parallel Processing
7th International Euro-Par Conference Manchester, UK, August 28–31, 2001 Proceedings

The Anatomy of the Grid: Enabling Scalable Virtual Organizations

Software Component Technology for High Performance Parallel and Grid Computing

Macro- and Micro-parallelism in a DBMS

An Introduction to the Gilgamesh PIM Architecture

High Performance Computing and Trends: Connecting Computational Requirements with Computing Resources

Support Tools and Environments

Dynamic Performance Tuning Environment

Self-Organizing Hierarchical Cluster Timestamps

A Tool for Binding Threads to Processors

VizzScheduler - A Framework for the Visualization of Scheduling Algorithms

A Distributed Object Infrastructure for Interaction and Steering

Checkpointing Facility on a Metasystem

Optimising the MPI Library for the T3E

Performance Evaluation and Prediction

Optimal Polling for Latency-Throughput Tradeoffs in Queue-Based Network Interfaces for Clusters

Performance Prediction of Oblivious BSP Programs

Performance Prediction of Data-Dependent Task Parallel Programs

The Tuning Problem on Pipelines

The Hardware Performance Monitor Toolkit

VIA Communication Performance on a Gigabit Ethernet Cluster

Filter options

Publication date

Content availability

Publication language

Keywords

INFONA - science communication portal

Euro-Par 2001 Parallel Processing 7th International Euro-Par Conference Manchester, UK, August 28–31, 2001 Proceedings $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Publication language

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

Euro-Par 2001 Parallel Processing
7th International Euro-Par Conference Manchester, UK, August 28–31, 2001 Proceedings