Search results for: R Rabenseifner

Items from 1 to 6 out of 6 results

chapter

Synchronizing the Timestamps of Concurrent Events in Traces of Hybrid MPI/OpenMP Applications

D Becker, M Geimer, R Rabenseifner, F Wolf

2010 IEEE International Conference on Cluster Computing > 38 - 47

2010 IEEE International Conference on Cluster Computing (CLUSTER 2010)

Event traces are helpful in understanding the performance behavior of parallel applications since they allow the in-depth analysis of communication and synchronization patterns. However, the absence of synchronized clocks on most cluster systems may render the analysis ineffective because inaccurate relative event timings may misrepresent the logical event order and lead to errors when quantifying...

chapter

Hybrid MPI/OpenMP Parallel Programming on Clusters of Multi-Core SMP Nodes

R. Rabenseifner, G. Hager, G. Jost

2009 17th Euromicro International Conference on Parallel, Distributed and Network-based Processing > 427 - 436

2009 17th Euromicro International Conference on Parallel, Distributed and Network-based Processing

Today most systems in high-performance computing (HPC) feature a hierarchical hardware design: Shared memory nodes with several multi-core CPUs are connected via a network infrastructure. Parallel programming must combine distributed memory parallelization on the node interconnect with shared memory parallelization inside each node. We describe potentials and challenges of the dominant programming...

chapter

Implications of non-constant clock drifts for the timestamps of concurrent events

D. Becker, R. Rabenseifner, F. Wolf

2008 IEEE International Conference on Cluster Computing > 59 - 68

2008 IEEE International Conference on Cluster Computing (CLUSTER)

To support the development of efficient parallel codes on cluster systems, event tracing is a widely used technique with a broad spectrum of applications ranging from performance analysis, performance prediction and modeling to debugging. Usually, events are recorded along with the time of their occurrence to measure the temporal distance between them and/or to establish a total event ordering. Obviously,...

chapter

Replay-Based Synchronization of Timestamps in Event Traces of Massively Parallel Applications

D. Becker, J.C. Linford, R. Rabenseifner, F. Wolf

2008 International Conference on Parallel Processing - Workshops > 212 - 219

2008 International Conference on Parallel Processing Workshops (ICPP-W)

Event traces are helpful in understanding the performance behavior of message-passing applications since they allow in-depth analyses of communication and synchronization patterns. However, the absence of synchronized hardware clocks may render the analysis ineffective because inaccurate relative event timings can misrepresent the logical event order and lead to errors when quantifying the impact...

chapter

Parallel I/O Performance Characterization of Columbia and NEC SX-8 Superclusters

S. Saini, D. Talcott, Rajeev Thakur, P. Adamidis, more

2007 IEEE International Parallel and Distributed Processing Symposium > 1 - 10

2007 IEEE International Parallel and Distributed Processing Symposium

Many scientific applications running on today's supercomputers deal with increasingly large data sets and are correspondingly bottlenecked by the time it takes to read or write the data from/to the file system. We therefore undertook a study to characterize the parallel I/O performance of two of today's leading parallel supercomputers: the Columbia system at NASA Ames Research Center and the NEC SX-8...

chapter

Performance evaluation of supercomputers using HPCC and IMB benchmarks

S. Saini, R. Ciotti, B.T.N. Gunney, T.E. Spelce, more

Proceedings 20th IEEE International Parallel &amp; Distributed Processing Symposium > 8 pp.

Proceedings. 20th International Parallel and Distributed Processing Symposium

The HPC Challenge (HPCC) benchmark suite and the Intel MPI Benchmark (IMB) are used to compare and evaluate the combined performance of processor, memory subsystem and interconnect fabric of five leading supercomputers - SGI Altix BX2, Cray XI, Cray Opteron Cluster, Dell Xeon cluster, and NEC SX-8. These five systems use five different networks (SGI NUMALINK4, Cray network, Myrinet, InfiniBand, and...

Filter options

Publication date

Set your own date range

Keywords

CLOCKS (3)
INTERPOLATION (3)
MESSAGE PASSING (3)
PARALLEL PROCESSING (3)
SYNCHRONIZATION (3)
TIMESTAMPS (3)
ALGORITHM DESIGN AND ANALYSIS (2)
EVENT TRACING (2)
HARDWARE (2)
HYBRID PROGRAMMING (2)
LINEAR OFFSET INTERPOLATION (2)
PARALLEL MACHINES (2)
PERFORMANCE BEHAVIOR (2)
PERFORMANCE EVALUATION (2)
PROGRAM PROCESSORS (2)
SOFTWARE PERFORMANCE EVALUATION (2)
WORKSTATION CLUSTERS (2)
ACCURACY (1)
APPLICATION PROGRAM INTERFACES (1)
BENCHMARK TESTING (1)
CLUSTER SYSTEM (1)
COLUMBIA SYSTEM (1)
COMMUNICATION PATTERNS (1)
COMMUNICATION PATTERNS ANALYSIS (1)
COMPUTATIONAL MODELING (1)
CONCURRENCY CONTROL (1)
CONCURRENT EVENT (1)
CONCURRENT EVENTS (1)
CRAY NETWORK (1)
CRAY OPTERON CLUSTER (1)
CRAY XI (1)
DATA MINING (1)
DELL XEON CLUSTER (1)
DISTRIBUTED MEMORY PARALLELIZATION (1)
DISTRIBUTED MEMORY SYSTEMS (1)
EVENT TIMINGS (1)
FILE SYSTEM (1)
HIGH PRODUCTIVITY COMPUTING SYSTEMS (1)
HIGH-PERFORMANCE COMPUTING (1)
HPC CHALLENGE BENCHMARK (1)
HYBRID MPI/OPENMP APPLICATIONS (1)
INFINIBAND (1)
INPUT-OUTPUT PROGRAMS (1)
INTEL MPI BENCHMARK (1)
INTERCONNECT FABRIC (1)
INTERPROCESS TIMINGS POSTMORTEM (1)
LOAD BALANCE IMPROVEMENT (1)
LOGICAL EVENT ORDER (1)
LOGICAL EVENT ORDER POSTMORTEM (1)
MAGNETIC CORES (1)
MASSIVELY PARALLEL APPLICATIONS (1)
MEMORY SUBSYSTEM (1)
MESSAGE PASSING INTERFACE (1)
MESSAGE SYSTEMS (1)
MESSAGE-PASSING APPLICATIONS (1)
MESSAGE-PASSING EVENT SEMANTICS (1)
MPI (1)
MPI-OPENMP PARALLEL PROGRAMMING (1)
MULTI-CORE (1)
MULTICORE SMP NODES (1)
MYRINET (1)
NATURAL SCIENCES COMPUTING (1)
NEC IXS (1)
NEC SX-8 (1)
NEC SX-8 SUPERCLUSTERS (1)
NETWORK OPERATING SYSTEMS (1)
NONCONSTANT CLOCK DRIFTS (1)
OPENMP (1)
PARALLEL APPLICATIONS (1)
PARALLEL CODE (1)
PARALLEL I/O PERFORMANCE CHARACTERIZATION (1)
PARALLEL PROGRAMMING (1)
PARALLEL SUPERCOMPUTERS (1)
PROGRAM DIAGNOSTICS (1)
PROGRAMMING (1)
RADIATION DETECTORS (1)
REPLAY-BASED SYNCHRONIZATION (1)
RESOURCE ALLOCATION (1)
SCALASCA TRACE-ANALYSIS FRAMEWORK (1)
SEMANTICS (1)
SERVERS (1)
SGI ALTIX BX2 (1)
SGI NUMALINK4 (1)
SHARED MEMORY PARALLELIZATION (1)
SHARED MEMORY SYSTEMS (1)
SHARED-MEMORY EVENT SEMANTICS (1)
SMP (1)
SOCKETS (1)
SOFTWARE (1)
SUPERCOMPUTERS (1)
SYNCHRONISATION (1)
SYNCHRONIZATION PATTERNS (1)
SYNCHRONIZATION PATTERNS ANALYSIS (1)
SYNCHRONIZED CLOCKS (1)
TIME-LINE VISUALIZATION TOOLS (1)
TIMING (1)
TOPOLOGY (1)
TOTAL EVENT ORDERING (1)
more

INFONA - science communication portal

Search results for: R Rabenseifner

Synchronizing the Timestamps of Concurrent Events in Traces of Hybrid MPI/OpenMP Applications

Hybrid MPI/OpenMP Parallel Programming on Clusters of Multi-Core SMP Nodes

Implications of non-constant clock drifts for the timestamps of concurrent events

Replay-Based Synchronization of Timestamps in Event Traces of Massively Parallel Applications

Parallel I/O Performance Characterization of Columbia and NEC SX-8 Superclusters

Performance evaluation of supercomputers using HPCC and IMB benchmarks

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options