2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC)

Items from 1 to 11 out of 11 results

chapter

Towards Online Application Cache Behaviors Identification in CMPs

Xiaomin Jia, Jiang Jiang, Tianlei Zhao, Shubo Qi, more

2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC) > 1 - 8

2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC 2010)

On chip multiprocessors (CMPs) platforms, multiple co-scheduled applications can severely degrade performance and quality of service (QoS) when they contend for last-level cache (LLC) resources. Whether an application will impose destructive interference on co-scheduled applications is largely dependent on its own inherent cache access behavior characteristics. In this work, we first present case...

chapter

A Scheduling Heuristic to Handle Local and Remote Memory in Cluster Computers

Mónica Serrano, Julio Sahuquillo, Houcine Hassan, Salvador Petit, more

2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC) > 35 - 42

2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC 2010)

In cluster computers, RAM memory is spread among the motherboards hosting the running applications. In these systems, it is common to constrain the memory address space of a given processor to the local motherboard. Constraining the system in this way is much cheaper than using a full-fledged shared memory implementation among motherboards. However, in this case, memory usage might widely differ among...

chapter

QuickTM: A Hardware Solution to a High Performance Unbounded Transactional Memory

S Sanyal, S Roy

2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC) > 62 - 71

2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC 2010)

Transactional Memory (TM) is an emerging technology which simplifies the concurrency control in a parallel program. In this paper we propose Quick TM, a new hardware transactional memory (HTM) architecture. It incorporates three features to address known bottlenecks in the existing HTM architectures. First, we propose hardware-only dynamic detection of true-shared variables. Our result shows that...

chapter

MPIActor - A Multicore-Architecture Adaptive and Thread-Based MPI Program Accelerator

Zhiqiang Liu, Kaijun Ren, Junqiang Song

2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC) > 98 - 107

2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC 2010)

Improving MPI foundational software to suit multicore systems is a key issue for developing effective parallel software on high performance communication domain. Towards this issue, in this paper, we propose a novel technique, called MPI Accelerator or MPIActor in short, which is a transparent middleware to enhance conventional MPI libraries. The main idea is to optimize MPI routines for multicore...

chapter

Enhancing Muesli's Data Parallel Skeletons for Multi-core Computer Architectures

P Ciechanowicz, H Kuchen

2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC) > 108 - 113

2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC 2010)

Algorithmic skeletons encapsulate typical parallel programming patterns such that they can be easily applied by users. Existing skeleton libraries usually work on distributed memory machines. We present an extension of our skeleton library Muesli which now allows to use the same application without modifications on a variety of parallel machines ranging from multi-processor distributed memory to many-core...

chapter

Virtual Application Appliances in Practice: Basic Mechanisms and Overheads

Erkan Unal, Paul Lu, Cam Macdonell

2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC) > 213 - 222

2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC 2010)

Virtual application appliances (VAA) (i.e., prebuilt virtual machines (VM) for specific scientific applications) are useful mechanisms to deal with the packaging of complex software systems and heterogeneous software environments (e.g., library version conflicts on different clusters and clouds). As an experience paper, we discuss some basic techniques for creating VAAs (e.g., virtual disk repositories...

chapter

A Novel Memory Subsystem Evaluation Framework for Chip Multiprocessors

Fucen Zeng, Lin Qiao, Mingliang Liu, Zhizhong Tang

2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC) > 231 - 238

2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC 2010)

This paper presents a fast and cycle-accurate memory subsystem modeling and evaluating framework for Chip Multiprocessors (CMPs), called TSIM (Tsinghua SIMulator), which gives a flexible and extensible approach to evaluating architecture designs, models or algorithms, including the network-on-chip interconnection, cache hardware prefetcher, memory system protocol, replacement policy, etc. TSIM is...

chapter

Developing a Parameterized Performance Proxy for Sequential Scientific Kernels

Hongzhang Shan, Erich Strohmaier

2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC) > 247 - 256

2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC 2010)

A simple, synthetic performance proxy for scientific applications is of great interest to the scientific computing community for the development of new products, procurements, and performance related questions in general. To develop such a performance proxy, we enhance the capability of the memory performance benchmark, Apex-MAP, by adding new concepts to capture the effects of computational details...

chapter

Performance Analysis of Scientific and Engineering Applications Using MPInside and TAU

S Saini, P Mehrotra, K Taylor, S Shende, more

2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC) > 265 - 272

2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC 2010)

In this paper, we present performance analysis of two NASA applications using performance tools like Tuning and Analysis Utilities (TAU) and SGI MP Inside. MITgcmUV and OVERFLOW are two production-quality applications used extensively by scientists and engineers at NASA. MITgcmUV is a global ocean simulation model, developed by the Estimating the Circulation and Climate of the Ocean (ECCO) Consortium,...

chapter

Analyzing and Modeling the Performance in Xen-Based Virtual Cluster Environment

Kejiang Ye, Xiaohong Jiang, Siding Chen, Dawei Huang, more

2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC) > 273 - 280

2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC 2010)

Virtualization technology is currently widely used due to its benefits on high resource utilization, flexible manageability and powerful system security. However, its use for high performance computing (HPC) is still not popular due to the unclearness of the virtualization overheads. It's worthy to evaluate the virtualization cost and to find the performance bottleneck when running HPC applications...

chapter

Evaluating Thread Placement Based on Memory Access Patterns for Multi-core Processors

Matthias Diener, Felipe L Madruga, Eduardo L Rodrigues, Marco A Z Alves, more

2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC) > 491 - 496

2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC 2010)

Process placement is a technique widely used on parallel machines with heterogeneous interconnects to reduce the overall communication time. For instance, two processes which communicate frequently are mapped close to each other. Finding the optimal mapping between threads and cores in a shared-memory environment (for example, OpenMP and Pthreads) is an even more complex task due to implicit communication...

Filter options

Keywords:
BENCHMARK TESTING

Publication date

Set your own date range

Keywords

MESSAGE PASSING (5)
SHARED MEMORY SYSTEMS (5)
INSTRUCTION SETS (4)
HARDWARE (3)
MESSAGE SYSTEMS (3)
BANDWIDTH (2)
CLUSTER COMPUTERS (2)
COMPUTER ARCHITECTURE (2)
HEURISTIC ALGORITHMS (2)
KERNEL (2)
LIBRARIES (2)
MEASUREMENT (2)
MESSAGE PASSING INTERFACE (2)
MULTI-THREADING (2)
MULTICORE PROCESSING (2)
NATURAL SCIENCES COMPUTING (2)
OPENMP (2)
PARALLEL ARCHITECTURES (2)
PARALLEL PROGRAMMING (2)
PERFORMANCE EVALUATION (2)
PROTOCOLS (2)
QUALITY OF SERVICE (2)
RESOURCE ALLOCATION (2)
SERVERS (2)
SHARED CACHE (2)
SOFTWARE LIBRARIES (2)
STORAGE MANAGEMENT (2)
VIRTUAL MACHINES (2)
ACCURACY (1)
AEROSPACE ELECTRONICS (1)
ALGORITHM DESIGN AND ANALYSIS (1)
ANALYSIS OF PERFORMANCE (1)
APEX-MAP (1)
APPLICATION CACHE BEHAVIORS (1)
APPROXIMATION METHODS (1)
ARRAYS (1)
AUTHENTICATION (1)
BIOINFORMATICS (1)
CACHE ACCESS BEHAVIOR (1)
CACHE ASSOCIATIVITY (1)
CACHE HARDWARE PREFETCHER (1)
CACHE PARTITIONING ALGORITHM (1)
CACHE PARTITIONING ALGORITHMS (1)
CACHE RESOURCE MANAGEMENT (1)
CACHE SPILLING TECHNIQUE (1)
CACHE STORAGE (1)
CACHE-COHERENCE (1)
CFD PROBLEM (1)
CHIP MULTI-PROCESSORS (CMPS) (1)
CHIP MULTIPROCESSOR PLATFORM (1)
CHIP MULTIPROCESSORS (1)
CLOUD COMPUTING (1)
CLUSTER COMPUTING (1)
CMP (1)
COMMUNICATION (1)
COMMUNICATION TIME (1)
COMPLEX SOFTWARE SYSTEM PACKAGING (1)
COMPUTATIONAL FLUID DYNAMICS (1)
COMPUTATIONAL MODELING (1)
COMPUTE TIME (1)
COMPUTERS (1)
CONCURRENCY CONTROL (1)
CONFIGURATION MANAGEMENT (1)
COPY OVER SHARED MEMORY MECHANISM (1)
DATA HANDLING (1)
DATA MOVEMENT (1)
DATA PARALLEL SKELETON (1)
DATA SHARING PATTERN (1)
DEGRADATION (1)
DIGITAL STORAGE (1)
DISTRIBUTED COMPUTING (1)
DISTRIBUTED DATABASES (1)
DISTRIBUTED MEMORY MACHINES (1)
DISTRIBUTED MEMORY SYSTEMS (1)
DRIVER CIRCUITS (1)
DYNAMIC SEPARATION (1)
ENGINEERING APPLICATION (1)
EXECUTION SCRIPTING (1)
FLEXIBLE MANAGEABILITY (1)
FLUID MOTION EQUATION (1)
GAFOLDER (1)
GENERAL-PURPOSE NAVIER-STOKES SOLVER (1)
GEOPHYSICS COMPUTING (1)
GLOBAL OCEAN SIMULATION MODEL (1)
GROMACS (1)
GUEST MACHINE (1)
HARDWARE TRANSACTIONAL MEMORY (1)
HARDWARE TRANSACTIONAL MEMORY ARCHITECTURE (1)
HARDWARE-ONLY DYNAMIC DETECTION (1)
HETEROGENEOUS INTERCONNECT (1)
HETEROGENEOUS SOFTWARE ENVIRONMENT (1)
HIGH PERFORMANCE COMMUNICATION DOMAIN (1)
HIGH PERFORMANCE COMPUTING (1)
HIGH PERFORMANCE UNBOUNDED TRANSACTIONAL MEMORY (1)
HMMER (1)
HOST MACHINE (1)
HPC (1)
HYDROSTATIC APPROXIMATION (1)
HYDROSTATICS (1)
more

INFONA - science communication portal

2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC) $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC)