Search results

chapter

Combining Grid and Cloud Resources by Use of Middleware for SPMD Applications

Brian Amedro, Francoise Baude, Fabrice Huet, Elton Mathias

2010 IEEE Second International Conference on Cloud Computing Technology and Science > 177 - 184

2010 IEEE 2nd International Conference on Cloud Computing Technology and Science (CloudCom 2010)

Distributed computing environments have evolved from in-house clusters to Grids and now Cloud platforms. We, as others, provide HPC benchmarks results over Amazon EC2 that show a lower performance of Cloud resources compared to private resources., So, it is not yet clear how much of impact Clouds will have in high performance computing (HPC). But hybrid Grid/Cloud computing may offer opportunities...

chapter

Analyzing and Modeling the Performance in Xen-Based Virtual Cluster Environment

Kejiang Ye, Xiaohong Jiang, Siding Chen, Dawei Huang, more

2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC) > 273 - 280

2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC 2010)

Virtualization technology is currently widely used due to its benefits on high resource utilization, flexible manageability and powerful system security. However, its use for high performance computing (HPC) is still not popular due to the unclearness of the virtualization overheads. It's worthy to evaluate the virtualization cost and to find the performance bottleneck when running HPC applications...

chapter

Power-Efficient Work Distribution Method for CPU-GPU Heterogeneous System

Guibin Wang, Xiaoguang Ren

International Symposium on Parallel and Distributed Processing with Applications > 122 - 129

2010 International Symposium on Parallel and Distributed Processing with Applications (ISPA 2010)

As the system scales up continuously, the problem of power consumption for high performance computing (HPC) system becomes more severe. Heterogeneous system integrating two or more kinds of processors, could be better adapted to heterogeneity in applications and provide much higher energy efficiency in theory. Many studies have shown heterogeneous system is preferable on energy consumption to homogeneous...

chapter

PIR: PMaC's Idiom Recognizer

C Olschanowsky, A Snavely, M R Meswani, L Carrington

2010 39th International Conference on Parallel Processing Workshops > 189 - 196

2010 39th International Conference on Parallel Processing Workshops (ICPPW)

The speed of the memory subsystem often constrains the performance of large-scale parallel applications. Experts tune such applications to use hierarchical memory subsystems efficiently. Hardware accelerators, such as GPUs, can potentially improve memory performance beyond the capabilities of traditional hierarchical systems. However, the addition of such specialized hardware complicates code porting...

chapter

Improving Application Performance and Predictability Using Multiple Virtual Lanes in Modern Multi-core InfiniBand Clusters

H Subramoni, Ping Lai, S Sur, D K Panda

2010 39th International Conference on Parallel Processing > 462 - 471

39th International Conference on Parallel Processing (ICPP 2010)

Network congestion is an important factor affecting the performance of large scale jobs in supercomputing clusters, especially with the wide deployment of multi-core processors. The blocking nature of current day collectives makes such congestion a critical factor in their performance. On the other hand, modern interconnects like InfiniBand provide us with many novel features such as Virtual Lanes...

chapter

Implementation and Performance Evaluation of XcalableMP: A Parallel Programming Language for Distributed Memory Systems

Jinpil Lee, Mitsuhisa Sato

2010 39th International Conference on Parallel Processing Workshops > 413 - 420

2010 39th International Conference on Parallel Processing Workshops (ICPPW)

Although MPI is a de-facto standard for parallel programming on distributed memory systems, writing MPI programs is often a time-consuming and complicated process. XcalableMP is a language extension of C and Fortran for parallel programming on distributed memory systems that helps users to reduce those programming efforts. XcalableMP provides two programming models. The first one is the global view...

chapter

HiAL-Ckpt: A hierarchical application-level checkpointing for CPU-GPU hybrid systems

Xinhai Xu, Yufei Lin, Tao Tang, Yisong Lin

2010 5th International Conference on Computer Science&Education > 1895 - 1899

2010 5th International Conference on Computer Science & Education (ICCSE 2010)

In light of its powerful computing capacity and high energy efficiency, GPU (graphics processing unit) has become a focus in the research field of HPC (High Performance Computing). CPU-GPU heterogeneous parallel systems have become a new development trend of super-computer. However, the inherent unreliability of the GPU hardware deteriorates the reliability of super-computer. We have researched on...

chapter

Exploring Best Practices for the DSRCs with Benchmarking

Robert Rosenberg, Stephen Bique, Matt Koop, Kris Andersen, more

2010 DoD High Performance Computing Modernization Program Users Group Conference > 498 - 507

2010 DoD High Performance Computing Modernization Program Users Group Conference (HPCMP-UGC)

With the introduction of the TI-09 platforms at the DoD Supercomputing Resource Centers (DSRCs), users now have access to significantly faster and larger systems for their computations, including the first systems with greater than 10,000 cores. We wanted to benchmark these latest systems and compare to the older platforms while assessing their computational environments.

chapter

A Service for Virtual Cluster Performance Evaluation

M Rak, A Cuomo, U Villano

2010 19th IEEE International Workshops on Enabling Technologies: Infrastructures for Collaborative Enterprises > 249 - 251

2010 19th IEEE International Workshop On Enabling Technologies: Infrastructures For Collaborative Enterprises (WETICE)

Virtualization overhead is the main reason for the slow diffusion of virtualization techniques into high-performance computing environments. This paper discusses the issues linked to the performance evaluation of virtual clusters, and presents the implementation of a service that automates the benchmarking and results collection procedure in an existing cloud-on-GRID environment.

chapter

High performance solid state storage under Linux

E Seppanen, M T O'Keefe, D J Lilja

2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST) > 1 - 12

2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST 2010)

Solid state drives (SSDs) allow single-drive performance that is far greater than disks can produce. Their low latency and potential for parallel operations mean that they are able to read and write data at speeds that strain operating system I/O interfaces. Additionally, their performance characteristics expose gaps in existing benchmarking methodologies. We discuss the impact on Linux system design...

chapter

Team-Based Message Logging: Preliminary Results

Esteban Meneses, Celso L. Mendes, Laxmikant V. Kalé

2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing > 697 - 702

2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing (CCGrid)

Fault tolerance will be a fundamental imperative in the next decade as machines containing hundreds of thousands of cores will be installed at various locations. In this context, the traditional checkpoint/restart model does not seem to be a suitable option, since it makes all the processors roll back to their latest checkpoint in case of a single failure in one of the processors. In-memory message...

chapter

Evaluating MPI Implementations Using HPL on an Infiniband Nehalem Linux Cluster

Mohamad Sindi

2010 Seventh International Conference on Information Technology: New Generations > 19 - 25

Seventh International Conference on Information Technology: New Generations (ITNG 2010)

In conjunction with Moore's Law, computer speeds are expected to double approximately every two years, but with the current challenges that computer manufacturers are facing to double speeds of individual processors, due to various reasons, such as processor temperatures, multiprocessor architectures have become more popular nowadays. Eventually, this has led to an increased interest in standards...

chapter

A Case for FPGA Based Accelerated Communication

Holger Fröning, Mondrian Nüssle, Heiner Litz, Ulrich Brüning

2010 Ninth International Conference on Networks > 28 - 33

2010 Ninth International Conference on Networks (ICN 2010)

The use of Field Programmable Gate Arrays (FPGAs) in the area of High Performance Computing (HPC) to accelerate computations is well known. We present here a case where FPGAs can be used to speed up communication instead of computation. Current interconnects for HPC are in particular missing support for fine grain communication, which is increasingly found in various applications. In order to overcome...

chapter

Characterizing energy efficiency of I/O intensive parallel applications on power-aware clusters

Rong Ge, Xizhou Feng, Sindhu Subramanya, Xian-he Sun

2010 IEEE International Symposium on Parallel&Distributed Processing, Workshops and Phd Forum (IPDPSW) > 1 - 8

2010 IEEE International Symposium on Parallel & Distributed Processing, Workshops and Phd Forum (IPDPSW 2010)

Energy efficiency and parallel I/O performance have become two critical measures in high performance computing (HPC). However, there is little empirical data that characterize the energy-performance behaviors of parallel I/O workload. In this paper, we present a methodology to profile the performance, energy, and energy efficiency of parallel I/O access patterns and report our findings on the impacting...

chapter

Investigation of Factors Impacting Thread-Level Parallelism from Desktop, Multimedia and HPC Applications

Yaobin Wang, Hong An, Jie Yan, Qi Li, more

2009 Fourth International Conference on Frontier of Computer Science and Technology > 27 - 32

2009 Fourth International Conference on Frontier of Computer Science and Technology (FCST 2009)

Applications of different categories contain varying levels of data, instruction and thread-level parallelism inherently. It's important to explore the potential coarse-grain thread-level parallelism in different applications to guide the computing resources allocation problem in multicore chips. Up to now, lots of depth researches have been mainly concentrated in the desktop applications. In order...

chapter

Performance Evaluation of Parallel Programming in Virtual Machine Environment

Cong Xu, Yuebin Bai, Cheng Luo

2009 Sixth IFIP International Conference on Network and Parallel Computing > 140 - 147

2009 Sixth IFIP International Conference on Network and Parallel Computing. NPC 2009

As multi-core processors become increasingly mainstream, architects have likewise become more interested in how best to make use of the computing capacity of the CPU, for instance, through multiple simultaneous threads or processes of execution with OpenMP or MPI. At the same time, the increasingly mature and prevailing virtualization technique in server consolidation and HPC promotes the emergence...

chapter

A quantitative analysis of high performance computing with Amazon's EC2 infrastructure: The death of the local cluster?

Z. Hill, M. Humphrey

2009 10th IEEE/ACM International Conference on Grid Computing > 26 - 33

2009 10th IEEE/ACM International Conference on Grid Computing (GRID)

The introduction of affordable infrastructure on demand, specifically Amazon's Elastic Compute Cloud (EC2), has had a significant impact in the business IT community and provides reasonable and attractive alternatives to locally-owned infrastructure. For scientific computation however, the viability of EC2 has come into question due to its use of virtualization and network shaping and the performance...

chapter

Memory Affinity for Hierarchical Shared Memory Multiprocessors

C.P. Ribeiro, J.-F. Mehaut, A. Carissimi, M. Castro, more

2009 21st International Symposium on Computer Architecture and High Performance Computing > 59 - 66

2009 21st International Symposium on Computer Architecture and High Performance Computing. SBAC-PAD 2009

Currently, parallel platforms based on large scale hierarchical shared memory multiprocessors with Non-Uniform Memory Access (NUMA) are becoming a trend in scientific High Performance Computing (HPC). Due to their memory access constraints, these platforms require a very careful data distribution. Many solutions were proposed to resolve this issue. However, most of these solutions did not include...

chapter

Performance Characterization of a Hierarchical MPI Implementation on Large-scale Distributed-memory Platforms

S.R. Alam, R. Barrett, J. Kuehn, S. Poole

2009 International Conference on Parallel Processing > 132 - 139

2009 International Conference on Parallel Processing (ICPP 2009)

The building blocks of emerging Petascale massively parallel processing (MPP) systems are multi-core processors with four or more cores as a single processing element and a customized network interface. The resulting memory and communication hierarchy of these platforms are now exposed to application developers and end users by creating a hierarchical or multi-core aware message-passing (MPI) programming...

chapter

An Effective Methodology to Multi-objective Design of Application Domain-specific Embedded Architectures

V. Catania, A. Nuovo, M. Palesi, D. Patti, more

2009 12th Euromicro Conference on Digital System Design, Architectures, Methods and Tools > 643 - 650

2009 12th EUROMICRO Conference on Digital System Design, Architectures, Methods and Tools (DSD 2009)

Today's computer systems have become unbelievably complex. Nowadays register-level design is an overwhelming task, especially in the embedded system area where the time-to-market is very short. Platform based design shifts the challenge on how to tune parametric platforms to achieve the best performance at the smallest cost. This task, called multi-objective design space exploration, requires accurate...

INFONA - science communication portal

Search results

Combining Grid and Cloud Resources by Use of Middleware for SPMD Applications

Analyzing and Modeling the Performance in Xen-Based Virtual Cluster Environment

Power-Efficient Work Distribution Method for CPU-GPU Heterogeneous System

PIR: PMaC's Idiom Recognizer

Improving Application Performance and Predictability Using Multiple Virtual Lanes in Modern Multi-core InfiniBand Clusters

Implementation and Performance Evaluation of XcalableMP: A Parallel Programming Language for Distributed Memory Systems

HiAL-Ckpt: A hierarchical application-level checkpointing for CPU-GPU hybrid systems

Exploring Best Practices for the DSRCs with Benchmarking

A Service for Virtual Cluster Performance Evaluation

High performance solid state storage under Linux

Team-Based Message Logging: Preliminary Results

Evaluating MPI Implementations Using HPL on an Infiniband Nehalem Linux Cluster

A Case for FPGA Based Accelerated Communication

Characterizing energy efficiency of I/O intensive parallel applications on power-aware clusters

Investigation of Factors Impacting Thread-Level Parallelism from Desktop, Multimedia and HPC Applications

Performance Evaluation of Parallel Programming in Virtual Machine Environment

A quantitative analysis of high performance computing with Amazon's EC2 infrastructure: The death of the local cluster?

Memory Affinity for Hierarchical Shared Memory Multiprocessors

Performance Characterization of a Hierarchical MPI Implementation on Large-scale Distributed-memory Platforms

An Effective Methodology to Multi-objective Design of Application Domain-specific Embedded Architectures

Filter options

Publication date

Publication type

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options