Computer Architecture and High Performance Computing (SBAC-PAD), 2012 IEEE 24th International Symposium on

Network design aspects that influence cost and performance can be classified according to their distance from the applications, into issues concerning topology, switch technology, link technology, network adapter, and communication library. The network adapter has a privileged position to take decisions with more global information than any other component in the network. It receives feedback from...

chapter

HAT: Heterogeneous Adaptive Throttling for On-Chip Networks

Kevin Kai-Wei Chang, Rachata Ausavarungnirun, Chris Fallin, Onur Mutlu

2012 IEEE 24th International Symposium on Computer Architecture and High Performance Computing > 9 - 18

2012 24th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD)

The network-on-chip (NoC) is a primary shared resource in a chip multiprocessor (CMP) system. As core counts continue to increase and applications become increasingly data-intensive, the network load will also increase, leading to more congestion in the network. This network congestion can degrade system performance if the network load is not appropriately controlled. Prior works have proposed source-throttling...

chapter

On the Efficiency of Register File versus Broadcast Interconnect for Collective Communications in Data-Parallel Hardware Accelerators

Ardavan Pedram, Andreas Gerstlauer, Robert A. van de Geijn

2012 IEEE 24th International Symposium on Computer Architecture and High Performance Computing > 19 - 26

2012 24th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD)

Reducing power consumption and increasing efficiency is a key concern for many applications. How to design highly efficient computing elements while maintaining enough flexibility within a domain of applications is a fundamental question. In this paper, we present how broadcast buses can eliminate the use of power hungry multi-ported register files in the context of data-parallel hardware accelerators...

chapter

Network Endpoints for Clusters of SMPs

Gabriel Tanase, Gheorghe Almasi, Hanhong Xue, Charles Archer

2012 IEEE 24th International Symposium on Computer Architecture and High Performance Computing > 27 - 34

2012 24th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD)

Modern large scale parallel machines feature an increasingly deep hierarchy of interconnections. Individual processing cores employ simultaneous multithreading (SMT) to better exploit functional units, multiple coherent processors are collocated in a node to better exploit links to cache, memory and network (SMP), and multiple nodes are interconnected by specialized low latency/high speed networks...

chapter

Assessing Energy Efficiency of Fault Tolerance Protocols for HPC Systems

Esteban Meneses, Osman Sarood, Laxmikant V. Kale

2012 IEEE 24th International Symposium on Computer Architecture and High Performance Computing > 35 - 42

2012 24th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD)

An exascale machine is expected to be delivered in the time frame 2018-2020. Such a machine will be able to tackle some of the hardest computational problems and to extend our understanding of Nature and the universe. However, to make that a reality, the HPC community has to solve a few important challenges. Resilience will become a prominent problem because an exascale machine will experience frequent...

chapter

Using Heterogeneous Networks to Improve Energy Efficiency in Direct Coherence Protocols for Many-Core CMPs

Alberto Ros, Ricardo Fernandez-Pascual, Manuel E. Acacio

2012 IEEE 24th International Symposium on Computer Architecture and High Performance Computing > 43 - 50

2012 24th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD)

Direct coherence protocols have been recently proposed as an alternative to directory-based protocols to keep cache coherence in many-core CMPs. Differently from directory-based protocols, in direct coherence the responsible for providing the requested data in case of a cache miss (i.e., the owner cache) is also tasked with keeping the updated directory information and serializing the different accesses...

chapter

Energy Savings via Dead Sub-Block Prediction

Marco A.Z. Alves, Khubaib, Eiman Ebrahimi, Veynu T. Narasiman, more

2012 IEEE 24th International Symposium on Computer Architecture and High Performance Computing > 51 - 58

2012 24th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD)

Cache memories have traditionally been designed to exploit spatial locality by fetching entire cache lines from memory upon a miss. However, recent studies have shown that often the number of sub-blocks within a line that are actually used is low. Furthermore, those sub-blocks that are used are accessed only a few times before becoming dead (i.e., never accessed again). This results in considerable...

chapter

Scalable Thread Scheduling in Asymmetric Multicores for Power Efficiency

Rance Rodrigues, Arunachalam Annamalai, Israel Koren, Sandip Kundu

2012 IEEE 24th International Symposium on Computer Architecture and High Performance Computing > 59 - 66

2012 24th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD)

The emergence of asymmetric multicore processors(AMPs) has elevated the problem of thread scheduling in such systems. The computing needs of a thread often vary during its execution (phases) and hence, reassigning threads to cores(thread swapping) upon detection of such a change, can significantly improve the AMP's power efficiency. Even though identifying a change in the resource requirements of...

chapter

Divergence Analysis with Affine Constraints

Diogo Sampaio, Rafael Martins, Sylvain Collange, Fernando Magno Quintao Pereira

2012 IEEE 24th International Symposium on Computer Architecture and High Performance Computing > 67 - 74

2012 24th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD)

The rising popularity of graphics processing units is bringing renewed interest in code optimization techniques for SIMD processors. Many of these optimizations rely on divergence analyses, which classify variables as uniform, if they have the same value on every thread, or divergent, if they might not. This paper introduces a new kind of divergence analysis, that is able to represent variables as...

INFONA - science communication portal

2012 IEEE 24th International Symposium on Computer Architecture and High Performance Computing

[Back cover]

[Title page i]

[Title page iii]

[Copyright notice]

Message from the General Chairs

Table of contents

Message from the Program Chairs

Program Committee

Committees

[Keynotes: Abstracts of three keynote presentations]

External reviewers

The Network Adapter: The Missing Link between MPI Applications and Network Performance

HAT: Heterogeneous Adaptive Throttling for On-Chip Networks

On the Efficiency of Register File versus Broadcast Interconnect for Collective Communications in Data-Parallel Hardware Accelerators

Network Endpoints for Clusters of SMPs

Assessing Energy Efficiency of Fault Tolerance Protocols for HPC Systems

Using Heterogeneous Networks to Improve Energy Efficiency in Direct Coherence Protocols for Many-Core CMPs

Energy Savings via Dead Sub-Block Prediction

Scalable Thread Scheduling in Asymmetric Multicores for Power Efficiency

Divergence Analysis with Affine Constraints

Filter options

Publication date

Keywords

INFONA - science communication portal

2012 IEEE 24th International Symposium on Computer Architecture and High Performance Computing $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2012 IEEE 24th International Symposium on Computer Architecture and High Performance Computing