The design of network architectures has become increasingly complex as the chips connected by inter-node networks have emerged as distributed systems in their own right, complete with their own on-chip networks. In Anton 2, a massively parallel special-purpose supercomputer for molecular dynamics simulations, we managed this complexity by reusing the on-chip network as a switch for inter-node traffic...
Datacenter workloads demand high computational capabilities, flexibility, power efficiency, and low cost. It is challenging to improve all of these factors simultaneously. To advance datacenter capabilities beyond what commodity server designs can provide, we have designed and built a composable, reconfigurable fabric to accelerate portions of large-scale software services. Each instantiation of the...
In the many-core era, scalable coherence and on-chip interconnects are crucial for shared memory processors. While snoopy coherence is common in small multicore systems, directory-based coherence is the de facto choice for scalability to many cores, as snoopy relies on ordered interconnects which do not scale. However, directory-based coherence does not scale beyond tens of cores due to excessive...
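The truncated abstract points at directory storage as a scaling bottleneck. A minimal sketch of a generic full-map directory (an illustration of the standard design, not this paper's proposal; the DirectoryEntry class and its fields are assumptions) makes the linear growth of sharer state with core count concrete:

```python
# Illustrative sketch (not from the paper): a full-map directory entry keeps
# one sharer bit per core, so directory state grows linearly with core count,
# which is one source of the scaling cost the abstract alludes to.

class DirectoryEntry:
    """Full-map directory state for one cache block."""

    def __init__(self, num_cores):
        self.sharers = [False] * num_cores  # one presence bit per core
        self.owner = None                   # core holding the block exclusively

    def add_sharer(self, core_id):
        self.sharers[core_id] = True

    def invalidate_all(self):
        """Return the cores that must receive invalidations on a write."""
        targets = [c for c, present in enumerate(self.sharers) if present]
        self.sharers = [False] * len(self.sharers)
        return targets

# Storage overhead: bits per block scale with core count.
for cores in (16, 64, 256, 1024):
    print(f"{cores} cores -> {cores} sharer bits per directory entry")
```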
The trend of transistor downsizing and operating-voltage scaling has made processor chips more sensitive to radiation phenomena, making soft errors an important challenge. New reliability techniques for handling soft errors in logic and memories, which allow the desired failures-in-time (FIT) target to be met, are key to continuing to harness the benefits of Moore's law. The failure to scale the...
Memory system reliability is increasingly a concern as memory cell density and capacity continue to grow. The conventional approach is to use redundant memory bits for error detection and correction, with significant storage, cost and power overheads. In this paper, we propose a novel, system-level scheme called MemGuard for memory error detection. With OS-based checkpointing, it is also able to recover...
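For contrast with the conventional approach the abstract criticizes, here is a minimal sketch of per-word parity, the simplest form of redundant-bit error detection (this illustrates the baseline only, not MemGuard's scheme; the parity helper is an assumption):

```python
# A minimal sketch of the conventional redundancy the abstract contrasts
# against: one parity bit per 64-bit word detects any single-bit flip.
# (MemGuard avoids such dedicated redundant bits; this is only the baseline.)

def parity(word):
    """Even parity over a 64-bit word."""
    return bin(word & (2**64 - 1)).count("1") & 1

stored_word = 0xDEADBEEFCAFEF00D
stored_parity = parity(stored_word)

# Simulate a single-bit soft error in memory.
corrupted = stored_word ^ (1 << 17)

# Detection on read: recomputed parity disagrees with stored parity.
assert parity(corrupted) != stored_parity
print("single-bit error detected")
```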
As technology scales, the hardware reliability challenge affects a broad computing market, rendering traditional redundancy-based solutions too expensive. Software-anomaly-based hardware error detection has emerged as a low-cost reliability solution, but suffers from Silent Data Corruptions (SDCs). It is crucial to accurately evaluate SDC rates and identify SDC-producing software locations to develop...
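SDC rates are commonly estimated by fault injection: flip a bit in program state, run to completion, and compare against a fault-free golden run. The sketch below is a generic illustration of that methodology, not necessarily this paper's framework; the workload function and fault model are assumptions:

```python
# Generic fault-injection sketch: flip one bit in an intermediate value,
# run to completion, and classify the outcome against a golden result.
# Faults that do not change the output are masked; wrong outputs with no
# crash are SDCs.

import random

def workload(x, fault_site=None, fault_bit=0):
    count = 0
    for i in range(100):
        term = x * i
        if fault_site == i:       # inject a single-bit flip here
            term ^= 1 << fault_bit
        if term > 150:            # only the comparison outcome reaches output
            count += 1
    return count

golden = workload(3)
sdc, trials = 0, 1000
for _ in range(trials):
    out = workload(3, fault_site=random.randrange(100),
                   fault_bit=random.randrange(32))
    if out != golden:             # wrong output, no crash: an SDC
        sdc += 1
print(f"SDC rate: {sdc / trials:.2%} (remainder masked)")
```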
Reliability for general purpose processing on the GPU (GPGPU) is becoming a weak link in the construction of reliable supercomputer systems. Because hardware protection is expensive to develop, requires dedicated on-chip resources, and is not portable across different architectures, the efficiency of software solutions such as redundant multithreading (RMT) must be explored. This paper presents a...
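As a rough illustration of the RMT idea (a sketch under assumed names, not the paper's implementation), the same kernel can be run in two threads and the results compared, with a mismatch flagging a transient fault:

```python
# Generic software-RMT sketch: a leading and a trailing thread execute the
# same deterministic kernel; disagreement between their results signals a
# transient hardware fault within the sphere of replication.

from concurrent.futures import ThreadPoolExecutor

def kernel(data):
    return sum(v * v for v in data)

def redundant_run(data):
    with ThreadPoolExecutor(max_workers=2) as pool:
        leading = pool.submit(kernel, data)
        trailing = pool.submit(kernel, data)
        a, b = leading.result(), trailing.result()
    if a != b:
        raise RuntimeError("redundant-thread mismatch: soft error detected")
    return a

print(redundant_run([0.5] * 1024))
```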
Architectural Design Space Exploration (DSE) is a notoriously difficult problem due to the exponentially large size of the design space and long simulation times. Previously, many studies proposed to formulate DSE as a regression problem which predicts architecture responses (e.g., time, power) of a given architectural configuration. Several of these techniques achieve high accuracy, though often...
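A minimal sketch of the DSE-as-regression formulation (assuming scikit-learn is available; the knobs, the simulate stand-in, and the model choice are illustrative, not the paper's):

```python
# Sketch of DSE as regression: learn a map from architectural knobs to a
# response such as runtime, then query the model instead of simulating
# every point in the design space.

import random
from sklearn.ensemble import RandomForestRegressor

def simulate(cfg):                     # stand-in for a slow cycle simulator
    cores, l2_kb, width = cfg
    return 1e6 / (cores * width) + 5e4 / l2_kb + random.gauss(0, 10)

space = [(c, l2, w) for c in (1, 2, 4, 8)
                    for l2 in (256, 512, 1024, 2048)
                    for w in (2, 4, 8)]

train = random.sample(space, 12)       # simulate only a small sample
model = RandomForestRegressor(n_estimators=100).fit(
    train, [simulate(cfg) for cfg in train])

# Predict responses for the rest of the design space in one shot.
preds = model.predict(space)
best = space[min(range(len(space)), key=lambda i: preds[i])]
print("predicted-best configuration:", best)
```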
Hardware specialization, in the form of accelerators that provide custom datapath and control for specific algorithms and applications, promises impressive performance and energy advantages compared to traditional architectures. Current research in accelerator analysis relies on RTL-based synthesis flows to produce accurate timing, power, and area estimates. Such techniques not only require significant...
Modern and future many-core systems represent complex architectures. The communication fabrics of these large systems heavily influence their performance and power consumption. Current simulation methodologies for evaluating networks-on-chip (NoCs) are not keeping pace with the increased complexity of our systems; architects often want to explore many different design knobs quickly. Methodologies...
Heterogeneous multicore architectures have the potential for high performance and energy efficiency. These architectures may be composed of small power-efficient cores, large high-performance cores, and/or specialized cores that accelerate the performance of a particular class of computation. Architects have explored multiple dimensions of heterogeneity, both in terms of micro-architecture and specialization...
Modern processors optimize for cache energy and performance by employing multiple levels of caching that balance bandwidth, latency, and capacity. A request typically traverses the cache hierarchy, level by level, until the data is found, wasting time and energy at each level. In this paper, we present the Direct-to-Data (D2D) cache that locates data across the entire cache hierarchy...
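To make the contrast concrete, the toy sketch below compares a level-by-level probe with a lookup through a single location table; the structures and latencies are hypothetical, not the D2D design itself:

```python
# Toy contrast: a conventional lookup probes L1, then L2, then L3 in order,
# paying each level's latency, while a D2D-style scheme consults one
# location table and goes straight to the level that holds the line.

LEVELS = {"L1": {0x40}, "L2": {0x40, 0x80}, "L3": {0x40, 0x80, 0xC0}}
LATENCY = {"L1": 4, "L2": 12, "L3": 40}

def hierarchical_lookup(addr):
    cost = 0
    for level in ("L1", "L2", "L3"):
        cost += LATENCY[level]           # pay each level's latency in turn
        if addr in LEVELS[level]:
            return cost
    raise KeyError("miss to memory")

# A location table tracks which level holds each line: one probe suffices.
LOCATION = {0x40: "L1", 0x80: "L2", 0xC0: "L3"}

def direct_lookup(addr):
    return LATENCY[LOCATION[addr]]

print(hierarchical_lookup(0xC0), "cycles level-by-level vs",
      direct_lookup(0xC0), "cycles direct")
```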
Low utilization of on-chip cache capacity limits performance and wastes energy because of the long latency, limited bandwidth, and energy consumption associated with off-chip memory accesses. Value replication is an important source of low capacity utilization. While prior cache compression techniques manage to code frequent values densely, they trade off a high compression ratio for low decompression...
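As a rough illustration of the frequent-value idea the abstract references (the table contents and encoding are assumptions, not any specific paper's codec):

```python
# Minimal frequent-value-compression sketch: values found in a small
# frequent-value table are stored as short indices; everything else is
# kept verbatim behind an escape marker.

FREQUENT = [0, 1, 0xFFFFFFFF, 0x80000000]       # assumed hot values
INDEX = {v: i for i, v in enumerate(FREQUENT)}

def compress(words):
    out = []
    for w in words:
        if w in INDEX:
            out.append(("fv", INDEX[w]))        # short code in hardware
        else:
            out.append(("raw", w))              # escape + full 32-bit word
    return out

def decompress(codes):
    return [FREQUENT[x] if tag == "fv" else x for tag, x in codes]

block = [0, 0, 7, 0xFFFFFFFF, 42, 0]
codes = compress(block)
assert decompress(codes) == block
fv_count = sum(tag == "fv" for tag, _ in codes)
print(f"{fv_count}/{len(block)} words coded with short indices")
```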
On-chip caches maintain multiple pieces of metadata about each cached block—e.g., dirty bit, coherence information, ECC. Traditionally, such metadata for each block is stored in the corresponding tag entry in the tag store. While this approach is simple to implement and scalable, it necessitates a full tag store lookup for any metadata query—resulting in high latency and energy consumption. We find...
Many emerging applications from various domains often exhibit heterogeneous memory characteristics. When running in combination on parallel platforms, these applications present a daunting variety of workload behaviors that challenge the effectiveness of any memory allocation strategy. Prior partitioning-based or random memory allocation schemes typically manage only one level of the memory hierarchy...