35th International Symposium on Computer Architecture

Items from 1 to 20 out of 50 results

chapter

3D-Stacked Memory Architectures for Multi-core Processors

G.H. Loh

2008 International Symposium on Computer Architecture > 453 - 464

35th International Symposium on Computer Architecture

Three-dimensional integration enables stacking memory directly on top of a microprocessor, thereby significantly reducing wire delay between the two. Previous studies have examined the performance benefits of such an approach, but all of these works only consider commodity 2D DRAM organizations. In this work, we explore more aggressive 3D DRAM organizations that make better use of the additional die-to-die...

chapter

VEAL: Virtualized Execution Accelerator for Loops

N. Clark, A. Hormati, S. Mahlke

2008 International Symposium on Computer Architecture > 389 - 400

35th International Symposium on Computer Architecture

Performance improvement solely through transistor scaling is becoming more and more difficult, thus it is increasingly common to see domain specific accelerators used in conjunction with general purpose processors to achieve future performance goals. There is a serious drawback to accelerators, though: binary compatibility. An application compiled to utilize an accelerator cannot run on a processor...

chapter

Flexible Hardware Acceleration for Instruction-Grain Program Monitoring

S. Chen, M. Kozuch, T. Strigkos, B. Falsafi, more

2008 International Symposium on Computer Architecture > 377 - 388

35th International Symposium on Computer Architecture

Instruction-grain program monitoring tools, which check and analyze executing programs at the granularity of individual instructions, are invaluable for quickly detecting bugs and security attacks and then limiting their damage (via containment and/or recovery). Unfortunately, their fine-grain nature implies very high monitoring overheads for software-only tools, which are typically based on dynamic...

chapter

Microcoded Architectures for Ion-Tap Quantum Computers

L. Kreger-Stickles, M. Oskin

2008 International Symposium on Computer Architecture > 165 - 176

35th International Symposium on Computer Architecture

In this paper we present the first ever systematic design space exploration of microcoded software fault tolerant ion-trap quantum computers. This exploration reveals the critical importance of a well-tuned microcode for providing high performance and ensuring system reliability. In addition, we find that, despite recent advances in the reliability of quantum memory, the impact of errors due to stored...

chapter

ReVIVaL: A Variation-Tolerant Architecture Using Voltage Interpolation and Variable Latency

Xiaoyao Liang, Gu-Yeon Wei, D. Brooks

2008 International Symposium on Computer Architecture > 191 - 202

35th International Symposium on Computer Architecture

Process variations are poised to significantly degrade performance benefits sought by moving to the next nanoscale technology node. Parameter fluctuations in devices can introduce large variations in peak operation among chips, among cores on a single chip, and among microarchitectural blocks within one core. Hence, it will be difficult to only rely on traditional frequency binning to efficiently...

chapter

Running a Quantum Circuit at the Speed of Data

N. Isailovic, M. Whitney, Y. Patel, J. Kubiatowicz

2008 International Symposium on Computer Architecture > 177 - 188

35th International Symposium on Computer Architecture

We analyze circuits for kernels from popular quantum computing applications, characterizing the hardware resources necessary to take ancilla preparation off the critical path. The result is a chip entirely dominated by ancilla generation circuits. To address this issue, we introduce optimized ancilla factories and analyze theirstructure and physical layout for ion trap technology. We introduce a new...

chapter

Corona: System Implications of Emerging Nanophotonic Technology

D. Vantrease, R. Schreiber, M. Monchiero, M. McLaren, more

2008 International Symposium on Computer Architecture > 153 - 164

35th International Symposium on Computer Architecture

We expect that many-core microprocessors will push performance per chip from the 10 gigaflop to the 10 teraflop range in the coming decade. To support this increased performance, memory and inter-core bandwidths will also have to scale by orders of magnitude. Pin limitations, the energy cost of electrical signaling, and the non-scalability of chip-length global wires are significant bandwidth impediments...

chapter

Using Hardware Memory Protection to Build a High-Performance, Strongly-Atomic Hybrid Transactional Memory

L. Baugh, N. Neelakantam, C. Zilles

2008 International Symposium on Computer Architecture > 115 - 126

35th International Symposium on Computer Architecture

We demonstrate how fine-grained memory protection can be used in support of transactional memory systems: first showing how a software transactional memory system (STM) can be made strongly atomic by using memory protection on transactionally-held state, then showing how such a strongly-atomic STM can be used with a bounded hardware TM system to build a hybrid TM system in which zero-overhead hardware...

chapter

Polymorphic On-Chip Networks

M.M. Kim, J.D. Davis, M. Oskin, T. Austin

2008 International Symposium on Computer Architecture > 101 - 112

35th International Symposium on Computer Architecture

As the number of cores per die increases, be they processors, memory blocks, or custom accelerators, the on-chip interconnect the cores use to communicate gains importance. We begin this study with an area-performance analysis of the interconnect design space. We find that there is no single network design that yields optimal performance across a range of traffic patterns. This indicates that there...

chapter

Trading off Cache Capacity for Reliability to Enable Low Voltage Operation

C. Wilkerson, Hongliang Gao, A.R. Alameldeen, Z. Chishti, more

2008 International Symposium on Computer Architecture > 203 - 214

35th International Symposium on Computer Architecture

One of the most effective techniques to reduce a processor's power consumption is to reduce supply voltage. However, reducing voltage in the context of manufacturing-induced parameter variations can cause many types of memory circuits to fail. As a result, voltage scaling is limited by a minimum voltage, often called Vccmin, beyond which circuits may not operate reliably. Large memory structures (e...

chapter

TokenTM: Efficient Execution of Large Transactions with Hardware Transactional Memory

J. Bobba, N. Goyal, M.D. Hill, M.M. Swift, more

2008 International Symposium on Computer Architecture > 127 - 138

35th International Symposium on Computer Architecture

Current hardware transactional memory systems seek to simplify parallel programming, but assume that large transactions are rare, so it is acceptable to penalize their performance or concurrency. However, future programmers may wish to use large transactions more often in order to integrate with higher-level programming models (e.g., database transactions) or perform selected I/O operations. To prevent...

chapter

Title Page iii

2008 International Symposium on Computer Architecture > iii

35th International Symposium on Computer Architecture

chapter

Message from the Program Chair

2008 International Symposium on Computer Architecture > xi

35th International Symposium on Computer Architecture

chapter

Achieving Out-of-Order Performance with Almost In-Order Complexity

F. Tseng, Y.N. Patt

2008 International Symposium on Computer Architecture > 3 - 12

35th International Symposium on Computer Architecture

There is still much performance to be gained by out-of-order processors with wider issue widths. However, traditional methods of increasing issue width do not scale; that is, they drastically increase design complexity and power requirements. This paper introduces the braid, a compile-time identified entity that enables the execution core to scale to wider widths by exploiting the small fanout and...

chapter

A Proactive Wearout Recovery Approach for Exploiting Microarchitectural Redundancy to Extend Cache SRAM Lifetime

Jeonghee Shin, V. Zyuban, P. Bose, T.M. Pinkston

2008 International Symposium on Computer Architecture > 353 - 362

35th International Symposium on Computer Architecture

Microarchitectural redundancy has been proposed as a means of improving chip lifetime reliability. It is typically used in a reactive way, allowing chips to maintain operability in the presence of failures by detecting and isolating, correcting, and/or replacing components on a first-come, first-served basis only after they become faulty. In this paper, we explore an alternative, more preferred method...

chapter

Publisher's Information

2008 International Symposium on Computer Architecture > 468

35th International Symposium on Computer Architecture

chapter

Counting Dependence Predictors

F. Roesner, D. Burger, S.W. Keckler

2008 International Symposium on Computer Architecture > 215 - 226

35th International Symposium on Computer Architecture

Modern processors rely on memory dependence prediction to execute load instructions as early as possible, speculating that they are not dependent on an earlier, unissued store. To date, the most sophisticated dependence predictors, such as Store Sets, have been tightly coupled to the fetch and execution streams, requiring global knowledge of the in-flight stream of stores to synchronize loads with...

chapter

Flexible Decoupled Transactional Memory Support

A. Shriraman, S. Dwarkadas, M.L. Scott

2008 International Symposium on Computer Architecture > 139 - 150

35th International Symposium on Computer Architecture

A high-concurrency transactional memory (TM) implementation needs to track concurrent accesses, buffer speculative updates, and manage conflicts. We present a system, FlexTM (FLEXible Transactional Memory), that coordinates four decoupled hardware mechanisms: read and write signatures, which summarize per-thread access sets; per-thread conflict summary tables (CSTs), which identify the threads with...

chapter

Author Index

2008 International Symposium on Computer Architecture > 465 - 466

35th International Symposium on Computer Architecture

chapter

Learning and Leveraging the Relationship between Architecture-Level Measurements and Individual User Satisfaction

A. Shye, B. Ozisikyilmaz, A. Mallik, G. Memik, more

2008 International Symposium on Computer Architecture > 427 - 438

35th International Symposium on Computer Architecture

The ultimate goal of computer design is to satisfy the end-user. In particular computing domains, such as interactive applications, there exists a variation in user expectations and user satisfaction relative to the performance of existing computer systems. In this work, we leverage this variation to develop more efficient architectures that are customized to end-users. We first investigate the relationship...

Publication date

Set your own date range

Content availability

Available (49)
None (1)

Keywords

HARDWARE (17)
COMPUTER ARCHITECTURE (14)
SOFTWARE (10)
MAGNETIC CORES (8)
PROGRAM PROCESSORS (8)
RANDOM ACCESS MEMORY (8)
REGISTERS (8)
DELAY (7)
MICROARCHITECTURE (6)
BANDWIDTH (5)
MICROPROCESSOR CHIPS (5)
RELIABILITY (5)
SWITCHES (5)
CACHE STORAGE (4)
CHIP MULTIPROCESSORS (4)
DRAM CHIPS (4)
LOGIC DESIGN (4)
NETWORK-ON-CHIP (4)
PARALLEL PROCESSING (4)
POWER DEMAND (4)
PROPOSALS (4)
PROTOCOLS (4)
SERVERS (4)
SHARED MEMORY SYSTEMS (4)
STORAGE MANAGEMENT (4)
SYSTEM-ON-A-CHIP (4)
THROUGHPUT (4)
TRANSACTIONAL MEMORY (4)
ARRAYS (3)
BENCHMARK TESTING (3)
CLOCKS (3)
COHERENCE (3)
MEMORY MANAGEMENT (3)
MEMORY SYSTEMS (3)
MULTI-THREADING (3)
MULTIPROCESSING SYSTEMS (3)
MULTIPROCESSOR INTERCONNECTION NETWORKS (3)
MULTIPROCESSORS (3)
NETWORK ROUTING (3)
PARALLEL PROGRAMMING (3)
POWER AWARE COMPUTING (3)
PROCESSOR SCHEDULING (3)
QUALITY OF SERVICE (3)
RADIATION DETECTORS (3)
RESOURCE MANAGEMENT (3)
SPACE EXPLORATION (3)
TIMING (3)
TRANSACTION PROCESSING (3)
ACCELERATION (2)
CACHE (2)
CHIP MULTIPROCESSOR (2)
COMPLEXITY THEORY (2)
COMPUTER BUGS (2)
FAULT TOLERANCE (2)
FAULT TOLERANT SYSTEMS (2)
INTEGRATED CIRCUIT DESIGN (2)
INTEGRATED CIRCUIT INTERCONNECTIONS (2)
INTEGRATED CIRCUIT MODELING (2)
INTERCONNECTS (2)
LOGIC GATES (2)
MEMORY ACCESSES (2)
MEMORY ARCHITECTURE (2)
MICROCONTROLLERS (2)
MICROPROCESSOR (2)
MULTICORE PROCESSORS (2)
NETWORK TOPOLOGY (2)
NETWORK-ON-CHIP ARCHITECTURE (2)
ON-CHIP NETWORK (2)
OPTIMIZATION (2)
PARALLEL ARCHITECTURES (2)
PARTICLE TRAPS (2)
PERFORMANCE EVALUATION (2)
PERMISSION (2)
PROCESS CONTROL (2)
PROGRAM COMPILERS (2)
PROGRAM DEBUGGING (2)
QUANTUM (2)
QUANTUM COMPUTING (2)
RESOURCE ALLOCATION (2)
ROUTING (2)
SCHEDULES (2)
SECURITY OF DATA (2)
SRAM (2)
SRAM CHIPS (2)
THREAD-LEVEL PARALLELISM (2)
THREE DIMENSIONAL DISPLAYS (2)
TOPOLOGY (2)
TRANSISTORS (2)
2D NOC DESIGN (1)
3D (1)
3D INTEGRATION (1)
3D MANY-CORE ARCHITECTURE (1)
3D STACKING (1)
3D-STACKED MEMORY ARCHITECTURES (1)
ABORT HANDLER (1)
ACCESS SCHEDULING POLICIES (1)
ADAPTIVE INTER-ROUTER DUAL-FUNCTION ENERGY LINK (1)
ADAPTIVE ROUTING DECISION (1)
AGGRESSIVE PROCESS SCALING (1)
ALMOST IN-ORDER COMPLEXITY (1)
more

INFONA - science communication portal

35th International Symposium on Computer Architecture

3D-Stacked Memory Architectures for Multi-core Processors

VEAL: Virtualized Execution Accelerator for Loops

Flexible Hardware Acceleration for Instruction-Grain Program Monitoring

Microcoded Architectures for Ion-Tap Quantum Computers

ReVIVaL: A Variation-Tolerant Architecture Using Voltage Interpolation and Variable Latency

Running a Quantum Circuit at the Speed of Data

Corona: System Implications of Emerging Nanophotonic Technology

Using Hardware Memory Protection to Build a High-Performance, Strongly-Atomic Hybrid Transactional Memory

Polymorphic On-Chip Networks

Trading off Cache Capacity for Reliability to Enable Low Voltage Operation

TokenTM: Efficient Execution of Large Transactions with Hardware Transactional Memory

Title Page iii

Message from the Program Chair

Achieving Out-of-Order Performance with Almost In-Order Complexity

A Proactive Wearout Recovery Approach for Exploiting Microarchitectural Redundancy to Extend Cache SRAM Lifetime

Publisher's Information

Counting Dependence Predictors

Flexible Decoupled Transactional Memory Support

Author Index

Learning and Leveraging the Relationship between Architecture-Level Measurements and Individual User Satisfaction

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

35th International Symposium on Computer Architecture $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

35th International Symposium on Computer Architecture