Andreas Sandberg

chapter

Efficient techniques for predicting cache sharing and throughput

Andreas Sandberg, David Black-Schaffer, Erik Hagersten

2012 21st International Conference on Parallel Architectures and Compilation Techniques (PACT) > 305 - 314

2012 21st International Conference on Parallel Architectures and Compilation Techniques (PACT)

This work addresses the modeling of shared cache contention in multicore systems and its impact on throughput and bandwidth. We develop two simple and fast cache sharing models for accurately predicting shared cache allocations for random and LRU caches.

chapter

CoolSim: Statistical techniques to replace cache warming with efficient, virtualized profiling

Nikos Nikoleris, Andreas Sandberg, Erik Hagersten, Trevor E. Carlson

2016 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation (SAMOS) > 106 - 115

2016 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation (SAMOS)

Simulation is an important part of the evaluation of next-generation computing systems. Detailed, cycle-accurate simulation, however, can be very slow when evaluating realistic workloads on modern microarchitectures. Sampled simulation (e.g., SMARTS and SimPoint) improves simulation performance by an order of magnitude or more through the reduction of large workloads into a small but representative...

chapter

NoMali: Simulating a realistic graphics driver stack using a stub GPU

Rene de Jong, Andreas Sandberg

2016 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS) > 255 - 262

2016 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)

Since the advent of the smartphone, all high-end mobile devices have required graphics acceleration in the form of a GPU. Today, even low-power devices such as smartwatches use GPUs for rendering and composition. However, the computer architecture community has largely ignored these developments when evaluating new architecture proposals.

chapter

CoolSim: Eliminating traditional cache warming with fast, virtualized profiling

Nikos Nikoleris, Andreas Sandberg, Erik Hagersten, Trevor E. Carlson

2016 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS) > 149 - 150

2016 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)

Sampling (e.g., SMARTS and SimPoint) improves simulation performance by an order of magnitude or more through the reduction of large workloads into a small but representative sample. Virtualized fast-forwarding (e.g., FSA) speeds up simulation further by advancing execution at near-native speed between simulation points, making cache warming the critical limiting factor for simulation performance...

chapter

Full Speed Ahead: Detailed Architectural Simulation at Near-Native Speed

Andreas Sandberg, Nikos Nikoleris, Trevor E. Carlson, Erik Hagersten, more

2015 IEEE International Symposium on Workload Characterization > 183 - 192

2015 IEEE International Symposium on Workload Characterization (IISWC)

Cycle-level micro architectural simulation is the de-facto standard to estimate performance of next-generation platforms. Unfortunately, the level of detail needed for accurate simulation requires complex, and therefore slow, simulation models that run at speeds that are thousands of times slower than native execution. With the introduction of sampled simulation, it has become possible to simulate...

chapter

A Case for Resource Efficient Prefetching in Multicores

Muneeb Khan, Andreas Sandberg, Erik Hagersten

2014 43rd International Conference on Parallel Processing > 101 - 110

2014 43nd International Conference on Parallel Processing (ICPP)

Modern processors typically employ sophisticated prefetching techniques for hiding memory latency. Hardware prefetching has proven very effective and can speed up some SPEC CPU 2006 benchmarks by more than 40% when running in isolation. However, this speedup often comes at the cost of prefetching a significant volume of useless data (sometimes more than twice the data required) which wastes shared...

chapter

A case for resource efficient prefetching in multicores

Muneeb Khan, Andreas Sandberg, Erik Hagersten

2014 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS) > 137 - 138

2014 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)

Hardware prefetching has proven very effective for hiding memory latency and can speed up some applications by more than 40%. However, this speedup comes at the cost of often prefetching a significant volume of useless data which wastes shared last level cache space and off-chip bandwidth. This directly impacts the performance of co-scheduled applications which compete for shared resources in multicores...

chapter

Modeling performance variation due to cache sharing

Andreas Sandberg, Andreas Sembrant, Erik Hagersten, David Black-Schaffer

2013 IEEE 19th International Symposium on High Performance Computer Architecture (HPCA) > 155 - 166

2013 IEEE 19th International Symposium on High Performance Computer Architecture (HPCA)

Shared cache contention can cause significant variability in the performance of co-running applications from run to run. This variability arises from different overlappings of the applications' phases, which can be the result of offsets in application start times or other delays in the system. Understanding this variability is important for generating an accurate view of the expected impact of cache...

INFONA - science communication portal

Search results for: Andreas Sandberg

Efficient techniques for predicting cache sharing and throughput

CoolSim: Statistical techniques to replace cache warming with efficient, virtualized profiling

NoMali: Simulating a realistic graphics driver stack using a stub GPU

CoolSim: Eliminating traditional cache warming with fast, virtualized profiling

Full Speed Ahead: Detailed Architectural Simulation at Near-Native Speed

A Case for Resource Efficient Prefetching in Multicores

A case for resource efficient prefetching in multicores

Modeling performance variation due to cache sharing

Filter options

Publication date

Keywords

INFONA - science communication portal

Search results for: Andreas Sandberg

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options