Eiman Ebrahimi

chapter

Parallel application memory scheduling

Eiman Ebrahimi, Rustam Miftakhutdinov, Chris Fallin, Chang Joo Lee, et al.

2011 44th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), pp. 362-373

A primary use of chip-multiprocessor (CMP) systems is to speed up a single application by exploiting thread-level parallelism. In such systems, threads may slow each other down by issuing memory requests that interfere in the shared memory subsystem. This inter-thread memory system interference can significantly degrade parallel application performance. Better memory request scheduling may mitigate...

chapter

MCM-GPU: Multi-chip-module GPUs for continued performance scalability

Akhil Arunkumar, Evgeny Bolotin, Benjamin Cho, Ugljesa Milic, et al.

2017 ACM/IEEE 44th Annual International Symposium on Computer Architecture (ISCA), pp. 320-332

Historically, improvements in GPU-based high performance computing have been tightly coupled to transistor scaling. As Moore's law slows down, and the number of transistors per die no longer grows at historical rates, the performance curve of single monolithic GPUs will ultimately plateau. However, the need for higher performing GPUs continues to exist in many domains. To address this need, in this...

chapter

Accelerating Dependent Cache Misses with an Enhanced Memory Controller

Milad Hashemi, Khubaib, Eiman Ebrahimi, Onur Mutlu, et al.

2016 ACM/IEEE 43rd Annual International Symposium on Computer Architecture (ISCA), pp. 444-455

On-chip contention increases memory access latency for multi-core processors. We identify that this additional latency has a substantial effect on performance for an important class of latency-critical memory operations: those that result in a cache miss and are dependent on data from a prior cache miss. We observe that the number of instructions between the first cache miss and its dependent cache...

chapter

Selective GPU caches to eliminate CPU-GPU HW cache coherence

Neha Agarwal, David Nellans, Eiman Ebrahimi, Thomas F. Wenisch, et al.

2016 IEEE International Symposium on High Performance Computer Architecture (HPCA), pp. 494-506

Cache coherence is ubiquitous in shared memory multiprocessors because it provides a simple, high performance memory abstraction to programmers. Recent work suggests extending hardware cache coherence between CPUs and GPUs to help support programming models with tightly coordinated sharing between CPU and GPU threads. However, implementing hardware cache coherence is particularly challenging in systems...

chapter

Flexible software profiling of GPU architectures

Mark Stephenson, Siva Kumar Sastry Hari, Yunsup Lee, Eiman Ebrahimi, et al.

2015 ACM/IEEE 42nd Annual International Symposium on Computer Architecture (ISCA), pp. 185-197

To aid application characterization and architecture design space exploration, researchers and engineers have developed a wide range of tools for CPUs, including simulators, profilers, and binary instrumentation tools. With the advent of GPU computing, GPU manufacturers have developed similar tools leveraging hardware profiling and debugging hooks. To date, these tools are largely limited by the fixed...

chapter

Predicting Performance Impact of DVFS for Realistic Memory Systems

Rustam Miftakhutdinov, Eiman Ebrahimi, Yale N. Patt

2012 45th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO), pp. 155-165

Dynamic voltage and frequency scaling (DVFS) can make modern processors more power and energy efficient if we can accurately predict the effect of frequency scaling on processor performance. State-of-the-art DVFS performance predictors, however, fail to accurately predict performance when confronted with realistic memory systems. We propose CRIT+BW, the first DVFS performance predictor designed for...

chapter

Energy Savings via Dead Sub-Block Prediction

Marco A.Z. Alves, Khubaib, Eiman Ebrahimi, Veynu T. Narasiman, et al.

2012 IEEE 24th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), pp. 51-58

Cache memories have traditionally been designed to exploit spatial locality by fetching entire cache lines from memory upon a miss. However, recent studies have shown that often the number of sub-blocks within a line that are actually used is low. Furthermore, those sub-blocks that are used are accessed only a few times before becoming dead (i.e., never accessed again). This results in considerable...

chapter

Prefetch-aware shared-resource management for multi-core systems

Eiman Ebrahimi, Chang Joo Lee, Onur Mutlu, Yale N. Patt

2011 38th Annual International Symposium on Computer Architecture (ISCA), pp. 141-152

Chip multiprocessors (CMPs) share a large portion of the memory subsystem among multiple cores. Recent proposals have addressed high-performance and fair management of these shared resources; however, none of them take into account prefetch requests. Without prefetching, significant performance is lost, which is why existing systems prefetch. By not taking into account prefetch requests, recent shared-resource...

INFONA - science communication portal

Search results for: Eiman Ebrahimi
