Eiman Ebrahimi

chapter

Parallel application memory scheduling

Eiman Ebrahimi, Rustam Miftakhutdinov, Chris Fallin, Chang Joo Lee, more

2011 44th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO) > 362 - 373

2011 44th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO)

A primary use of chip-multiprocessor (CMP) systems is to speed up a single application by exploiting thread-level parallelism. In such systems, threads may slow each other down by issuing memory requests that interfere in the shared memory subsystem. This inter-thread memory system interference can significantly degrade parallel application performance. Better memory request scheduling may mitigate...

chapter

MCM-GPU: Multi-chip-module GPUs for continued performance scalability

Akhil Arunkumar, Evgeny Bolotin, Benjamin Cho, Ugljesa Milic, more

2017 ACM/IEEE 44th Annual International Symposium on Computer Architecture (ISCA) > 320 - 332

2017 ACM/IEEE 44th Annual International Symposium on Computer Architecture (ISCA)

Historically, improvements in GPU-based high performance computing have been tightly coupled to transistor scaling. As Moore's law slows down, and the number of transistors per die no longer grows at historical rates, the performance curve of single monolithic GPUs will ultimately plateau. However, the need for higher performing GPUs continues to exist in many domains. To address this need, in this...

chapter

Accelerating Dependent Cache Misses with an Enhanced Memory Controller

Milad Hashemi, Khubaib, Eiman Ebrahimi, Onur Mutlu, more

2016 ACM/IEEE 43rd Annual International Symposium on Computer Architecture (ISCA) > 444 - 455

2016 ACM/IEEE 43rd Annual International Symposium on Computer Architecture (ISCA)

On-chip contention increases memory access latency for multi-core processors. We identify that this additional latency has a substantial effect on performance for an important class of latency-critical memory operations: those that result in a cache miss and are dependent on data from a prior cache miss. We observe that the number of instructions between the first cache miss and its dependent cache...

chapter

Selective GPU caches to eliminate CPU-GPU HW cache coherence

Neha Agarwal, David Nellans, Eiman Ebrahimi, Thomas F. Wenisch, more

2016 IEEE International Symposium on High Performance Computer Architecture (HPCA) > 494 - 506

2016 IEEE International Symposium on High Performance Computer Architecture (HPCA)

Cache coherence is ubiquitous in shared memory multiprocessors because it provides a simple, high performance memory abstraction to programmers. Recent work suggests extending hardware cache coherence between CPUs and GPUs to help support programming models with tightly coordinated sharing between CPU and GPU threads. However, implementing hardware cache coherence is particularly challenging in systems...

chapter

Flexible software profiling of GPU architectures

Mark Stephenson, Siva Kumar Sastry Hari, Yunsup Lee, Eiman Ebrahimi, more

2015 ACM/IEEE 42nd Annual International Symposium on Computer Architecture (ISCA) > 185 - 197

2015 ACM/IEEE 42nd Annual International Symposium on Computer Architecture (ISCA)

To aid application characterization and architecture design space exploration, researchers and engineers have developed a wide range of tools for CPUs, including simulators, profilers, and binary instrumentation tools. With the advent of GPU computing, GPU manufacturers have developed similar tools leveraging hardware profiling and debugging hooks. To date, these tools are largely limited by the fixed...

chapter

Predicting Performance Impact of DVFS for Realistic Memory Systems

Rustam Miftakhutdinov, Eiman Ebrahimi, Yale N. Patt

2012 45th Annual IEEE/ACM International Symposium on Microarchitecture > 155 - 165

2012 45th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO)

Dynamic voltage and frequency scaling (DVFS) can make modern processors more power and energy efficient if we can accurately predict the effect of frequency scaling on processor performance. State-of-the-art DVFS performance predictors, however, fail to accurately predict performance when confronted with realistic memory systems. We propose CRIT+BW, the first DVFS performance predictor designed for...

chapter

Energy Savings via Dead Sub-Block Prediction

Marco A.Z. Alves, Khubaib, Eiman Ebrahimi, Veynu T. Narasiman, more

2012 IEEE 24th International Symposium on Computer Architecture and High Performance Computing > 51 - 58

2012 24th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD)

Cache memories have traditionally been designed to exploit spatial locality by fetching entire cache lines from memory upon a miss. However, recent studies have shown that often the number of sub-blocks within a line that are actually used is low. Furthermore, those sub-blocks that are used are accessed only a few times before becoming dead (i.e., never accessed again). This results in considerable...

chapter

Prefetch-aware shared-resource management for multi-core systems

Eiman Ebrahimi, Chang Joo Lee, Onur Mutlu, Yale N. Patt

2011 38th Annual International Symposium on Computer Architecture (ISCA) > 141 - 152

2011 ACM/IEEE 38th International Symposium on Computer Architecture (ISCA)

Chip multiprocessors (CMPs) share a large portion of the memory subsystem among multiple cores. Recent proposals have addressed high-performance and fair management of these shared resources; however, none of them take into account prefetch requests. Without prefetching, significant performance is lost, which is why existing systems prefetch. By not taking into account prefetch requests, recent shared-resource...

INFONA - science communication portal

Search results for: Eiman Ebrahimi

Parallel application memory scheduling

MCM-GPU: Multi-chip-module GPUs for continued performance scalability

Accelerating Dependent Cache Misses with an Enhanced Memory Controller

Selective GPU caches to eliminate CPU-GPU HW cache coherence

Flexible software profiling of GPU architectures

Predicting Performance Impact of DVFS for Realistic Memory Systems

Energy Savings via Dead Sub-Block Prediction

Prefetch-aware shared-resource management for multi-core systems

Filter options

Publication date

Keywords

INFONA - science communication portal

Search results for: Eiman Ebrahimi

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options