Search results for: Junli Gu

Items from 1 to 6 out of 6 results

chapter

Heterogeneous system coherence for integrated CPU-GPU systems

Jason Power, Arkaprava Basu, Junli Gu, Sooraj Puthoor, more

2013 46th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO) > 457 - 467

2013 46th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO)

Many future heterogeneous systems will integrate CPUs and GPUs physically on a single chip and logically connect them via shared memory to avoid explicit data copying. Making this shared memory coherent facilitates programming and fine-grained sharing, but throughput-oriented GPUs can overwhelm CPUs with coherence requests not well-filtered by caches. Meanwhile, region coherence has been proposed...

chapter

PPEP: Online Performance, Power, and Energy Prediction Framework and DVFS Space Exploration

Bo Su, Junli Gu, Li Shen, Wei Huang, more

2014 47th Annual IEEE/ACM International Symposium on Microarchitecture > 445 - 457

2014 47th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO)

Performance, power, and energy (PPE) are critical aspects of modern computing. It is challenging to accurately predict, in real time, the effect of dynamic voltage and frequency scaling (DVFS) on PPE across a wide range of voltages and frequencies. This results in the use of reactive, iterative, and inefficient algorithms for dynamically finding good DVFS states. We propose PPEP, an online PPE prediction...

chapter

iCHAT: Inter-cache Hardware-Assistant Data Transfer for Heterogeneous Chip Multiprocessors

Junli Gu, Bradford M. Beckmann, Ting Cao, Yu Hu

2014 9th IEEE International Conference on Networking, Architecture, and Storage > 242 - 251

2014 9th IEEE International Conference on Networking, Architecture, and Storage (NAS)

Modern heterogeneous multiprocessors integrate CPU and GPU together to provide a boost to computational performance. Data sharing and communication between CPU and GPU has been a critical issue for the final speedup. With tighter integration of CPU and GPU, it has the advantage of sharing and moving data more efficiently in order to leverage the computational power that a GPU can provide. Initially,...

article

Optimizing a Parallel Video Encoder with Message Passing and a Shared Memory Architecture

Junli Gu, Yihe Sun

Tsinghua Science & Technology > 2011 > 16 > 4 > 393-398

Implementing video applications on emerging multi-core processors is a promising technique for personal, real-time multi-media applications. However, when porting the legacy parallel video encoders developed for clusters to shared-memory multi-cores, the existing parallel algorithms result in workload imbalances on different cores and communication inefficiencies. This paper describes a strip-wise...

chapter

MOPED: Orchestrating interprocess message data on CMPs

Junli Gu, S S Lumetta, R Kumar, Yihe Sun

2011 IEEE 17th International Symposium on High Performance Computer Architecture > 111 - 120

2011 IEEE 17th International Symposium on High Performance Computer Architecture (HPCA)

Future CMPs will combine many simple cores with deep cache hierarchies. With more cores, cache resources per core are fewer, and must be shared carefully to avoid poor utilization due to conflicts and pollution. Explicit motion of data in these architectures, such as message passing, can provide hints about program behavior that can be used to hide latency and improve cache behavior. However, to make...

article

MOPED: Accelerating Data Communication on Future CMPs

Junli Gu, Yihe Sun, Steven S. Lumetta, Rakeshh Kumar

IEEE Micro > 2011 > 31 > 4 > 42 - 50

The Message Orchestration and Performance Enhancement Device (MOPED) provides an explicit hardware communication mechanism that offloads synchronization and data communication from CPUs to enable overlap between computation and communication, while also transferring data efficiently. The device achieves significant improvement in performance of real applications and reduction of on-chip cache misses,...

Filter options

Publication date

Set your own date range

INFONA - science communication portal

Search results for: Junli Gu

Heterogeneous system coherence for integrated CPU-GPU systems

PPEP: Online Performance, Power, and Energy Prediction Framework and DVFS Space Exploration

iCHAT: Inter-cache Hardware-Assistant Data Transfer for Heterogeneous Chip Multiprocessors

Optimizing a Parallel Video Encoder with Message Passing and a Shared Memory Architecture

MOPED: Orchestrating interprocess message data on CMPs

MOPED: Accelerating Data Communication on Future CMPs

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Data set

Journal

Reporting an error / abuse

Sending the report failed

Accessibility options