Search results for: Bo Wu

Items from 1 to 5 out of 5 results

article

Optimizing Data Placement on GPU Memory: A Portable Approach

Guoyang Chen, Xipeng Shen, Bo Wu, Dong Li

IEEE Transactions on Computers > 2017 > 66 > 3 > 473 - 487

Modern GPUs feature complex memory system designs. One GPU may contain many types of memory of different properties. The best way to place data in memory is sensitive to many factors (e.g., program inputs, architectures), making portable optimizations of GPU data placement a difficult challenge. PORPLE is a recently proposed method that overcomes the difficulties by enabling online optimizations of...

article

Enabling Portable Optimizations of Data Placement on GPU

Guoyang Chen, Bo Wu, Dong Li, Xipeng Shen

IEEE Micro > 2015 > 35 > 4 > 16 - 24

Modern GPU memory systems manifest more varieties, increasing complexities, and rapid changes. Different placements of data on memory systems often cause significant differences in program performance. Most current GPU programming systems rely on programmers to indicate the appropriate placements, but finding the appropriate placements is difficult for programmers in practice owing to the complexity...

chapter

PORPLE: An Extensible Optimizer for Portable Data Placement on GPU

Guoyang Chen, Bo Wu, Dong Li, Xipeng Shen

2014 47th Annual IEEE/ACM International Symposium on Microarchitecture > 88 - 100

2014 47th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO)

GPU is often equipped with complex memory systems, including globalmemory, texture memory, shared memory, constant memory, and variouslevels of cache. Where to place the data is important for theperformance of a GPU program. However, the decision is difficult for aprogrammer to make because of architecture complexity and thesensitivity of suitable data placements to input and architecturechanges.This...

chapter

SM-centric transformation: Circumventing hardware restrictions for flexible GPU scheduling

Bo Wu, Guoyang Chen, Dong Li, Xipeng Shen, more

2014 23rd International Conference on Parallel Architecture and Compilation (PACT) > 497 - 498

2014 23rd International Conference on Parallel Architecture and Compilation (PACT)

To circumvent the limitation from the hardware scheduler on GPU, we create an SM-centric transformation technique. This technique enables complete control of the mapping between tasks and streaming multi-processors (SMs), and enables controlling the number of active thread blocks on each SM. Results show that our approach achieves better speedup than previous ones with kernel co-run cases.

chapter

Enhancing Data Locality for Dynamic Simulations through Asynchronous Data Transformations and Adaptive Control

Bo Wu, Eddy Z. Zhang, Xipeng Shen

2011 International Conference on Parallel Architectures and Compilation Techniques > 243 - 252

2011 International Conference on Parallel Architectures and Compilation Techniques (PACT)

Many dynamic simulation programs contain complex, irregular memory reference patterns, and require runtime optimizations to enhance data locality. Current approaches periodically stop the execution of an application to reorder the computation or data based on the current program state to improve the data locality for the next period of execution. In this work, we examine the implications that modern...

Filter options

Keywords:
RUNTIME

Publication date

Set your own date range

INFONA - science communication portal

Search results for: Bo Wu

Optimizing Data Placement on GPU Memory: A Portable Approach

Enabling Portable Optimizations of Data Placement on GPU

PORPLE: An Extensible Optimizer for Portable Data Placement on GPU

SM-centric transformation: Circumventing hardware restrictions for flexible GPU scheduling

Enhancing Data Locality for Dynamic Simulations through Asynchronous Data Transformations and Adaptive Control

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Journal

Reporting an error / abuse

Sending the report failed

Accessibility options