Search results for: Gunjae Koo

Items from 1 to 7 out of 7 results

article

Improving Energy Efficiency of GPUs through Data Compression and Compressed Execution

Sangpil Lee, Keunsoo Kim, Gunjae Koo, Hyeran Jeon, et al.

IEEE Transactions on Computers > 2017 > 66 > 5 > 834 - 847

GPU design trends show that the register file size will continue to increase to enable even more thread-level parallelism. As a result, the register file consumes a large fraction of the total GPU chip power. This paper explores register file data compression for GPUs to improve power efficiency. Compression reduces the width of the register file read and write operations, which in turn reduces dynamic...

chapter

Access pattern-aware cache management for improving data utilization in GPU

Gunjae Koo, Yunho Oh, Won Woo Ro, Murali Annavaram

2017 ACM/IEEE 44th Annual International Symposium on Computer Architecture (ISCA) > 307 - 319

The long latency of memory operations is a prominent performance bottleneck in graphics processing units (GPUs). The small data cache that must be shared across dozens of warps (a warp is a collection of threads) creates significant cache contention and premature data eviction. Prior works have recognized this problem and proposed warp throttling, which reduces the number of active warps contending for cache space...

chapter

Warped-preexecution: A GPU pre-execution approach for improving latency hiding

Sangpil Lee, Won Woo Ro, Keunsoo Kim, Gunjae Koo, et al.

2016 IEEE International Symposium on High Performance Computer Architecture (HPCA) > 163 - 175

This paper presents a pre-execution approach for improving GPU performance, called P-mode (pre-execution mode). GPUs utilize a number of concurrent threads to hide the processing delay of operations. However, certain long-latency operations such as off-chip memory accesses often take hundreds of cycles and hence lead to stalls even in the presence of thread concurrency and fast thread switching capability...

chapter

Revealing Critical Loads and Hidden Data Locality in GPGPU Applications

Gunjae Koo, Hyeran Jeon, Murali Annavaram

2015 IEEE International Symposium on Workload Characterization (IISWC) > 120 - 129

In graphics processing units (GPUs), memory access latency is one of the most critical performance hurdles. Several warp schedulers and memory prefetching algorithms have been proposed to avoid the long memory access latency. Prior application characterization studies shed light on the interaction between applications, GPU microarchitecture, and memory subsystem behavior. Most of these studies, however,...

chapter

Warped-Compression: Enabling power efficient GPUs through register compression

Sangpil Lee, Keunsoo Kim, Gunjae Koo, Hyeran Jeon, et al.

2015 ACM/IEEE 42nd Annual International Symposium on Computer Architecture (ISCA) > 502 - 514

This paper presents Warped-Compression, a warp-level register compression scheme for reducing GPU power consumption. This work is motivated by the observation that the register values of threads within the same warp are similar; namely, the arithmetic differences between two successive thread registers are small. Removing data redundancy of register values through register compression reduces the effective...

chapter

Complementary block-based motion estimation for frame rate up-conversion

Gunjae Koo, Kyoung Won Lim, Seung Jong Choi

2011 IEEE International Conference on Consumer Electronics (ICCE) > 523 - 524

In this paper, we present a complementary motion estimation algorithm for motion-compensated frame rate up-conversion. The proposed algorithm combines forward and backward motion estimation results to make up for the weaknesses of each motion estimation method. It also allocates true motion vectors in occlusion regions by using the temporal relations of the forward and backward motion estimation. Thus,...

chapter

A robust PRML read channel with digital timing recovery for multi-format optical disc

Gunjae Koo, Woochul Jung, Heesub Lee

2006 IEEE International Symposium on Circuits and Systems > 4 pp.

In this paper, a PRML read channel that supports multiple optical disc formats, i.e., CD, DVD, and BD, is presented. The read channel includes digital timing recovery that generates timing-matched data by interpolation, which achieves high controllability and stability with small hardware. PRML bit detection is applied to the read channel in order to reduce bit errors under severe channel conditions...

INFONA - science communication portal
