Soojung Ryu

chapter

Accelerating vector graphics on low-end device

Jeong-Joon Yoo, Sundeep Krishnadasan, Youngsam Shin, Won-Jong Lee, more

2017 IEEE International Conference on Consumer Electronics (ICCE) > 180 - 181

2017 IEEE International Conference on Consumer Electronics (ICCE)

In this paper, we present an efficient vector graphics rendering algorithm which is suitable to use on low-end device. To enjoy high performance vector graphics on low-end device, our algorithm must satisfy two folds; i) providing parallel rendering scheme, ii) removing redundant computations. To do so, we propose BSP Tree-based vector graphics rendering which provides a good solution in such situation...

chapter

Fast stereoscopic rendering on mobile ray tracing GPU for virtual reality applications

Won-Jong Lee, Seok Joong Hwang, Youngsam Shin, Jeong-Joon Yoo, more

2017 IEEE International Conference on Consumer Electronics (ICCE) > 355 - 357

2017 IEEE International Conference on Consumer Electronics (ICCE)

In this paper, we present a stereoscopic rendering based on a mobile ray tracing GPU. Adopting an existing algorithm to new mobile GPUs specialized for ray tracing enables two high performance techniques such as reprojection and tile-based rendering. Experimental results show that our implementation can be a versatile solution for future virtual reality applications, as it achieves up to 1.64 times...

chapter

Towards an efficient data transfer on mobile device: A case study on ray-tracing

Youngsam Shin, Won-Jong Lee, Jeong-Joon Yoo, Soojung Ryu

2017 IEEE International Conference on Consumer Electronics (ICCE) > 358 - 359

2017 IEEE International Conference on Consumer Electronics (ICCE)

In this paper, we propose an efficient ray scheduling algorithm and non-block cache architecture to hiding main-memory access latency targeting real-time ray tracing on mobile device. We first analyze on the impact of memory latency by analyzing the memory access patterns for a ray tracing system and present an energy efficient data transmission method using a dedicated interface between the processor...

chapter

Dynamic clock synchronization scheme between voltage domains in multi-core architecture

Jaehyun Kim, Kiyoung Choi, Sangheon Lee, Soojung Ryu

2016 IFIP/IEEE International Conference on Very Large Scale Integration (VLSI-SoC) > 1 - 6

2016 IFIP/IEEE International Conference on Very Large Scale Integration (VLSI-SoC)

Using independent voltage (and frequency) domains for cores and caches allows us to achieve high energy efficiency since it enables operating the cores and caches at their own optimal voltages. However, it incurs a clock synchronization problem between the core and cache voltage domains. One of the conventional solutions is to add asynchronous FIFOs on the domain crossing boundary, but it degrades...

chapter

iPAWS: Instruction-issue pattern-based adaptive warp scheduling for GPGPUs

Minseok Lee, Gwangsun Kim, John Kim, Woong Seo, more

2016 IEEE International Symposium on High Performance Computer Architecture (HPCA) > 370 - 381

2016 IEEE International Symposium on High Performance Computer Architecture (HPCA)

Thread or warp scheduling in GPGPUs has been shown to have a significant impact on overall performance. Recently proposed warp schedulers have been based on a greedy warp scheduler where some warps are prioritized over other warps. However, a single warp scheduling policy does not necessarily provide good performance across all types of workloads; in particular, we show that greedy warp schedulers...

chapter

Selective multi-sample anti-aliasing for mobile vector graphics

Jaedon Lee, Jeong-Joon Yoo, Soojung Ryu, Jeongwook Kim

2016 IEEE International Conference on Consumer Electronics (ICCE) > 174 - 175

2016 IEEE International Conference on Consumer Electronics (ICCE)

Vector graphics is a key technology for drawing 2D graphics images on the mobile devices. As screen resolution increases and multi-touch interface is widely used in mobile devices, the efficient solution for vector graphics becomes more important. For future mobile environments, we should process vector graphics with high performance and low power. In this paper, we have proposed the efficient anti-aliasing...

chapter

PerfEPI: Parallel performance estimation with effective progress index

Youngsam Shin, Won-Jong Lee, Seok Joong Hwang, Soojung Ryu

2016 IEEE International Conference on Consumer Electronics (ICCE) > 29 - 30

2016 IEEE International Conference on Consumer Electronics (ICCE)

Multi-core system has merits in terms of energy efficiency and performance enhancement compared to the single core. However, design and development of a system using the multiple processors are very difficult, and in particular, verification of a system having concurrency may be difficult. This makes parallel system design hard, so developers must spend substantial amounts of time for design and debugging...

chapter

Tile binning and rendering for resolution independent graphics

Jeong-Joon Yoo, Jaedon Lee, Sundeep Krishnadasan, Wonjong Lee, more

2016 IEEE International Conference on Consumer Electronics (ICCE) > 71 - 72

2016 IEEE International Conference on Consumer Electronics (ICCE)

In this paper, we present an efficient resolution independent path rendering algorithm. To do so, we propose tile binning and rendering algorithm for resolution independent path rendering which is suitable to use on mobile device. Experimental comparisons show that our scheme reduces not only most of memory I/O overhead but also 50% of computation overhead. As the result, most of mobile phones can...

chapter

Path rendering using winding number generator

Jeong-Joon Yoo, Sundeep Krishnadasan, John Brothers, Seokyoon Jung, more

2015 IEEE International Conference on Consumer Electronics (ICCE) > 96 - 97

2015 IEEE International Conference on Consumer Electronics (ICCE)

In this paper, we propose a computing intensive path rendering scheme. Because legacy path rendering schemes are memory I/O bound they are not suitable to the high resolution display. To do so, we propose to use winding number generator which generates per pixel winding number in parallel manner. When we use the winding number generator, computing latency (cycles) for path rendering are reduced into...

chapter

Simulation-based memory dependence checker for CGRA-mapped code verification

Heejun Shim, Soojung Ryu

2014 IEEE International Symposium on Circuits and Systems (ISCAS) > 1235 - 1238

2014 IEEE International Symposium on Circuits and Systems (ISCAS)

In a coarse-grained reconfigurable array (CGRA) architecture, software pipelining is primarily used to improve performance by exploiting loop-level parallelism (LLP). In this technique, the loop-carried memory dependence in user code prevents high parallelism, and it is difficult to be detected. In this paper, we propose a simulation-based memory dependence checker, which is used in the verification...

chapter

Full-stream architecture for ray tracing with efficient data transmission

Youngsam Shin, Jaedon Lee, Won-Jong Lee, Soojung Ryu, more

2014 IEEE International Symposium on Circuits and Systems (ISCAS) > 2165 - 2168

2014 IEEE International Symposium on Circuits and Systems (ISCAS)

In this paper, we focus on the impact of a memory bandwidth limitation by analyzing the bandwidth consumption for a ray tracing system and present an energy efficient data transmission method using a dedicated interface between the processor and ray tracing hardware engine. To achieve real-time ray tracing, we propose a full-stream architecture through the use of this dedicated interface. For an evaluation...

chapter

Quantitative comparison of the power reduction techniques for samsung reconfigurable processor

Hoyoung Kim, Soojung Ryu, Abhishek Sinkar, Nam Sung Kim

2014 IEEE International Symposium on Circuits and Systems (ISCAS) > 1736 - 1739

2014 IEEE International Symposium on Circuits and Systems (ISCAS)

With significant growth in portable multimedia devices such as smartphones, application processors (AP) play a critical role for running various multimedia applications on these devices. By considering the power constraints of such devices, we often integrate reconfigurable processors (RPs) into APs. This is because RPs offer flexibility and good performance, thereby greatly improving the power efficiency...

chapter

SimParallel: A high performance parallel SystemC simulator using hierarchical multi-threading

Moo-Kyoung Chung, Jun-Kyoung Kim, Soojung Ryu

2014 IEEE International Symposium on Circuits and Systems (ISCAS) > 1472 - 1475

2014 IEEE International Symposium on Circuits and Systems (ISCAS)

As the system complexity increases, the simulation performance becomes one of the most important issues in virtual prototyping. Parallel simulation is a fascinating technique for high-speed simulation utilizing state of the art multi-core processors on a host workstation, but the efficiency of the parallel simulation is low because of the synchronization and communication overhead and unbalanced workloads...

chapter

Improving GPGPU resource utilization through alternative thread block scheduling

Minseok Lee, Seokwoo Song, Joosik Moon, John Kim, more

2014 IEEE 20th International Symposium on High Performance Computer Architecture (HPCA) > 260 - 271

2014 IEEE 20th International Symposium on High Performance Computer Architecture (HPCA)

High performance in GPGPU workloads is obtained by maximizing parallelism and fully utilizing the available resources. The thousands of threads are assigned to each core in units of CTA (Cooperative Thread Arrays) or thread blocks - with each thread block consisting of multiple warps or wavefronts. The scheduling of the threads can have significant impact on overall performance. In this work, explore...

chapter

Energy-efficient scheduling for memory-intensive GPGPU workloads

Seokwoo Song, Minseok Lee, John Kim, Woong Seo, more

2014 Design, Automation & Test in Europe Conference & Exhibition (DATE) > 1 - 6

2014 Design, Automation & Test in Europe Conference & Exhibition (DATE)

High performance for a GPGPU workload is obtained by maximizing parallelism and fully utilizing the available resources. However, this is not necessarily energy efficient, especially for memory-intensive GPGPU workloads. In this work, we propose Throttle CTA (cooperative-thread array) Scheduling (TCS) where we leverage two type of throttling — throttling the number of actives cores and throttling...

chapter

Tile boundary sharing for tile-based vector graphics rendering

Jeong-Joon Yoo, Seokyoon Jung, Soojung Ryu, Jeongwook Kim

2014 IEEE International Conference on Consumer Electronics (ICCE) > 93 - 94

2014 IEEE International Conference on Consumer Electronics (ICCE)

In this paper, we present an efficient curve rasterization method that effectively reduces duplicated computations in a tile-based rendering. To do so, Tile Boundary Sharing (TBS) method is proposed that shares boundary information between neighbor tiles. When we use the TBS, computing cycles for tile-based vector graphics are reduced into 21∼34%.

chapter

Low-power reconfigurable audio processor for mobile devices

Seunghun Jin, Woong Seo, Yeon-Gon Cho, Soojung Ryu

2014 IEEE International Conference on Consumer Electronics (ICCE) > 369 - 370

2014 IEEE International Conference on Consumer Electronics (ICCE)

Low-power processing of multimedia data is mandatory for the recent mobile devices. In this paper, we present coarse-grained reconfigurable processor for low-power audio processing. By utilizing perfect instruction cache and tightly-coupled scratchpad memory, we can eliminate all the bandwidth consumption caused by external memory access while decoding compressed audio data. Acceleration of audio...

chapter

Seeded region growing on multi-core system

Sangheon Lee, Yeongon Cho, Soojung Ryu, Byeonghun Lee, more

2014 IEEE International Conference on Consumer Electronics (ICCE) > 490 - 491

2014 IEEE International Conference on Consumer Electronics (ICCE)

This paper presents an implementation of seeded region growing (SRG) algorithm on multi-core system to achieve real time constraint with high precision. The proposed implementation has dynamic load balancing feature inherently and shows a speedup of 13.3 times with 16 cores.

chapter

Hierarchical Verification Framework for Samsung Reconfigurable Processor Video System

Hoyoung Kim, Seonghun Jeong, Sunmin Kwon, Soojung Ryu

2013 14th International Workshop on Microprocessor Test and Verification > 14 - 18

2013 14th International Workshop on Microprocessor Test and Verification (MTV)

The Samsung reconfigurable processor (SRP) is developed to accelerate multimedia applications such as video decoding, audio decoding, and image processing. Owing to coarse-grained reconfigurable array (CGRA) acceleration via software (SW) pipelining and application-specific intrinsic instructions, SRP outperforms other digital signal processors (DSPs) in these application domains. In addition, recent...

chapter

Adaptive compression for instruction code of Coarse Grained Reconfigurable Architectures

Moo-Kyoung Chung, Jun-Kyoung Kim, Yeon-Gon Cho, Soojung Ryu

2013 International Conference on Field-Programmable Technology (FPT) > 394 - 397

2013 International Conference on Field-Programmable Technology (FPT)

Coarse Grained Reconfigurable Architecture (CGRA) achieves high performance by exploiting instruction-level parallelism with software pipeline. Large instruction memory is, however, a critical problem of CGRA, which requires large silicon area and power consumption. Code compression is a promising technique to reduce the memory area, bandwidth requirements, and power consumption. We present an adaptive...

INFONA - science communication portal

Search results for: Soojung Ryu

Accelerating vector graphics on low-end device

Fast stereoscopic rendering on mobile ray tracing GPU for virtual reality applications

Towards an efficient data transfer on mobile device: A case study on ray-tracing

Dynamic clock synchronization scheme between voltage domains in multi-core architecture

iPAWS: Instruction-issue pattern-based adaptive warp scheduling for GPGPUs

Selective multi-sample anti-aliasing for mobile vector graphics

PerfEPI: Parallel performance estimation with effective progress index

Tile binning and rendering for resolution independent graphics

Path rendering using winding number generator

Simulation-based memory dependence checker for CGRA-mapped code verification

Full-stream architecture for ray tracing with efficient data transmission

Quantitative comparison of the power reduction techniques for samsung reconfigurable processor

SimParallel: A high performance parallel SystemC simulator using hierarchical multi-threading

Improving GPGPU resource utilization through alternative thread block scheduling

Energy-efficient scheduling for memory-intensive GPGPU workloads

Tile boundary sharing for tile-based vector graphics rendering

Low-power reconfigurable audio processor for mobile devices

Seeded region growing on multi-core system

Hierarchical Verification Framework for Samsung Reconfigurable Processor Video System

Adaptive compression for instruction code of Coarse Grained Reconfigurable Architectures

Filter options

Publication date

Publication type

Keywords

INFONA - science communication portal

Search results for: Soojung Ryu

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options