Jee Ho Ryoo

chapter

SILC-FM: Subblocked InterLeaved Cache-Like Flat Memory Organization

Jee Ho Ryoo, Mitesh R. Meswani, Andreas Prodromou, Lizy K. John

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA) > 349 - 360

2017 IEEE International Symposium on High Performance Computer Architecture (HPCA)

With current DRAM technology reaching its limit, emerging heterogeneous memory systems have become attractive to keep the memory performance scaling. This paper argues for using a small, fast memory closer to the processor as part of a flat address space where the memory system is composed of two or more memory types. OS-transparent management of such memory has been proposed in prior works such as...

chapter

Rethinking TLB designs in virtualized environments: A very large part-of-memory TLB

Jee Ho Ryoo, Nagendra Gulur, Shuang Song, Lizy K. John

2017 ACM/IEEE 44th Annual International Symposium on Computer Architecture (ISCA) > 469 - 480

2017 ACM/IEEE 44th Annual International Symposium on Computer Architecture (ISCA)

With increasing deployment of virtual machines for cloud services and server applications, memory address translation overheads in virtualized environments have received great attention. In the radix-4 type of page tables used in x86 architectures, a TLB-miss necessitates up to 24 memory references for one guest to host translation. While dedicated page walk caches and such recent enhancements eliminate...

chapter

Proxy-Guided Load Balancing of Graph Processing Workloads on Heterogeneous Clusters

Shuang Song, Meng Li, Xinnian Zheng, Michael LeBeane, more

2016 45th International Conference on Parallel Processing (ICPP) > 77 - 86

2016 45th International Conference on Parallel Processing (ICPP)

Big data decision-making techniques take advantage of large-scale data to extract important insights from them. One of the most important classes of such techniques falls in the domain of graph applications, where data segments and their inherent relationships are represented as vertices and edges. Efficiently processing large-scale graphs involves many subtle tradeoffs and is still regarded as an...

chapter

Genesys: Automatically generating representative training sets for predictive benchmarking

Reena Panda, Xinnian Zheng, Shuang Song, Jee Ho Ryoo, more

2016 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation (SAMOS) > 116 - 123

2016 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation (SAMOS)

Fast and efficient design space exploration is a critical requirement for designing computer systems, however, the growing complexity of hardware/software systems and significantly long run-times of detailed simulators often makes it challenging. Machine learning (ML) models have been proposed as popular alternatives that enable fast exploratory studies. The accuracy of any ML model depends heavily...

chapter

POSTER: SILC-FM: Subblocked interleaved Cache-Like Flat Memory Organization

Jee Ho Ryoo, Mitesh R. Meswani, Reena Panda, Lizy K. John

2016 International Conference on Parallel Architecture and Compilation Techniques (PACT) > 435 - 437

2016 International Conference on Parallel Architecture and Compilation Techniques (PACT)

In this paper, we present a flat address space organization called SILC-FM that allows subblocks from two pages to coexist in an interleaved fashion in die-stacked DRAM. Data movement at subblocked granularity consumes less bandwidth compared to migrating the entire large block and prevents fetching useless subblocks that may never get accessed. SILC-FM can get more spatial locality hits than CAMEO...

article

Dynamic Core Allocation and Packet Scheduling in Multicore Network Processors

Muhammad Faisal Iqbal, Jim Holt, Jee Ho Ryoo, Gustavo de Veciana, more

IEEE Transactions on Computers > 2016 > 65 > 12 > 3646 - 3660

With ever increasing network traffic rates, multicore architectures for network processors have successfully provided performance improvements through high parallelism. However, naively allocating the network traffic to multiple cores without considering diversified applications and flow locality results in issues such as packet reordering, load imbalance and inefficient cache usage. Consequently,...

chapter

Watt Watcher: Fine-Grained Power Estimation for Emerging Workloads

Michael LeBeane, Jee Ho Ryoo, Reena Panda, Lizy Kurian John

2015 27th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD) > 106 - 113

2015 27th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD)

Extensive research has focused on estimating power to guide advances in power management schemes, thermal hot spots, and voltage noise. However, simulated power models are slow and struggle with deep software stacks, while direct measurements are typically coarse-grained. This paper introduces Watt Watcher, a multicore power measurement framework that offers fine-grained functional unit breakdowns...

chapter

Performance Characterization of Modern Databases on Out-of-Order CPUs

Reena Panda, Christopher Erb, Michael LeBeane, Jee Ho Ryoo, more

2015 27th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD) > 114 - 121

2015 27th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD)

Big data revolution has created an unprecedented demand for intelligent data management solutions on a large scale. While data management has traditionally been used as a synonym for relational data processing, in recent years a new group popularly known as NoSQL databases have emerged as a competitive alternative. There is a pressing need to gain greater understanding of the characteristics of modern...

chapter

i-MIRROR: A Software Managed Die-Stacked DRAM-Based Memory Subsystem

Jee Ho Ryoo, Karthik Ganesan, Yao-Min Chen, Lizy Kurian John

2015 27th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD) > 82 - 89

2015 27th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD)

This paper presents an operating system managed die-stacked DRAM called i-MIRROR that mirrors high locality pages from off-chip DRAM. Optimizing the problems of reducing cache tag area, reducing transfer bandwidth and improving hit latency altogether while using die-stacked DRAM as hardware cache is extremely challenging. In this paper, we show that performance and energy efficiency can be obtained...

chapter

GPGPU Benchmark Suites: How Well Do They Sample the Performance Spectrum?

Jee Ho Ryoo, Saddam J. Quirem, Michael Lebeane, Reena Panda, more

2015 44th International Conference on Parallel Processing > 320 - 329

2015 44th International Conference on Parallel Processing (ICPP)

Recently, GPGPUs have positioned themselves in the mainstream processor arena with their potential to perform a massive number of jobs in parallel. At the same time, many GPGPU benchmark suites have been proposed to evaluate the performance of GPGPUs. Both academia and industry have been introducing new sets of benchmarks each year while some already published benchmarks have been updated periodically...

chapter

PowerTrain: A learning-based calibration of McPAT power models

Wooseok Lee, Youngchun Kim, Jee Ho Ryoo, Dam Sunwoo, more

2015 IEEE/ACM International Symposium on Low Power Electronics and Design (ISLPED) > 189 - 194

2015 IEEE/ACM International Symposium on Low Power Electronics and Design (ISLPED)

As research on improving energy efficiency becomes prevalent, the necessity of a tool to accurately estimate power is increasing. Among various tools proposed, McPAT has gained some popularity due to its easy-to-use analytical power models. However, McPAT's prediction has several limitations. Although under- or over-estimated power from unmodeled and mis-modeled parts offset each other, it still incorporates...

chapter

Data partitioning strategies for graph workloads on heterogeneous clusters

Michael LeBeane, Shuang Song, Reena Panda, Jee Ho Ryoo, more

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis > 1 - 12

SC15: International Conference for High Performance Computing, Networking, Storage and Analysis

Large scale graph analytics are an important class of problem in the modern data center. However, while data centers are trending towards a large number of heterogeneous processing nodes, graph analytics frameworks still operate under the assumption of uniform compute resources. In this paper, we develop heterogeneity-aware data ingress strategies for graph analytics workloads using the popular PowerGraph...

chapter

Control flow behavior of cloud workloads

Jee Ho Ryoo, Michael LeBeane, Muhammad Faisal Iqbal, Lizy K. John

2014 IEEE International Symposium on Workload Characterization (IISWC) > 71 - 73

2014 IEEE International Symposium on Workload Characterization (IISWC)

With massive amounts of information on the web, cloud applications are rapidly emerging as one of the main-stream domains in modern computing, yet very little is known about their behavior. To our knowledge, this paper presents the first detailed study of control flow behavior in cloud workloads. We characterize branch predictability behavior of cloud and big data benchmarks, and compare against those...

chapter

Flow Migration on Multicore Network Processors: Load Balancing While Minimizing Packet Reordering

Muhammad Faisal Iqbal, Jim Holt, Jee Ho Ryoo, Lizy K. John, more

2013 42nd International Conference on Parallel Processing > 150 - 159

2013 42nd International Conference on Parallel Processing (ICPP)

With ever increasing network traffic rates, multicore architectures for network processors have successfully provided performance improvements through high parallelism. However, naively allocating the network traffic to multiple cores without considering diversified applications and flow locality results in issues such as packet reordering, load imbalance and inefficient cache usage. Consequently,...

chapter

Containment domains: A scalable, efficient, and flexible resilience scheme for exascale systems

Jinsuk Chung, Ikhwan Lee, Michael Sullivan, Jee Ho Ryoo, more

2012 International Conference for High Performance Computing, Networking, Storage and Analysis > 1 - 11

2012 SC - International Conference for High Performance Computing, Networking, Storage and Analysis

This paper describes and evaluates a scalable and efficient resilience scheme based on the concept of containment domains. Containment domains are a programming construct that enable applications to express resilience needs and to interact with the system to tune and specialize error detection, state preservation and restoration, and recovery schemes. Containment domains have weak transactional semantics...

INFONA - science communication portal

Search results for: Jee Ho Ryoo

SILC-FM: Subblocked InterLeaved Cache-Like Flat Memory Organization

Rethinking TLB designs in virtualized environments: A very large part-of-memory TLB

Proxy-Guided Load Balancing of Graph Processing Workloads on Heterogeneous Clusters

Genesys: Automatically generating representative training sets for predictive benchmarking

POSTER: SILC-FM: Subblocked interleaved Cache-Like Flat Memory Organization

Dynamic Core Allocation and Packet Scheduling in Multicore Network Processors

Watt Watcher: Fine-Grained Power Estimation for Emerging Workloads

Performance Characterization of Modern Databases on Out-of-Order CPUs

i-MIRROR: A Software Managed Die-Stacked DRAM-Based Memory Subsystem

GPGPU Benchmark Suites: How Well Do They Sample the Performance Spectrum?

PowerTrain: A learning-based calibration of McPAT power models

Data partitioning strategies for graph workloads on heterogeneous clusters

Control flow behavior of cloud workloads

Flow Migration on Multicore Network Processors: Load Balancing While Minimizing Packet Reordering

Containment domains: A scalable, efficient, and flexible resilience scheme for exascale systems

Filter options

Publication date

Publication type

Keywords

INFONA - science communication portal

Search results for: Jee Ho Ryoo

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options