Wyniki wyszukiwania dla: Praveen Yedlapalli

Pozycje od 1 do 11 spośród 11 wyników

artykuł

Cache Hierarchy-Aware Query Mapping on Emerging Multicore Architectures

Ozcan Ozturk, Umut Orhan, Wei Ding, Praveen Yedlapalli, więcej

IEEE Transactions on Computers > 2017 > 66 > 3 > 403 - 415

One of the important characteristics of emerging multicores/manycores is the existence of “shared on-chip caches,” through which different threads/processes can share data (help each other) or displace each other’s data (hurt each other). Most of current commercial multicore systems on the market have on-chip cache hierarchies with multiple layers (typically, in the form of L1, L2 and L3, the last...

rozdział

Improving bank-level parallelism for irregular applications

Xulong Tang, Mahmut Kandemir, Praveen Yedlapalli, Jagadish Kotra

2016 49th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO) > 1 - 12

2016 49th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO)

Observing that large multithreaded applications with irregular data access patterns exhibit very low memory bank-level parallelism (BLP) during their execution, we propose a novel loop iteration scheduling strategy built upon the inspector-executor paradigm. A unique characteristic of this strategy is that it considers both bank-level parallelism (from an inter-core perspective) and bank reuse (from...

rozdział

Domain knowledge based energy management in handhelds

Nachiappan Chidambaram Nachiappan, Praveen Yedlapalli, Niranjan Soundararajan, Anand Sivasubramaniam, więcej

2015 IEEE 21st International Symposium on High Performance Computer Architecture (HPCA) > 150 - 160

2015 IEEE 21st International Symposium on High Performance Computer Architecture (HPCA)

Energy management in handheld devices is becoming a daunting task with the growing number of accelerators, increasing memory demands and high computing capacities required to support applications with stringent QoS needs. Current DVFS techniques that modulate power states of a single hardware component, or even recent proposals that manage multiple components, can lose out opportunities for attaining...

rozdział

Short-Circuiting Memory Traffic in Handheld Platforms

Praveen Yedlapalli, Nachiappan Chidambaram Nachiappan, Niranjan Soundararajan, Anand Sivasubramaniam, więcej

2014 47th Annual IEEE/ACM International Symposium on Microarchitecture > 166 - 177

2014 47th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO)

Handheld devices are ubiquitous in today's world. With their advent, we also see a tremendous increase in device-user interactivity and real-time data processing needs. Media (audio/video/camera) and gaming use-cases are gaining substantial user attention and are defining product successes. The combination of increasing demand from these use-cases and having to run them at low power (from a battery)...

rozdział

A cache topology-aware multi-query scheduler for multicore architectures

Umut Orhan, Wei Ding, Praveen Yedlapalli, Mahmut Kandemir, więcej

2014 IEEE International Symposium on Workload Characterization (IISWC) > 86 - 87

2014 IEEE International Symposium on Workload Characterization (IISWC)

Growing performance gap between processors and main memory has made it worthwhile to consider off-chip data accesses in multi-query processing [2], [1], [3]. Exploiting data-sharing opportunities among concurrent queries can be critical for effective utilization of the underlying shared memory hierarchy. Given a set of queries, there may be a common retrieval operation for several cases to the same...

rozdział

QoS aware dynamic time-slice tuning

Yang Ding, Praveen Yedlapalli, Mahmut Kandemir

2014 IEEE International Symposium on Workload Characterization (IISWC) > 84 - 85

2014 IEEE International Symposium on Workload Characterization (IISWC)

The ability to manage quality of service (QoS) and to provide service differentiation has been very important in a wide range of computing environments [4], [3], [5]. In modern operating systems, multiple applications share processor cores and take turns to execute. Each application typically runs for a while before its CPU time-slice (allocated quantum) expires or the execution is blocked due to...

rozdział

Trading cache hit rate for memory performance

Wei Ding, Mahmut Kandemir, Diana Guttman, Adwait Jog, więcej

2014 23rd International Conference on Parallel Architecture and Compilation (PACT) > 357 - 368

2014 23rd International Conference on Parallel Architecture and Compilation (PACT)

Most of the prior compiler based data locality optimization works target exclusively cache locality optimization, and row-buffer locality in DRAM banks received much less attention. In particular, to the best of our knowledge, there is no single compiler based approach that can improve row-buffer locality in executing irregular applications. This presents a critical problem considering the fact that...

rozdział

Meeting midway: Improving CMP performance with memory-side prefetching

Praveen Yedlapalli, Jagadish Kotra, Emre Kultursay, Mahmut Kandemir, więcej

Proceedings of the 22nd International Conference on Parallel Architectures and Compilation Techniques > 289 - 298

2013 22nd International Conference on Parallel Architectures and Compilation Techniques (PACT)

Both on-chip resource contention and off-chip latencies have a significant impact on memory requests in large-scale chip multiprocessors. We propose a memory-side prefetcher, which brings data on-chip from DRAM, but does not proactively further push this data to the cores/caches. Sitting close to memory, it avails close knowledge of DRAM state and memory channels to leverage DRAM row buffer locality...

rozdział

Locality-aware mapping and scheduling for multicores

Wei Ding, Yuanrui Zhang, Mahmut Kandemir, Jithendra Srinivas, więcej

Proceedings of the 2013 IEEE/ACM International Symposium on Code Generation and Optimization (CGO) > 1 - 12

2013 IEEE/ACM International Symposium on Code Generation and Optimization (CGO)

This paper presents a cache hierarchy-aware code mapping and scheduling strategy for multicore architectures. Our mapping strategy determines a loop iteration-to-core mapping by taking into account application data access patterns and on-chip cache hierarchy. It employs a novel concept called “core vectors” to obtain a mapping matrix which exploits data reuses at different layers of the cache hierarchy...

rozdział

Cooperative parallelization

Praveen Yedlapalli, Emre Kultursay, Mahmut T. Kandemir

2011 IEEE/ACM International Conference on Computer-Aided Design (ICCAD) > 134 - 141

2011 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)

We propose a cooperation between the programmer, the compiler and the runtime system to identify, exploit and efficiently exercise the parallelism available in many pointer based applications. Our parallelization strategy, called Cooperative Parallelization, is driven by programmer directives as well as runtime information. We show that minimal information from the programmer can be combined with...

rozdział

A special-purpose compiler for look-up table and code generation for function evaluation

Yuanrui Zhang, Lanping Deng, Praveen Yedlapalli, Sai Prashanth Muralidhara, więcej

2010 Design, Automation&Test in Europe Conference&Exhibition (DATE 2010) > 1130 - 1135

2010 Design, Automation & Test in Europe Conference & Exhibition (DATE 2010)

Elementary functions are extensively used in computer graphics, signal and image processing, and communication systems. This paper presents a special-purpose compiler that automatically generates customized look-up tables and implementations for elementary functions under user given constraints. The generated implementations include a C/C++ code that can be used directly by applications running on...

Opcje filtrowania

Data publikacji

Ustaw własny zakres dat

Typ publikacji

książka (10)
artykuł (1)

Słowa kluczowe

SYSTEM-ON-CHIP (4)
MULTICORE PROCESSING (3)
CACHE (2)
CONTEXT (2)
EDUCATIONAL INSTITUTIONS (2)
HARDWARE (2)
LOAD MANAGEMENT (2)
RANDOM ACCESS MEMORY (2)
RUNTIME (2)
SOCKETS (2)
ACCURACY (1)
ARCHITECTURE (1)
ARRAYS (1)
BANDWIDTH (1)
BUFFER (1)
C++ CODE (1)
CAMERAS (1)
CLUSTERING ALGORITHMS (1)
CMP (1)
CODE GENERATION (1)
COMPILER (1)
COMPUTER AIDED MANUFACTURING (1)
COMPUTER LANGUAGES (1)
CORE (1)
CUSTOMIZED LOOK-UP TABLES (1)
DATA LOCALITY (1)
DRAM (1)
ELEMENTARY FUNCTIONS (1)
FEEDBACK CONTROL (1)
FIELD PROGRAMMABLE GATE ARRAYS (1)
FPGA PLATFORMS (1)
FRAMES (1)
FUNCTION EVALUATION (1)
INDEXES (1)
IP NETWORKS (1)
IRREGULAR APPLICATION (1)
KERNEL (1)
LAYOUT (1)
LOCALITY (1)
LOOP TRANSFORMATION, CACHE HIERARCHY, MULTI (1)
MATHEMATICS COMPUTING (1)
MATLAB-LIKE CODE (1)
MEMORY (1)
MEMORY PREFETCHING (1)
MOBILE (1)
MULTICORE (1)
MULTICORES (1)
NOC (1)
OPERATING SYSTEMS (1)
OPTIMIZATION (1)
POLYNOMIALS (1)
PROGRAM COMPILERS (1)
QUALITY OF SERVICE (1)
QUERY (1)
ROW BUFFER (1)
SCHEDULE (1)
SOC (1)
SPECIAL-PURPOSE COMPILER (1)
TABLE LOOKUP (1)
THROUGHPUT (1)
TOPOLOGY (1)
TUNING (1)
więcej

INFONA - portal komunikacji naukowej

Wyniki wyszukiwania dla: Praveen Yedlapalli

Dodaj adresata

Anulowanie wysłania wiadomości

Czy na pewno chcesz anulować wysłanie wiadomości?

Wyślij wiadomość

Opcje filtrowania

Data publikacji

Ustawianie zakresu dat

Podaj zakres dat dla filtrowania wyświetlonych wyników. Możesz podać datę początkową, końcową lub obie daty. Daty możesz wpisać ręcznie lub wybrać za pomocą kalendarza.

Typ publikacji

Słowa kluczowe

Zgłaszanie błędu / nadużycia

Nieudane wysłanie zgłoszenia

Ułatwienia dostępu