Search results

Items from 1 to 11 out of 11 results

chapter

Cache Partitioning + Loop Tiling: A Methodology for Effective Shared Cache Management

Vasilios Kelefouras, Georgios Keramidas, Nikolaos Voros

2017 IEEE Computer Society Annual Symposium on VLSI (ISVLSI) > 477 - 482

2017 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)

In this paper, we present a new methodology that provides i) a theoretical analysis of the two most commonly used approaches for effective shared cache management (i.e., cache partitioning and loop tiling) and ii) a unified framework for fine-tuning those two mechanisms in tandem (not separately). Our approach manages to lower the number of main memory accesses by one order of magnitude keeping at...

chapter

A static-placement, dynamic-issue framework for CGRA loop accelerator

Zhongyuan Zhao, Weiguang Sheng, Weifeng He, ZhiGang Mao, et al.

Design, Automation & Test in Europe Conference & Exhibition (DATE), 2017 > 1348 - 1353

2017 Design, Automation & Test in Europe Conference & Exhibition (DATE)

This paper presents a static-placement, dynamic-issue (SPDI) framework for the coarse-grained reconfigurable architecture (CGRA) in order to tackle the inefficiencies of the static-issue, static-placement (SISP) CGRA. This framework includes a compiler that statically places the operations and a hardware design, an SPDI CGRA, that automatically schedules the operations. We stress on introducing the...

chapter

Polyhedral Optimizations of Explicitly Parallel Programs

Prasanth Chatarasi, Jun Shirako, Vivek Sarkar

2015 International Conference on Parallel Architecture and Compilation (PACT) > 213 - 226

2015 International Conference on Parallel Architecture and Compilation (PACT)

The polyhedral model is a powerful algebraic framework that has enabled significant advances to analysis and transformation of sequential affine (sub)programs, relative to traditional AST-based approaches. However, given the rapid growth of parallel software, there is a need for increased attention to using polyhedral frameworks to optimize explicitly parallel programs. An interesting side effect of supporting...

chapter

JolokiaC++: Optimizing Irregular Accesses for GPGPU

Vibha Patel, Sanjeev Aggarwal, Amey Karkare

2015 IEEE 17th International Conference on High Performance Computing and Communications, 2015 IEEE 7th International Symposium on Cyberspace Safety and Security, and 2015 IEEE 12th International Conference on Embedded Software and Systems > 583 - 590

2015 IEEE 17th International Conference on High Performance Computing and Communications (HPCC), 2015 IEEE 7th International Symposium on Cyberspace Safety and Security (CSS) and 2015 IEEE 12th International Conf on Embedded Software and Systems (ICESS)

We present JolokiaC++, a compiler framework to ease coding of irregular data applications on GPUs. The effectiveness of the compiler and runtime systems of JolokiaC++ is tested using three kernels, IRREG, MOLDYN and NBF, executed on NVIDIA GPUs. We developed extensions for the generic parallel constructs that allow portable and efficient programming of codes with irregular accesses on the GPU. We present...

chapter

SAFEBOX: A Verified Microkernel Based on Spatial-Temporal Isolation

Fan Zhang, Weining Su, Tianfang Wang, Xiaopeng Wang

2015 2nd International Conference on Information Science and Control Engineering > 451 - 455

2015 2nd International Conference on Information Science and Control Engineering (ICISCE)

The correctness of the kernel is key to safety-critical embedded applications, and only formal verification can prove that the kernel is free of certain defects or satisfies certain properties. In this paper, we introduce SAFEBOX, a microkernel based on spatial-temporal isolation, give the formal description of SAFEBOX, and use the theorem prover Isabelle/HOL to verify the functional and non-functional...

chapter

On the Fairness of Linux O(1) Scheduler

Jyothish Jose, Oravanpadath Sujisha, Malayamparambath Gilesh, Thayyil Bindima

2014 5th International Conference on Intelligent Systems, Modelling and Simulation > 668 - 674

2014 5th International Conference on Intelligent Systems, Modelling and Simulation (ISMS)

The scheduling algorithm of Linux operating systems has to fulfill several conflicting objectives: fast process response time, higher throughput for background jobs, avoidance of process starvation, reconciliation of the needs of low- and high-priority processes, etc. The set of rules used to determine when and how to select a new process to run is called the scheduling policy. The current Linux kernel uses...

chapter

Runtime dependency analysis for loop pipelining in High-Level Synthesis

Mythri Alle, Antoine Morvan, Steven Derrien

2013 50th ACM/EDAC/IEEE Design Automation Conference (DAC) > 1 - 10

2013 50th ACM/EDAC/IEEE Design Automation Conference (DAC)

Research on High-Level Synthesis has mainly focused on applications with statically determinable characteristics, and current tools often perform poorly in the presence of data-dependent memory accesses. The reason is that they rely on conservative static scheduling strategies, which lead to inefficient implementations. In this work, we propose to address this issue by leveraging well-known techniques...

chapter

Parameterized Verification of GPU Kernel Programs

Guodong Li, Ganesh Gopalakrishnan

2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum > 2450 - 2459

2012 26th IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)

We present an automated symbolic verifier for checking the functional correctness of GPGPU kernels parametrically, for an arbitrary number of threads. Our tool checks the functional equivalence of a kernel and its optimized versions, helping debug errors introduced during memory coalescing and bank conflict elimination related optimizations. Key features of our work include: (1) a symbolic method...

chapter

Memory partitioning and scheduling co-optimization in behavioral synthesis

Peng Li, Yuxin Wang, Peng Zhang, Guojie Luo, et al.

2012 IEEE/ACM International Conference on Computer-Aided Design (ICCAD) > 488 - 495

2012 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)

Achieving optimal throughput by extracting parallelism in behavioral synthesis often exaggerates memory bottleneck issues. Data partitioning is an important technique for increasing memory bandwidth by scheduling multiple simultaneous memory accesses to different memory banks. In this paper we present a vertical memory partitioning and scheduling algorithm that can generate a valid partition scheme...

chapter

A Data Communication Scheduler for Stream Programs on CPU-GPU Platform

Tao Tang, Xinhai Xu, Yisong Lin

2010 10th IEEE International Conference on Computer and Information Technology > 139 - 146

2010 IEEE 10th International Conference on Computer and Information Technology (CIT)

In recent years, heterogeneous parallel systems have become a focus of research in the high performance computing field. Generally, in a heterogeneous parallel system, the CPU provides the basic computing environment and a special-purpose accelerator (a GPU in this paper) provides high computing performance. However, the overall performance of the system is prone to be limited by the data communication between...

chapter

Stream Processing in Triplet Based Architecture

Jiaxin Li, Xuelai Luo

2008 International Conference on MultiMedia and Information Technology > 573 - 576

2008 International Conference on Multimedia and Information Technology (MMIT 2008)

Stream processing is widely used for its high-speed processing in some special areas such as image processing and scientific computing. Many efforts have been made by researchers to use stream processing in general-purpose computing. In this paper, we propose an implementation of stream processing based on TriBA, which is a multi-core architecture. By applying different strategies in this structure,...

Filter options

Data set: ieee
Keywords: KERNEL, ARRAYS, SCHEDULES

Keywords

FORMAL VERIFICATION (2)
GPU (2)
GRAPHICS PROCESSING UNIT (2)
HARDWARE (2)
INSTRUCTION SETS (2)
MEMORY MANAGEMENT (2)
PROGRAMMING (2)
RUNTIME (2)
STREAMING MEDIA (2)
ALGORITHM DESIGN AND ANALYSIS (1)
BEHAVIORAL SYNTHESIS (1)
BENCHMARK TESTING (1)
CACHE PARTITIONING (1)
CFS (1)
COMPILERS (1)
COMPLEX NUMBER MULTIPLICATION (1)
COMPUTER ARCHITECTURE (1)
COMPUTER BUGS (1)
COMPUTER GRAPHIC EQUIPMENT (1)
COPROCESSORS (1)
CORRECTNESS OF OPTIMIZATIONS (1)
CPU-GPU PLATFORM (1)
DATA COMMUNICATION (1)
DATA COMMUNICATION SCHEDULE (1)
DATA COMMUNICATION SCHEDULER (1)
EMBEDDED SOFTWARE (1)
EMBEDDED SYSTEMS (1)
EXPLICIT PARALLELISM (1)
GPGPU (1)
GPU PROGRAMMING (1)
GRAPHICS PROCESSING UNITS (1)
HAPPENS-BEFORE RELATIONS (1)
HETEROGENEOUS PARALLEL SYSTEM (1)
HEURISTIC ALGORITHMS (1)
HIGH PERFORMANCE COMPUTING FIELD (1)
HIGH SPEED PROCESSING (1)
IRREGULAR ACCESSES (1)
JACOBIAN MATRICES (1)
LAYOUT (1)
LINUX (1)
LOOP CONTROL STRUCTURE (1)
LOOP TILING (1)
MEMORY PARTITIONING (1)
MEMORY SCHEDULING (1)
MICROKERNEL (1)
MICROPROCESSORS (1)
MULTICORE ARCHITECTURE (1)
MULTICORE PROCESSING (1)
O(1) SCHEDULER (1)
OPENMP (1)
OPTIMIZATION (1)
PARALLEL CONSTRUCTS (1)
PARALLEL PROCESSING (1)
PARAMETERIZED REASONING (1)
PARTITIONING ALGORITHMS (1)
PIPELINE PROCESSING (1)
PLUTO (1)
POLYHEDRAL TRANSFORMATIONS (1)
PROCESS SCHEDULING (1)
PROCESSOR SCHEDULING (1)
PROGRAM PROCESSORS (1)
REGISTERS (1)
ROUTING (1)
SAFETY-CRITICAL (1)
SATISFIABILITY MODULO THEORIES (SMT) (1)
SCHEDULE (1)
SCHEDULING (1)
SCHEDULING ALGORITHMS (1)
SHARED CACHE (1)
SHIFT REGISTERS (1)
STARVATION (1)
STREAM PROCESSING (1)
STREAM PROGRAMMING MODEL (1)
STREAM PROGRAMS (1)
STRUCTURAL ANALYSIS (1)
SYMBOLIC ANALYSIS (1)
SYNCHRONIZATION STRATEGY (1)
TASK PARALLELISM (1)
TEMPORAL-SPATIAL ISOLATION (1)
THREADS (1)
THROUGHPUT (1)
TOPOLOGY (1)
TRIBA TOPOLOGY (1)
TRIPLET BASED ARCHITECTURE (1)
UNIX OS (1)
