Advanced search

Advanced search in people

From:

To:

Items from 1 to 9 out of 9 results

chapter

Directive-Based Pipelining Extension for OpenMP

Xuewen Cui, Thomas R. W. Scogland, Bronis R. de Supinski, Wu-Chun Feng

2016 IEEE International Conference on Cluster Computing (CLUSTER) > 481 - 484

2016 IEEE International Conference on Cluster Computing (CLUSTER)

Programming models like CUDA, OpenMP, OpenACC and OpenCL are designed to offload compute-intensive workloads to accelerators efficiently. However, the naive offload model, which synchronously copies and executes in sequence, requires extensive hand-tuning of techniques, such as pipelining to overlap computation and communication. Therefore, we propose an easy-to-use, directive-based pipelining extension...

chapter

Memory partition for SIMD in streaming dataflow architectures

Xiaowei Shen, Xiaochun Ye, Xu Tan, Da Wang, more

2016 Seventh International Green and Sustainable Computing Conference (IGSC) > 1 - 8

2016 Seventh International Green and Sustainable Computing Conference (IGSC)

The high parallelism feature of scientific applications makes SIMD very suitable for streaming dataflow architectures. However, the splitting of SIMD memory requests increases the messages in on-chip networks and decreases the efficiency of streaming dataflow architectures. To process SIMD memory requests without splitting, a memory partition mechanism is proposed for SIMD in streaming dataflow architectures...

chapter

Automatic Parallelization of GPU Applications Using OpenCL

Lizandro D. Solano-Quinde, Brett M. Bode, Arun K. Somani

2015 Asia-Pacific Conference on Computer Aided System Engineering > 276 - 283

2015 Asia-Pacific Conference on Computer Aided System Engineering (APCASE)

Graphics Processing Units (GPUs) have been successfully used to accelerate scientific applications due to their computation power and the availability of programming languages that make more approachable writing scientific applications for GPUs. However, since the programming model of GPUs requires offloading all the data to the GPU memory, the memory footprint of the application is limited to the...

chapter

A Co-processor Design of an Energy Efficient Reconfigurable Accelerator CMA

Mai Izawa, Nobuaki Ozaki, Yusuke Koizumi, Rie Uno, more

2013 First International Symposium on Computing and Networking > 148 - 154

2013 First International Symposium on Computing and Networking (CANDAR)

Cool Mega Array (CMA) is an energy efficient reconfigurable accelerator consisting of a large PE array with combinatorial circuits and a small microcontroller. In order to enhance the energy efficiency of the total system, a coprocessor design of CMA(Cool Mega Array), called CMA-Geyser is proposed. By replacing the programmable microcontroller by the host processor Geyser with a dedicated hardware...

chapter

Hybrid compile and run-time memory management for a 3D-stacked reconfigurable accelerator

Lovic Gauthier, Shinya Ueno, Koji Inoue

2013 International Conference on Compilers, Architecture and Synthesis for Embedded Systems (CASES) > 1 - 10

2013 International Conference on Compilers, Architecture and Synthesis for Embedded Systems (CASES)

This paper presents a hybrid compile and run-time memory management technique for a 3D-stacked reconfigurable accelerator including a memory layer composed of multiple memory units whose parallel access allows a very high bandwidth. The technique inserts allocation, free and data transfers into the code for using the memory layer and avoids memory overflows by adding a limited number of additional...

chapter

Realization of high speed mass storage Data Record Card with CF Card and SDRAM

Qiao Liyan, Xu Hongwei

2010 IEEE Instrumentation&Measurement Technology Conference Proceedings > 1363 - 1366

2010 IEEE Instrumentation & Measurement Technology Conference Proceedings

This paper presents a design that realizes a dual-channel Data Record Card which achieves 33 MB/s data transfer rate and 4 GB capacity per channel with CF Card and SDRAM. According to the characteristics of data transfer with CF Card, an advanced data transfer method is provided, which reduces the requirements of memory resources in FPGA without influencing the data transfer rate. And it realizes...

chapter

Software cache support and API design for embedded DSP processor

Cheng-Yen Lin, Shao-Chung Wang, Ming-Yu Hung, Kun-Yuan Hsieh, more

2009 International SoC Design Conference (ISOCC) > 161 - 164

2009 International SoC Design Conference (ISOCC 2009)

In embedded SoC design, memory hierarchies are playing increasingly important roles for system performances. There is a significant latency gap between internal and external memory accesses. The external memory access might downgrade the performance of embedded systems. Application developers must explicitly handle data transfer between external and internal memories. That is a burden for programmers...

chapter

System-level exploration tool for energy-awarememory management in the design of multidimensional signal processing systems

F. Balasa, I.I. Luican, Hongwei Zhu, D.V. Nasui

2009 Asia and South Pacific Design Automation Conference > 443 - 448

ASP-DAC 2009. 14th Asia and South Pacific Design Automation Conference

Many signal processing systems, particularly in the multimedia and telecom domains, are synthesized to execute data-dominated applications. In such systems, data transfer and storage have a significant impact on both the system performance and the major cost parameters - power consumption and chip area. This paper presents a software tool for system-level exploration, where several memory management...

chapter

Computation rotating for data reuse

Guiming Wu, Jinhui Xu, Yong Dou, Miao Wang

2008 13th Asia-Pacific Computer Systems Architecture Conference > 1 - 7

2008 13th Asia-Pacific Computer Systems Architecture Conference (ACSAC)

Loop tiling is an effective loop transformation technique that tiles the iteration space of loop nests to improve the data locality. The appropriate data layout and transfer strategies are also important to assist loop tiling. This paper describes an approach to enhance data reuse and reduce off-chip memory access after loop tiling. Data tiles due to loop tiling may have overlapped elements, which...

Filter options

Keywords:
ARRAYS
MEMORY MANAGEMENT
DATA TRANSFER

Publication date

Set your own date range

Keywords

COMPUTATIONAL MODELING (3)
BENCHMARK TESTING (2)
DIGITAL SIGNAL PROCESSING (2)
EMBEDDED SYSTEM (2)
GRAPHICS PROCESSING UNITS (2)
KERNEL (2)
MEMORY (2)
SYSTEM-ON-CHIP (2)
TILES (2)
ALGORITHMS (1)
API DESIGN (1)
APPLICATION PROGRAM INTERFACES (1)
ATMOSPHERIC MODELING (1)
CACHE STORAGE (1)
CF CARD (1)
CHIP AREA (1)
CODE EXECUTION (1)
DATA LAYOUT (1)
DATA LOCALITY (1)
DATA RECORD CARD (1)
DATA REUSE (1)
DATA STORAGE (1)
DATA TILE (1)
DESIGN AUTOMATION (1)
DIGITAL SIGNAL PROCESSING CHIPS (1)
DSP (1)
DUAL-CHANNEL (1)
EMBEDDED DSP PROCESSOR (1)
EMBEDDED PROCESSOR (1)
EMBEDDED SOC DESIGN (1)
EMBEDDED SYSTEMS (1)
ENERGY CONSUMPTION (1)
EXTERNAL MEMORY ACCESS (1)
FIELD PROGRAMMABLE GATE ARRAYS (1)
FPGA (1)
GPU (1)
GRAPH THEORY (1)
HIGH SPEED MASS STORAGE (1)
HIGH-PERFORMANCE COMPUTING (1)
IMAGE EDGE DETECTION (1)
INDEXES (1)
INSTRUMENTS (1)
INTEGRATED CIRCUIT DESIGN (1)
INTERNAL MEMORY ACCESS (1)
ITERATION SPACE (1)
LATTICES (1)
LIBRARIES (1)
LOOP NEST (1)
LOOP TILING (1)
LOOP TRANSFORMATION (1)
LOW POWER COMPUTATION (1)
MEMORY ALLOCATION (1)
MEMORY PARTITION (1)
MICROCONTROLLERS (1)
MICROPROCESSOR CHIPS (1)
MULTIDIMENSIONAL SIGNAL PROCESSING (1)
MULTIDIMENSIONAL SIGNAL PROCESSING SYSTEMS (1)
OFF-CHIP MEMORY ACCESS (1)
OPENCL (1)
POWER CONSUMPTION (1)
POWER DEMAND (1)
PROGRAM TRANSFORMATION (1)
RADAR (1)
RANDOM-ACCESS STORAGE (1)
RECONFIGURABLE PROCESSORS (1)
REGISTERS (1)
RESOURCE MANAGEMENT (1)
SCIENTIFIC APPLICATIONS (1)
SDRAM (1)
SIGNAL ASSIGNMENT (1)
SIMD (1)
SOFTWARE CACHE (1)
SOFTWARE CACHE SUPPORT (1)
STORAGE MANAGEMENT (1)
STORAGE MANAGEMENT CHIPS (1)
STORAGE VARIATION GRAPH (1)
STREAMING DATAFLOW (1)
SYSTEM-LEVEL EXPLORATION (1)
TABLE LOOKUP (1)
TIN (1)
ULTRA DMA (1)
XML (1)
more

INFONA - science communication portal

Advanced search

Advanced search in people

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options