Search results

Items from 1 to 6 out of 6 results

chapter

Understanding the Impact of Fine-Grained Data Sharing and Thread Communication on Heterogeneous Workload Development

Tuan Ta, David Troendle, Xiaoqi Hu, Byunghyun Jang

2017 16th International Symposium on Parallel and Distributed Computing (ISPDC) > 132 - 139

2017 16th International Symposium on Parallel and Distributed Computing (ISPDC)

The conventional OpenCL 1.x style CPU-GPU heterogeneous computing paradigm treats the CPU and GPU processors as loosely connected separate entities. At best each executes independent tasks, but, more commonly, the CPU idles while waiting for results from the GPU. No data-sharing and communications are allowed during kernel execution. This model limits the number of applications that can harness the...

chapter

PTAT: An efficient and precise tool for collecting detailed TLB miss traces

Jiutian Zhang, Yuhang Liu, Xiaojing Zhu, Yuan Ruan, more

2017 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS) > 137 - 138

2017 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)

It is well known that the TLB performance impacts the memory system performance, which is critical for overall system performance. Similar to multi-level caches, multilevel TLBs have become an important leverage for boosting data access performance. Applications have increasingly large working sets. Servers targeting such applications have thus been built with ever larger main memory capacities, but...

article

Efficient Synchronization for Distributed Embedded Multiprocessors

Hao Xiao, Ning Wu, Fen Ge, Tsuyoshi Isshiki, more

IEEE Transactions on Very Large Scale Integration (VLSI) Systems > 2016 > 24 > 2 > 779 - 783

In multiprocessor systems, low-latency synchronization is extremely important to effectively exploit fine-grain data parallelism and improve overall performance. This brief presents an efficient synchronization for embedded distributed multiprocessors. The proposed solution works in a completely decentralized request–response manner via explicit message exchange among the processing elements. Scalable...

chapter

Leveraging OmpSs to Exploit Hardware Accelerators

Florentino Sainz, Sergi Mateo, Vicenc Beltran, Jose L. Bosque, more

2014 IEEE 26th International Symposium on Computer Architecture and High Performance Computing > 112 - 119

2014 26th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD)

CUDA and OpenCL are the most widely used programming models to exploit hardware accelerators. Both programming models provide a C-based programming language to write accelerator kernels and a host API used to glue the host and kernel parts. Although this model is a clear improvement over a low-level and ad-hoc programming model for each hardware accelerator, it is still too complex and cumbersome...

chapter

Design aware scheduling of dynamic testbench controlled design element accesses in FPGA-based HW/SW co-simulation systems for fast functional verification

S Banerjee, T Gupta

2nd Asia Symposium on Quality Electronic Design (ASQED) > 175 - 181

2010 2nd Asia Symposium on Quality Electronic Design (ASQED 2010)

In HW/SW co-simulation based logic verification systems, the design under test (DUT) is executed on an FPGA based emulator and the behavioral testbench written in some high level language like C or HDL is run on a SW simulator or a general purpose CPU. In such systems it is essential to reduce the communication between SW and HW sides to enhance overall verification speed. Therefore it is of significant...

chapter

Performance Implications of Next-Generation Multi-processing Platforms on e-Business Server Applications

Qi Ming Teng, Xiao Zhong, Ying Li, Ying Chen

2008 IEEE International Conference on e-Business Engineering > 37 - 44

2008 IEEE International Conference on e-Business Engineering

When running multiple e-Business server applications simultaneously on the same hardware, inappropriate CPU sharing may endanger the performance stability for individual applications. Robustness and manageability are critical for Java server applications on emerging multi-processing hardware platforms. This paper investigates the performance implications of multiprocessing (including SMP and CMP)...

Filter options

Data set:
ieee
Keywords:
SYNCHRONIZATION
KERNEL
BENCHMARK TESTING
HARDWARE

Publication date

Set your own date range

Publication type

book (5)
article (1)

Keywords

GRAPHICS PROCESSING UNITS (2)
ACCELERATOR (1)
COMPUTER ARCHITECTURE (1)
CPU-INTENSIVE SERVER (1)
CUDA (1)
DATA TRANSFER (1)
DESIGN AWARE SCHEDULING (1)
DESIGN UNDER TEST (1)
DISTRIBUTED ARCHITECTURE (1)
DYNAMIC RANDOM ACCESS (1)
DYNAMIC SCHEDULING (1)
E-BUSINESS SERVER (1)
ELECTRONIC COMMERCE (1)
EMBEDDED MULTIPROCESSOR (1)
EMULATION (1)
FIELD PROGRAMMABLE GATE ARRAYS (1)
FILE SERVERS (1)
FORMAL VERIFICATION (1)
FPGA (1)
FPGA BASED EMULATOR (1)
FPGA-BASED HW/SW CO-SIMULATION SYSTEMS (1)
HARDWARE DESIGN LANGUAGES (1)
HARDWARE-SOFTWARE CODESIGN (1)
HW-SW CO-VERIFICATION (1)
INSTRUCTION SETS (1)
JAVA (1)
JAVA SERVER (1)
JGF BENCHMARK (1)
LOGIC VERIFICATION SYSTEMS (1)
MAGNETIC CORES (1)
MESSAGE PASSING (1)
MONITORING (1)
MULTIPROCESSING SYSTEMS (1)
NEXT-GENERATION MULTIPROCESSING PLATFORMS (1)
OMPSS (1)
OPENCL (1)
OPTIMIZATION (1)
PERFORMANCE STABILITY (1)
PROCESSOR CORES (1)
PROGRAMMING (1)
PROTOCOLS (1)
READ/WRITE ACCESSES (1)
REGISTERS (1)
SCHEDULING (1)
SERVERS (1)
STREAMING TRANSACTION BASED INTERFACES (1)
SYNCHRONIZATION. (1)
THREAD MIGRATION (1)
TOOLS (1)
more

INFONA - science communication portal

Search results

Understanding the Impact of Fine-Grained Data Sharing and Thread Communication on Heterogeneous Workload Development

PTAT: An efficient and precise tool for collecting detailed TLB miss traces

Efficient Synchronization for Distributed Embedded Multiprocessors

Leveraging OmpSs to Exploit Hardware Accelerators

Design aware scheduling of dynamic testbench controlled design element accesses in FPGA-based HW/SW co-simulation systems for fast functional verification

Performance Implications of Next-Generation Multi-processing Platforms on e-Business Server Applications

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options