Search results

Items from 1 to 12 out of 12 results

chapter

Implementing Lattice QCD Application with XcalableACC Language on Accelerated Cluster

Masahiro Nakao, Hitoshi Murai, Hidetoshi Iwashita, Akihiro Tabuchi, more

2017 IEEE International Conference on Cluster Computing (CLUSTER) > 429 - 438

2017 IEEE International Conference on Cluster Computing (CLUSTER)

Accelerated clusters, which are distributed memory systems equipped with accelerators, have been used in various fields. For accelerated clusters, programmers often implement their applications by a combination of MPI and CUDA (MPI+CUDA). However, the approach faces programming complexity issues. This paper introduces the XcalableACC (XACC) language, which is a hybrid model of XcalableMP (XMP) and...

chapter

OpenACC Cache Directive: Opportunities and Optimizations

Ahmad Lashgar, Amirali Baniasadi

2016 Third Workshop on Accelerator Programming Using Directives (WACCPD) > 46 - 56

2016 Third Workshop on Accelerator Programming Using Directives (WACCPD)

OpenACC's programming model presents a simple interface to programmers, offering a trade-off between performance and development effort. OpenACC relies on compiler technologies to generate efficient code and optimize for performance. Among the difficult to implement directives, is the cache directive. The cache directive allows the programmer to utilize accelerator's hardware- or software-managed...

chapter

An Extension of OpenACC Directives for Out-of-Core Stencil Computation with Temporal Blocking

Nobuhiro Miki, Fumihiko Ino, Kenichi Hagihara

2016 Third Workshop on Accelerator Programming Using Directives (WACCPD) > 36 - 45

2016 Third Workshop on Accelerator Programming Using Directives (WACCPD)

In this paper, aiming at realizing directive-based temporal blocking for out-of-core stencil computation, we present an extension of OpenACC directives and a source-to-source translator capable of accelerating out-of-core stencil computation on a graphics processing unit (GPU). Out-of-core stencil computation here deals with large data that cannot be entirely stored in GPU memory. Given an OpenACC-like...

chapter

An OpenACC Optimizer for Accelerating Histogram Computation on a GPU

Kei Ikeda, Fumihiko Ino, Kenichi Hagihara

2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP) > 468 - 477

2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP)

This paper presents a source-to-source OpenACC optimizer that automatically optimizes a histogram computation code for a graphics processing unit (GPU). Parallel histogram computation codes typically deploy multiple copies of histograms and update them with atomic operations. This duplication method can be implemented as an OpenACC code. However, the structure of sequential code blocks must be manually...

chapter

An OpenACC Optimizer for Accelerating Histogram Computation on a GPU

Kei Ikeda, Fumihiko Ino, Kenichi Hagihara

2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP) > 468 - 477

2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP)

chapter

SmartBackup: An Efficient and Reliable Backup Strategy for Solid State Drives with Backup Capacitors

Min Huang, Yi Wang, Liyan Qiao, Duo Liu, more

2015 IEEE 17th International Conference on High Performance Computing and Communications, 2015 IEEE 7th International Symposium on Cyberspace Safety and Security, and 2015 IEEE 12th International Conference on Embedded Software and Systems > 746 - 751

2015 IEEE 17th International Conference on High Performance Computing and Communications (HPCC), 2015 IEEE 7th International Symposium on Cyberspace Safety and Security (CSS) and 2015 IEEE 12th International Conf on Embedded Software and Systems (ICESS)

Unpredictable power outages in NAND flashbased Solid State Drives (SSDs) may cause system failure or reliability problems. Capacitors are widely adopted as the interim power supplier when power interruption happens. However, since the energy provided by backup capacitors is limited, and the capacitance of a capacitor will gradually degrade with time, it is imperative to improve the efficiency and...

chapter

Analysis and optimization of program disturb in split-gate cells using source side injection and impact on further cell size reduction

Christoph Bukethal, Georg Tempel, Robert Strenz, John Power

2013 International Semiconductor Conference Dresden - Grenoble (ISCDG) > 1 - 3

2013 International Semiconductor Conference Dresden - Grenoble (ISCDG)

Program disturb is a major issue limiting the functionality of hot carrier programmed flash memories. This paper reports a detailed characterization of program disturb in a split-gate flash memory cell using source side injection programming. Key parameters influencing the cell's disturb sensitivity have been investigated, empirical models have been developed and a physical root cause has been identified...

chapter

Implementation of XcalableMP Device Acceleration Extention with OpenCL

Takuma Nomizu, Daisuke Takahashi, Jinpil Lee, Taisuke Boku, more

2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum > 2394 - 2403

2012 26th IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)

Due to their outstanding computational performance, many acceleration devices, such as GPUs, the Cell Broadband Engine (Cell/B.E.), and multi-core computing are attracting a lot of attention in the field of high-performance computing. Although there are many programming models and languages de-signed for programming accelerators, such as CUDA, AMD Accelerated Parallel Processing (AMD APP), and OpenCL,...

chapter

Highly Optimized Nanocrystal-Based Split Gate Flash for High Performance and Low Power Microcontroller Applications

J Yater, C Hong, S.-T Kang, D Kolar, more

2011 3rd IEEE International Memory Workshop (IMW) > 1 - 4

2011 3rd IEEE International Memory Workshop (IMW)

We show a 90nm nanocrystal-based split gate embedded flash memory that is able to meet the speed, endurance and reliability requirements for 32-bit microcontroller products. A 3.4V operating window is achievable and the process is robust and repeatable across many lots. Erase after 10k cycles can be achieved in 5ms, long-term data retention of cycled arrays is not susceptible to SILC-induced charge...

chapter

Accelerating scientific applications using GPU's

M. Taher

2009 4th International Design and Test Workshop (IDT) > 1 - 6

2009 4th International Design and Test Workshop (IDT 2009)

Graphics processing units (GPUs) have emerged as a powerful platform for high-performance computation. They have been successfully used to accelerate many scientific workloads. Typically, the computationally intensive parts of the application are offloaded to the GPU, which serves as the CPU's parallel coprocessor. The key to effective utilization of GPUs for scientific computing is the design and...

chapter

Reliability characterization of Phase Change Memory

B. Gleixner, F. Pellizzer, R. Bez

2009 10th Annual Non-Volatile Memory Technology Symposium (NVMTS) > 7 - 11

2009 10th Annual Non-Volatile Memory Technology Symposium (NVMTS 2009)

Phase Change Memory (PCM) has emerged as an attractive candidate for next-generation non-volatile memory devices. For these applications, reliability is determined by the ability to retain the state of data in the device and support a specified number of re-writes without failure. In PCM technologies, retention is limited by the meta-stable amorphous state of the cell. For cycling endurance (re-writes),...

chapter

Reliability of advanced embedded non-volatile memories: The 2T-FNFN device

Guoqiao Tao

2008 IEEE International Conference on Integrated Circuit Design and Technology and Tutorial > 79 - 82

2008 IEEE International Conference on IC Design and Technology & Tutorial (ICICDT)

The reliability of advanced embedded non-volatile memories has been discussed using the 2T-FNFN devices example. The write/erase endurance and the data retention are the most important reliability parameters. The intrinsic reliability mechanisms can be addressed through single cell evaluation, while the cell-to-cell variation determines the product level reliability. The cell-to-cell variation can...

Filter options

Data set:
ieee
Keywords:
ARRAYS
PROGRAMMING
ACCELERATION

Publication date

Set your own date range

Keywords

GRAPHICS PROCESSING UNITS (5)
RELIABILITY (4)
LOGIC GATES (3)
OPENACC (3)
AUTOMATED TUNING (2)
CAPACITORS (2)
CIRCUIT RELIABILITY (2)
FLASH MEMORIES (2)
GPU (2)
HISTOGRAM COMPUTATION (2)
HISTOGRAMS (2)
INSTRUCTION SETS (2)
NONVOLATILE MEMORY (2)
OPTIMIZATION (2)
PERFORMANCE EVALUATION (2)
TUNING (2)
2T-FNFN DEVICE (1)
ACCELERATED CLUSTER (1)
ACCELERATOR (1)
ASH (1)
AUTOMOTIVE ENGINEERING (1)
CACHE MEMORY (1)
CELL PROGRAMMING (1)
CELL-TO-CELL VARIATION (1)
CHALCOGENIDE (1)
CHARGE CARRIER PROCESSES (1)
CLUSTER (1)
CMOS INTEGRATED CIRCUITS (1)
CMOS TECHNOLOGY (1)
COMPILER (1)
COMPUTER ARCHITECTURE (1)
COMPUTER GRAPHIC EQUIPMENT (1)
COPROCESSORS (1)
COUPLINGS (1)
CPU PARALLEL COPROCESSOR (1)
CUDA (1)
CUDA PROGRAMMING MODEL (1)
CYCLING ENDURANCE (1)
DATA ENCRYPTION STANDARD (1)
DATA MODELS (1)
DATA RETENTION (1)
DATA TRANSFER (1)
DEGRADATION (1)
DISCHARGES (ELECTRIC) (1)
DOPING (1)
DRAIN DISTURB (1)
EEPROM (1)
EFFICIENT DATA PARALLEL ALGORITHMS (1)
EMBEDDED NONVOLATILE MEMORIES (1)
EMBEDDED SYSTEMS (1)
ERROR CORRECTION (1)
EXTRAPOLATION (1)
FLASH (1)
FLASH ARRAYS (1)
FLASH MEMORY (1)
FLASH MEMORY CELLS (1)
GRAPHICS (1)
GRAPHICS PROCESSING UNIT (1)
HARDWARE (1)
HEATING ELEMENT (1)
HIGHLY OPTIMIZED NANOCRYSTAL-BASED SPLIT GATE FLASH MEMORY (1)
IEEE JOURNAL OF SOLID-STATE CIRCUITS (1)
INDEXES (1)
INTEGRATED CIRCUIT RELIABILITY (1)
INTERFACE STATES (1)
INTERNATIONAL ELECTRON DEVICES MEETING (1)
INTRINSIC RELIABILITY MECHANISMS (1)
KERNEL (1)
LATTICES (1)
LIFE ESTIMATION (1)
LOW POWER MICROCONTROLLER APPLICATION (1)
MATERIALS (1)
MATRICES (1)
MEDIA (1)
MEMORIES (1)
MEMORY ARCHITECTURE (1)
META-STABLE AMORPHOUS STATE (1)
MICROCONTROLLERS (1)
MICROPROCESSORS (1)
MOSFETS (1)
NANOCRYSTALS (1)
NEXT-GENERATION NON-VOLATILE MEMORY DEVICES (1)
NVIDIA GPU (1)
OPENCL (1)
OXIDATION (1)
PARALLEL LANGUAGE (1)
PARALLEL PROCESSING (1)
PERFORMANCE (1)
PHASE CHANGE MATERIALS (1)
PHASE CHANGE MEMORIES (1)
PHASE CHANGE MEMORY (1)
POSTAL SERVICES (1)
PREDICTIVE MODELS (1)
PROCESS INTEGRATION (1)
PRODUCT DESIGN (1)
PRODUCT LEVEL RELIABILITY (1)
PRODUCTIVITY (1)
more

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options