A fork bomb attack is a denial-of-service attack in which an attacker rapidly spawns many processes, exhausting the resources of the target computer system. There is prior work on detecting and removing the processes that cause fork bomb attacks. However, operating systems using these previous methods risk terminating legitimate processes that are not fork bomb processes. In this paper,...
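The attack mechanism and the classic coarse POSIX mitigation can be sketched as follows. This is a generic illustration of why a fork bomb works and how a per-process limit contains it, not the detection method the paper proposes; the limit value 1024 is arbitrary:

```python
import resource

# A fork bomb calls fork() in a loop; every child does the same, so the
# number of processes grows exponentially until PIDs or memory run out:
#
#     while True:
#         os.fork()   # never run this outside a sandbox
#
# The classic coarse mitigation is the POSIX per-user process limit,
# RLIMIT_NPROC: once the limit is reached, fork() fails with EAGAIN
# instead of creating another process, which contains the bomb.
soft, hard = resource.getrlimit(resource.RLIMIT_NPROC)

# Tighten the soft limit for this process (1024 is illustrative only).
new_soft = 1024 if soft == resource.RLIM_INFINITY else min(soft, 1024)
resource.setrlimit(resource.RLIMIT_NPROC, (new_soft, hard))
print("process limit lowered to", new_soft)
```

The drawback of this blanket limit is exactly the problem the abstract raises: it cannot distinguish a fork bomb from a legitimate process that simply spawns many children.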
GPUs are widely used as powerful accelerators for data-parallel applications, such as financial and scientific workloads, in both industry and research. Effective scheduling of kernels can significantly enhance performance and utilization. In shared environments such as the cloud, many kernels from different users are requested for execution. An effective kernel scheduling method can...
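To illustrate why the launch order of queued kernels matters, here is a toy comparison of first-come-first-served against shortest-job-first ordering on hypothetical kernel run times. This is a generic scheduling illustration, not the method the paper proposes:

```python
def avg_wait(durations):
    """Average time each job waits before starting when jobs run
    back-to-back in the given order."""
    wait, elapsed = 0.0, 0.0
    for d in durations:
        wait += elapsed
        elapsed += d
    return wait / len(durations)

kernels = [8.0, 1.0, 2.0]            # hypothetical kernel run times (ms)
print(avg_wait(kernels))             # FCFS (arrival) order
print(avg_wait(sorted(kernels)))     # shortest-job-first order
```

Running the short kernels first lowers the average waiting time, which is one reason kernel schedulers in shared environments do better than naive FIFO queues.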
In this paper, we introduce memos, which integrates suitable memory management policies and schedules resources over the entire memory hierarchy in hybrid memory systems. Powered by an OS-kernel-level monitoring tool, memos captures memory access patterns online and then leverages them to guide memory page placement and data mapping. Experimental results show that, on average, memos can improve memory utilization,...
The hardware and software structures of the real-time operating system of the SATELLITE programmable controller are considered. The processor module uses a microprocessor based on an ARM Cortex-M4F core. The user application program is developed in the CODESYS environment and executed under the control of the RTS program from 3S-Smart Software Solutions GmbH. For the interface between RTS and the hardware, the ARM...
Estimating the failure probabilities of SRAM memory cells using Monte Carlo or Importance Sampling techniques is expensive in the number of SPICE simulations needed. This paper presents a methodology for estimating the dynamic margin failure probabilities by building a surrogate model of the dynamic margin using Gaussian Process regression. Additive kernel functions that can extrapolate the margin...
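The cost of plain Monte Carlo for rare cell failures can be quantified from the standard error of the Bernoulli estimator: with N samples, the relative standard error of an estimate of failure probability p is sqrt((1-p)/(N·p)). A small sketch, where the target probability and error tolerance are illustrative values, not numbers taken from the paper:

```python
import math

def mc_samples_needed(p, rel_err):
    """Number of Monte Carlo samples so that the relative standard
    error of the failure-probability estimate is at most rel_err,
    using Var(p_hat) = p * (1 - p) / N for a Bernoulli estimator."""
    return math.ceil((1.0 - p) / (p * rel_err ** 2))

# A rare SRAM cell failure rate of ~1e-9 at 10% relative error
# requires on the order of 1e11 SPICE simulations:
print(mc_samples_needed(1e-9, 0.10))
```

This sample count is what motivates surrogate models such as the Gaussian process regression described above: each surrogate evaluation is far cheaper than a SPICE run.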
Hardware accelerators for convolutional neural networks (CNNs) incorporate large amounts of SRAM in order to reduce the number of expensive off-chip DRAM accesses. This design trend has an implication for architects: SRAM area will dominate the entire chip area of future CNN accelerators. Since the probability of soft errors, such as those caused by energetic particle strikes, scales with SRAM density, errors...
Many studies have shown that memory hardware error rates are orders of magnitude higher than previously reported. To combat these memory hardware errors, many memory testing tools have been developed, especially software-level online memory testers, i.e. memory testers implemented in software that can run alongside the OS (operating system). However, validation...
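As a concrete example of the kind of check a software memory tester performs, here is the textbook walking-1s pattern test over a simulated memory. This is a standard pattern test for illustration, not the validation technique the paper develops, and the stuck-at fault model below is made up:

```python
def walking_ones(write, read, addr, width=8):
    """Classic walking-1s test at one address: write each single-bit
    pattern and check that it reads back unchanged."""
    for bit in range(width):
        pattern = 1 << bit
        write(addr, pattern)
        if read(addr) != pattern:
            return False    # a bit failed to hold the written value
    return True

# Fault-free simulated memory: every pattern reads back correctly.
mem = {}
print(walking_ones(mem.__setitem__, mem.__getitem__, 0x10))   # passes

# Simulated memory with a stuck-at-0 fault in bit 3 (mask 0x08):
faulty = {}
print(walking_ones(lambda a, v: faulty.__setitem__(a, v & ~0x08),
                   faulty.__getitem__, 0x10))                 # fails
```

Real online testers differ mainly in that they must steal pages from a running OS and restore their contents afterwards, which is what makes validating them difficult.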
Single-ISA heterogeneous multicore processors have gained increasing popularity with the introduction of recent technologies such as ARM big.LITTLE. These processors offer increased energy efficiency through combining low power in-order cores with high performance out-of-order cores. Efficiently exploiting this attractive feature requires careful management so as to meet the demands of targeted applications...
We present a novel architecture for sparse pattern processing, using flash storage with embedded accelerators. Sparse pattern processing on large data sets is the essence of applications such as document search, natural language processing, bioinformatics, subgraph matching, machine learning, and graph processing. One slice of our prototype accelerator is capable of handling up to 1TB of data, and...
For numerous scientific applications, sparse matrix-vector multiplication (SpMV) is one of the most important kernels. Unfortunately, due to its very low ratio of computation to memory accesses, SpMV is inherently a memory-bound problem. On the other hand, the main memory bandwidth of commercial off-the-shelf (COTS) architectures is insufficient for the available computation resources on these platforms,...
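The memory-bound nature of SpMV is visible in its standard compressed sparse row (CSR) form: each nonzero costs one multiply-add but two data loads, one of them an indirect gather through the column index. A minimal sketch (the matrix values are an arbitrary example):

```python
def spmv_csr(values, col_idx, row_ptr, x):
    """y = A @ x for a sparse matrix A stored in CSR format."""
    n_rows = len(row_ptr) - 1
    y = [0.0] * n_rows
    for i in range(n_rows):
        acc = 0.0
        for k in range(row_ptr[i], row_ptr[i + 1]):
            # one multiply-add, two loads (values[k] and the
            # indirectly addressed x[col_idx[k]])
            acc += values[k] * x[col_idx[k]]
        y[i] = acc
    return y

# 3x3 example: A = [[4, 0, 1], [0, 2, 0], [3, 0, 5]]
values  = [4.0, 1.0, 2.0, 3.0, 5.0]
col_idx = [0, 2, 1, 0, 2]
row_ptr = [0, 2, 3, 5]
print(spmv_csr(values, col_idx, row_ptr, [1.0, 1.0, 1.0]))  # → [5.0, 2.0, 8.0]
```

With roughly two flops per 12-16 bytes moved, the kernel saturates memory bandwidth long before it saturates the floating-point units, which is the imbalance the abstract refers to.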
The use of accelerators such as GPUs is increasing, but efficient use of GPUs requires good design choices. Such choices include the type of memory allocation and the overlapping of data transfers with parallel computation. Performance varies with the application, the hardware version (such as the GPU generation), and the software version, including programming-language drivers. This large number...
Real-time, low-latency image processing with high throughput is vital for many time-critical applications in fields such as medical imaging, robotics, and wearable computers. Traditionally, FPGAs have often been employed to meet these requirements. However, due to productivity challenges, using FPGAs may not be viable in some cases. Alternatively, the typical approach of processing an image on...
The Parallella is a hybrid computing platform that came into existence as the result of a Kickstarter project by Adapteva. It combines the high-performance, energy-efficient, manycore Epiphany chip (used as a co-processor) with a Zynq-7000 series chip, which normally runs a regular Linux OS, serves as the main processor, and implements "glue logic" in its internal...
Cloud computing is a new IT delivery paradigm that offers computing resources as on-demand services over the Internet. Like all forms of outsourcing, cloud computing raises serious concerns about the security of the data assets that are outsourced to cloud service providers. Security issues of cloud platforms have gradually drawn the attention of research institutions and various security companies...
This paper introduces a software policy for memory management in heterogeneous memory systems in order to improve the trade-offs between performance and power consumption, while attempting to make the best use of different characteristics of the underlying memory technologies. In this policy, the operating system and the application co-schedule page management in order to make informed decisions about...
In the past few years, nonlocal filters have emerged as a serious contender for denoising synthetic aperture radar (SAR) images, offering superior noise reduction and detail preservation compared to many other filters. In this manuscript we analyze how nonlocal filters, whose computational costs have so far been prohibitive for large-scale processing, can be implemented efficiently on graphics processing...
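The core of a nonlocal filter, in its basic nonlocal-means form, replaces each sample by a weighted average of all samples, weighted by patch similarity; the quadratic all-pairs loop is what made these filters expensive and, being embarrassingly parallel per output sample, what makes them a good fit for GPUs. A 1-D sketch, using plain nonlocal means rather than the SAR-specific variant the paper analyzes, with illustrative `patch_radius` and `h` values:

```python
import math

def nl_means_1d(signal, patch_radius=1, h=0.5):
    """Nonlocal means on a 1-D signal: each output sample is a weighted
    average over ALL input samples, with weights driven by the squared
    distance between the small patches centered at the two positions."""
    n = len(signal)
    out = []
    for i in range(n):                       # independent per output -> GPU-friendly
        num = den = 0.0
        for j in range(n):                   # all-pairs loop: O(n^2) cost
            d2 = 0.0
            for k in range(-patch_radius, patch_radius + 1):
                a = signal[min(max(i + k, 0), n - 1)]   # clamp at borders
                b = signal[min(max(j + k, 0), n - 1)]
                d2 += (a - b) ** 2
            w = math.exp(-d2 / (h * h))      # similar patches get weight ~1
            num += w * signal[j]
            den += w
        out.append(num / den)
    return out

noisy = [0.0, 0.1, 0.0, 1.0, 0.9, 1.0]
print(nl_means_1d(noisy))
```

Since every output sample depends only on read-only input data, the outer loop maps directly onto one GPU thread per pixel, which is the parallelism the paper exploits.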
Error data collected at runtime play a key role for dependability analysis and improvement of software systems. The use of monitoring frameworks for legacy mission-critical systems is hindered by limited intervention degree and low intrusiveness requirements. We present the design and experimentation of an error monitoring service for a legacy large-scale critical system in the Air Traffic Control...
OpenCL is a high-level language that allows mixed hardware/software systems to be specified and compiled to run on heterogeneous parallel computing platforms. The hardware parallelism can take the form of multi-core central processing units (CPUs), massively parallel graphics processing units (GPUs), and, most recently, field-programmable gate array (FPGA) fabrics. OpenCL compilers for CPUs and GPUs...
The recent advent of stacked memory devices has led to a resurgence of research associated with the fundamental memory hierarchy and associated memory pipeline. The bandwidth advantages provided by stacked logic and DRAM devices have inspired research associated with eliminating the bandwidth bottlenecks associated with many applications in high performance computing. Further, recent efforts have focused...
Classification is one of the core tasks in machine learning and data mining. One of several classification models is classification rules, which use a set of if-then rules to describe a classification model. In this paper we present a set of FPGA-based compute kernels for accelerating classification rule induction. The kernels can be combined to perform specific procedures in the rule induction process,...
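An if-then rule set of the kind mentioned above can be sketched as an ordered decision list, where the first matching rule assigns the class and a default class catches everything else. This is a toy software illustration with made-up rules, not the FPGA kernels the paper presents:

```python
# Each rule is (condition, class); the first matching rule wins, and a
# default class acts as the fallback -- an ordered "decision list".
def classify(rules, default, sample):
    for condition, label in rules:
        if condition(sample):
            return label
    return default

# Hypothetical toy rule set over dict-shaped samples:
rules = [
    (lambda s: s["temp"] > 30 and s["humidity"] > 0.8, "storm"),
    (lambda s: s["temp"] > 30,                          "hot"),
]

print(classify(rules, "mild", {"temp": 35, "humidity": 0.9}))  # → storm
print(classify(rules, "mild", {"temp": 20, "humidity": 0.4}))  # → mild
```

Rule induction is the reverse problem: searching for a small rule list like this that fits labeled training data, and it is the repetitive candidate-evaluation step of that search that FPGA kernels can accelerate.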