Search results

chapter

To GPU synchronize or not GPU synchronize?

Wu-chun Feng, Shucai Xiao

Proceedings of 2010 IEEE International Symposium on Circuits and Systems > 3801 - 3804

2010 IEEE International Symposium on Circuits and Systems. ISCAS 2010

The graphics processing unit (GPU) has evolved from being a fixed-function processor with programmable stages into a programmable processor with many fixed-function components that deliver massive parallelism. By modifying the GPU's stream processor to support “general-purpose computation” on the GPU (GPGPU), applications that perform massive vector operations can realize many orders-of-magnitude...

chapter

Parameter optimization and initial value methods of PTA method for DC analysis

Zhou Jin, Dan Niu, Xiao Wu, Jianan Wang, more

2017 Eighth International Conference on Intelligent Control and Information Processing (ICICIP) > 293 - 296

2017 Eighth International Conference on Intelligent Control and Information Processing (ICICIP)

Recently, an efficient damped pseudo transient analysis method was proposed to find DC solutions for nonlinear circuits. However, the simulation efficiency is still not satisfactory. In this paper, for different kinds of circuits, we derive several groups of PTRAN algorithm control parameter values by using a parameter optimization algorithm for pseudo-transient analysis to obtain higher efficiency during...

chapter

Detecting IoT zombie attacks on web servers

Sujatha Sivabalan, P J Radcliffe

2017 27th International Telecommunication Networks and Applications Conference (ITNAC) > 1 - 3

2017 27th International Telecommunication Networks and Applications Conference (ITNAC)

Internet of Things (IoT) devices pose a serious threat to the web, as poorly configured or faulty devices can be used for massive Distributed Denial of Service attacks. Hijacked IoT devices that act like real users are a particular problem that presents significant difficulties for traditional detection methods. An adaptive, real-time scoring system for detecting such attacks is proposed that does...

chapter

Fault emulation on heterogeneous architectures

Abdullah Yildiz, Cemil Cem Gursoy, Sezer Goren

2017 International Conference on Computer Science and Engineering (UBMK) > 905 - 910

2017 International Conference on Computer Science and Engineering (UBMK)

This paper presents an implementation of the fault emulation method, which is very important in today's chip tests, on a platform with a heterogeneous architecture. The increasing number of transistors in electronic circuits has brought forward fault emulation, which is faster than fault simulation, as a way to obtain a test set against possible defects on chips. In this method, a hardware model...

chapter

GPU-accelerated particle swarm optimization for selective harmonic elimination in multilevel converters with unequal DC levels

Kehu Yang, Haotian Li, Yao Huang, Qi Zhang, more

IECON 2017 - 43rd Annual Conference of the IEEE Industrial Electronics Society > 1186 - 1191

IECON 2017 - 43rd Annual Conference of the IEEE Industrial Electronics Society

Particle Swarm Optimization (PSO) has been widely used to solve the selective harmonic elimination (SHE) problem; however, its execution efficiency is poor when it is implemented on traditional Central Processing Units (CPUs). In this paper, PSO is implemented in parallel on the Graphics Processing Unit (GPU) under the Compute Unified Device Architecture (CUDA). Then, the GPU-accelerated...

chapter

Evaluation of GPU/CPU co-processing models for JPEG 2000 packetization

Volker Bruns, Miguel A. Martinez-del-Amor, Heiko Sparenberg

2017 IEEE 19th International Workshop on Multimedia Signal Processing (MMSP) > 1 - 6

2017 IEEE 19th International Workshop on Multimedia Signal Processing (MMSP)

With the bottom-line goal of increasing the throughput of a GPU-accelerated JPEG 2000 encoder, this paper evaluates whether the post-compression rate control and packetization routines should be carried out on the CPU or on the GPU. Three co-processing models that differ in how the workload is split among the CPU and GPU are introduced. Both routines are discussed and algorithms for executing them...

chapter

High-performance mesoscopic traffic simulation with GPU for large scale networks

Vinh An Vu, Gary Tan

2017 IEEE/ACM 21st International Symposium on Distributed Simulation and Real Time Applications (DS-RT) > 1 - 9

2017 IEEE/ACM 21st International Symposium on Distributed Simulation and Real Time Applications (DS-RT)

Mesoscopic Traffic Simulation is an important tool in traffic analysis and traffic management support. The balance between traffic modeling details and performance has made Mesoscopic Traffic Simulation one of the key solutions for traffic controllers and policy makers. Mesoscopic traffic simulators offer acceptable speed in simulating normal traffic. However, when traffic prediction and optimization...

chapter

Performance characterization, prediction, and optimization for heterogeneous systems with multi-level memory interference

Shin-Ying Lee, Carole-Jean Wu

2017 IEEE International Symposium on Workload Characterization (IISWC) > 43 - 53

2017 IEEE International Symposium on Workload Characterization (IISWC)

Modern computer systems are accelerator-rich, equipped with many types of hardware accelerators to speed up computation. For example, graphics processing units (GPUs) are a type of accelerator widely employed to speed up parallel workloads. To make good use of different accelerators for better execution-time speedup or lower total energy consumption, many scheduling algorithms...

chapter

adCFS: Adaptive completely fair scheduling policy for containerised workflows systems

Eidah J. Alzahrani, Zahir Tari, Young Choon Lee, Deafallah Alsadie, more

2017 IEEE 16th International Symposium on Network Computing and Applications (NCA) > 1 - 8

2017 IEEE 16th International Symposium on Network Computing and Applications (NCA)

Scientific workflows are increasingly containerised, which requires rethinking central processing unit (CPU) sharing policies to accommodate different workload types. However, container engines running scientific workflows struggle to share the CPU fairly, as workload characteristics are not taken into account. This paper proposes a sharing policy called the Adaptive Completely Fair Scheduling policy...

chapter

Improved SPICE3 implementation algorithms of compound element pseudo-transient analysis for solving nonlinear dc circuits

Zhou Jin, Dan Niu, Xiao Wu

2017 Chinese Automation Congress (CAC) > 7819 - 7822

2017 Chinese Automation Congress (CAC)

This paper presents three implementation algorithms of compound element pseudo-transient analysis (CEPTA) to find DC solutions for nonlinear LSI circuits. In previous research, CEPTA was implemented in a SPICE-like simulator, with the merit that the size of the Jacobian matrix is not expanded during the calculation. While the inserted pseudo parts are converted to certain equivalent circuits, the conventional...

chapter

WCET analysis of the shared data cache in integrated CPU-GPU architectures

Yijie Huangfu, Wei Zhang

2017 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 7

2017 IEEE High Performance Extreme Computing Conference (HPEC)

By taking advantage of both the CPU and GPU as well as the shared DRAM and cache, the integrated CPU-GPU architecture has the potential to boost performance for a variety of applications, including real-time applications. However, before being applied to hard real-time and safety-critical applications, the time-predictability of the integrated CPU-GPU architecture needs to be studied...

chapter

A reduced-complexity, reduced-power camera system for intrusion classification in an outdoor setting

Tarun Choubisa, Sampad B. Mohanty, Kodur Krishna Chaitanya, Mohan Kashyap, more

2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 1155 - 1162

2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI)

An optical camera was recently employed by a subset of the authors as a sensing modality complementary to that of a Pyroelectric InfraRed (PIR) sensor, for carrying out intrusion detection and classification in an outdoor environment. The aim there was to develop a classification algorithm that mimicked the performance of the PIR sensor and which was complementary to the PIR in the sense that it could...

chapter

Accelerated kerninghan lin algorithm for graph partitioning

Archana K Rajan, Deepika Bhaiya

2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 174 - 178

2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI)

Grouping the vertices of a graph into sets of certain sizes such that a minimum number of edges cross between the sets is called graph partitioning. This NP (Non-deterministic Polynomial time)-complete problem has important applications in computing, task scheduling, and parallel processing. We are implementing Kernighan-Lin, a local algorithm, on both a Central Processing Unit (CPU) and a Graphics Processing...

chapter

Mixed Time-Criticality Process Interferences Characterization on a Multicore Linux System

Federico Reghenzani, Giuseppe Massari, William Fornaciari

2017 Euromicro Conference on Digital System Design (DSD) > 427 - 434

2017 Euromicro Conference on Digital System Design (DSD)

The increasing interest in the integration of Mixed Criticality Systems (MCS) in Commercial-Off-The-Shelf (COTS) platforms leads to an increasing number of challenges. The possibility of sharing computing resources among applications with different time criticalities is a key goal for COTS systems, but still hard to achieve. Classical approaches in real-time systems are not feasible when platform...

chapter

A fast nonlinear placement algorithm to improve agricultural mechanization

Wenchao Gao, Xiaoming Xu, Lisheng Xu, Pu Liu, more

2017 6th International Conference on Agro-Geoinformatics > 1 - 6

2017 6th International Conference on Agro-Geoinformatics

Agricultural mechanization has a far-reaching impact on agricultural productivity and social development. The emergence of VLSI (Very Large-Scale Integrated circuits) makes full intelligence and automation of agricultural products possible. VLSI placement now faces a double challenge: integration scale and circuit performance. From the experimental results, we find current...

chapter

Large-Scale Parallelization of Smoothed Particle Hydrodynamics Method on Heterogeneous Cluster

Yingrui Wang, Leisheng Li, Rong Tian

2017 46th International Conference on Parallel Processing (ICPP) > 21 - 30

2017 46th International Conference on Parallel Processing (ICPP)

This paper implements a Smoothed Particle Hydrodynamics simulation code and distributes it on a heterogeneous cluster. The theoretical analysis results show that treating the GPU as an equivalent peer of the CPU, rather than as an assistant or a substitute, is the most efficient way of using a CPU+GPU compute node. However, it raises complex challenges of heterogeneous cooperation. Our strategies of hybrid-level...

chapter

A time aware processor (TAP): A simple embedded example

John D MacKay, Henry McHenry

2017 IEEE International Symposium on Precision Clock Synchronization for Measurement, Control, and Communication (ISPCS) > 1 - 5

2017 IEEE International Symposium on Precision Clock Synchronization for Measurement, Control, and Communication (ISPCS)

A central processing unit (CPU) and peripheral devices are discussed for which all data processing and data transfer are uniquely time tagged using a timestamp generated by the embedded processing system master clock. The Time Aware Processor (TAP) introduces time into the processor computing language to relate data to temporal events, including the processor's own internal functions.

chapter

Autonomous Power Management for Embedded Systems Using a Non-linear Power Predictor

Sidartha Azevedo Lobo De Carvalho, Daniel Carvalho Da Cunha, Abel Guilhermino Da Silva-Filho

2017 Euromicro Conference on Digital System Design (DSD) > 22 - 29

2017 Euromicro Conference on Digital System Design (DSD)

Embedded systems execute applications that exercise the hardware differently depending on the computation task, generating workloads that vary with time. Energy can be minimized by finding the optimal CPU frequency for each workload. We propose an autonomous and online approach, capable of minimizing energy through adaptation to these workload variations even in an unknown environment. In...

chapter

Speeding up tone mapping operators: Exploiting parallelism for real-time, high dynamic range video

Ziad Youssfi, Firas Hassan

2017 IEEE 60th International Midwest Symposium on Circuits and Systems (MWSCAS) > 192 - 195

2017 IEEE 60th International Midwest Symposium on Circuits and Systems (MWSCAS)

Tone mapping operators map high dynamic range images so that they can be displayed with a high dynamic range appearance in a limited range medium. However, due to their large computational complexity, sequential implementations of these operators on a CPU cannot achieve the frame rate needed for real-time video image processing. In this paper, we revisit these operators to simplify them so that we can...

chapter

Distributed computational load balancing for real-time applications

Saurav Sthapit, James R. Hopgood, John Thompson

2017 25th European Signal Processing Conference (EUSIPCO) > 1385 - 1389

2017 25th European Signal Processing Conference (EUSIPCO)

Mobile Cloud Computing or Fog computing refers to offloading computationally intensive algorithms from a mobile device to a cloud or an intermediate cloud in order to save resources (time and energy) on the mobile device. In this paper, we look at an alternative solution for when the cloud or fog is not available. We model sensors using a network of queues and use linear programming to make scheduling decisions...

INFONA - science communication portal

