Search results

chapter

Accelerated Processing Unit (APU) potential: N-body simulation case study

Hassan Youness, Mohamed Moness, Omar Shaaban, Aziza I. Hussein

2016 11th International Conference on Computer Engineering & Systems (ICCES) > 110 - 115

2016 11th International Conference on Computer Engineering & Systems (ICCES)

This paper investigates and studies the acceleration of irregular/regular algorithms via Integrate Graphic Processing Unit (Integrated GPU) known as Accelerated Processing Unit (APU) that is fused on the same die with the CPU, and Discrete Graphic Processing Unit (GPU), while answering the question of How potential is the APU for applications with iregular data structures such as trees knowing that...

chapter

Heterogeneous Computing Platform Based on CPU+FPGA and Working Modes

Yang Li, Xiaodong Zhao, Taoran Cheng

2016 12th International Conference on Computational Intelligence and Security (CIS) > 669 - 672

2016 12th International Conference on Computational Intelligence and Security (CIS)

Currently, with the development of high performance computing, multicore system and heterogeneous system have become the transformation that is taking place. However, Promoting performance of processor has encountered bottlenecks of heat and power by means of Moore's Law, one or more CPUs can't meet requirements of a large number of computing. The use of Heterogeneous Computing Platform is becoming...

chapter

Understanding performance of I/O intensive containerized applications for NVMe SSDs

Janki Bhimani, Jingpei Yang, Zhengyu Yang, Ningfang Mi, more

2016 IEEE 35th International Performance Computing and Communications Conference (IPCCC) > 1 - 8

2016 IEEE 35th International Performance Computing and Communications Conference (IPCCC)

Our cloud-based IT world is founded on hyper-visors and containers. Containers are becoming an important cornerstone, which is increasingly used day-by-day. Among different available frameworks, docker has become one of the major adoptees to use containerized platform in data centers and enterprise servers, due to its ease of deploying and scaling. Further more, the performance benefits of a lightweight...

article

On Soft Error Reliability of Virtualization Infrastructure

Xin Xu, H. Howie Huang

IEEE Transactions on Computers > 2016 > 65 > 12 > 3727 - 3739

Hardware errors are no longer exceptions in modern cloud data centers. Although virtualization provides software failure isolation among different virtual machines (VM), the virtualization infrastructure including the hypervisor and privileged VMs remains vulnerable to hardware errors. What makes matters worse is that such errors are unlikely bounded by the virtualization boundary and may lead to...

chapter

An VM Scheduling Strategy Based on Hierarchy and Load for OpenStack

Zhigang Xu, Limin Xiao, Weidian Zhan, Xichun Yue, more

2016 7th International Conference on Cloud Computing and Big Data (CCBD) > 58 - 63

2016 7th International Conference on Cloud Computing and Big Data (CCBD)

In the cloud computing environment, one of the most important module is the Scheduler. As the most popular open-source cloud platform, OpenStack provides us with a massive amount of scheduling strategies. But there is no one considering of the hierarchies of the VMs and hosts. We will guarantee the security of VM through these hierarchies. Although OpenStack is abundant in scheduling strategies, none...

chapter

E³M: An Energy Efficient Emergency Management System using mobile cloud computing

Chhabi Rani Panigrahi, Joy Lal Sarkar, Bibudhendu Pati, Sambit Bakshi

2016 IEEE International Conference on Advanced Networks and Telecommunications Systems (ANTS) > 1 - 6

2016 IEEE International Conference on Advanced Networks and Telecommunications Systems (ANTS)

Mobile devices play a vital role for handling emergency situations. During emergency, it is very difficult to collect necessary information from the mobile devices if there is unavailability of networks. In this work, an Energy Efficient Emergency Management System named as E³M has been proposed. E³M supports peer-to-peer communication between mobile devices if a mobile device does not find any suitable...

chapter

A process-scheduling simulator based on virtual reality technology

Marcelo de Paiva Guimaraes, Vagner Scamati, Mario Popolin Neto, Valeria Farinazzo Martins, more

2016 IEEE/ACS 13th International Conference of Computer Systems and Applications (AICCSA) > 1 - 6

2016 IEEE/ACS 13th International Conference of Computer Systems and Applications (AICCSA)

A process-scheduling algorithm is a fundamental operating system function that manages the assignment of CPU (Central Processing Unit) processes. It aims to make the system efficient, fast, and fair, allowing as many processes as possible to make the best use of the CPU at any given time. Understanding scheduling algorithms and their impact in practice is a challenging and time-consuming task for...

chapter

Extended Task Queuing: Active Messages for Heterogeneous Systems

Michael LeBeane, Brandon Potter, Abhisek Pan, Alexandru Dutu, more

SC16: International Conference for High Performance Computing, Networking, Storage and Analysis > 933 - 944

SC16: International Conference for High Performance Computing, Networking, Storage and Analysis

Accelerators have emerged as an important component of modern cloud, datacenter, and HPC computing environments. However, launching tasks on remote accelerators across a network remains unwieldy, forcing programmers to send data in large chunks to amortize the transfer and launch overhead. By combining advances in intra-node accelerator unification with one-sided Remote Direct Memory Access (RDMA)...

chapter

Understanding Error Propagation in GPGPU Applications

Guanpeng Li, Karthik Pattabiraman, Chen-Yang Cher, Pradip Bose

SC16: International Conference for High Performance Computing, Networking, Storage and Analysis > 240 - 251

SC16: International Conference for High Performance Computing, Networking, Storage and Analysis

GPUs have emerged as general-purpose accelerators in high-performance computing (HPC) and scientific applications. However, the reliability characteristics of GPU applications have not been investigated in depth. While error propagation has been extensively investigated for non-GPU applications, GPU applications have a very different programming model which can have a significant effect on error propagation...

chapter

Efficient implementation of sobel filter based on GPUs cards

Mouna Afif, Yahia Said, Haythem Bahri, Mohamed Atri

2016 International Image Processing, Applications and Systems (IPAS) > 1 - 4

2016 International Image Processing, Applications and Systems (IPAS)

The Graphics processors or GPUs have become in a few years powerful tools for applications that require a massively parallel computing. Currently include the applications in multimedia processing, the engineering science and image processing in real time. They offer many advantages such as acceleration of treatment and down energy consumption from an equivalent CPU power. In this paper, we will show...

chapter

GreenLA: Green Linear Algebra Software for GPU-accelerated Heterogeneous Computing

Jieyang Chen, Li Tan, Panruo Wu, Dingwen Tao, more

SC16: International Conference for High Performance Computing, Networking, Storage and Analysis > 667 - 677

SC16: International Conference for High Performance Computing, Networking, Storage and Analysis

While many linear algebra libraries have been developed to optimize their performance, no linear algebra library considers their energy efficiency at the library design time. In this paper, we present GreenLA - an energy efficient linear algebra software package that leverages linear algebra algorithmic characteristics to maximize energy savings with negligible overhead. GreenLA is (1) energy efficient:...

chapter

Exploring heterogeneous computing with advanced path tracing algorithms

Andre Oliveira, Cesar Perdigao, Luis Paulo Santos, Alberto Proenca

2016 23° Encontro Português de Computação Gráfica e Interação (EPCGI) > 1 - 8

2016 23° Encontro Português de Computação Gráfica e Interação (EPCGI)

The CG research community has a renewed interest on rendering algorithms based on path space integration, mainly due to new approaches to discover, generate and exploit relevant light paths while keeping the numerical integrator unbiased or, at the very least, consistent. Simultaneously, the current trend towards massive parallelism and heterogeneous environments, based on a mix of conventional computing...

chapter

An Efficient Parallel Implementation of a Light-weight Data Privacy Method for Mobile Cloud Users

Mehdi Bahrami, Dong Li, Mukesh Singhal, Ashish Kundu

2016 Seventh International Workshop on Data-Intensive Computing in the Clouds (DataCloud) > 51 - 58

2016 Seventh International Workshop on Data-Intensive Computing in the Clouds (DataCloud)

Cloud computing provides an opportunity to users to outsource their data and applications. However, data privacy is one of the key challenges for the users who are outsourcing data on some transparent cloud servers. Data encryption is the best option to protect users' data privacy on the cloud. However, computation overheads of encryption methods could be expensive to some small computing machines,...

chapter

A Project-Based HPC Course for Single-Box Computers

Carlos Bederian, Nicolas Wolovick

2016 Workshop on Education for High-Performance Computing (EduHPC) > 1 - 6

2016 Workshop on Education for High-Performance Computing (EduHPC)

Throughout three iterations and six years we have developed a project-based course in HPC for single-box computers tailored to science students in general. The course is based on strong premises: showing that assembly is what actually runs on machines, dividing parallelism in three dimensions (ILP, DLP, TLP), and using them incrementally in a single numerical simulation throughout the course working...

chapter

Dynamic Load Balancing for High-Performance Graph Processing on Hybrid CPU-GPU Platforms

Stijn Heldens, Ana Lucia Varbanescu, Alexandru Iosup

2016 6th Workshop on Irregular Applications: Architecture and Algorithms (IA3) > 62 - 65

2016 6th Workshop on Irregular Applications: Architecture and Algorithms (IA3)

Graph analysis is becoming increasingly important in many research fields - biology, social sciences, data mining - and daily applications - path finding, product recommendation. Many different large-scale graph-processing systems have been proposed for different platforms. However, little effort has been placed on designing systems for hybrid CPU-GPU platforms.In this work, we present HyGraph, a...

chapter

Topology and Affinity Aware Hierarchical and Distributed Load-Balancing in Charm++

Emmanuel Jeannot, Guillaume Mercier, Francois Tessier

2016 First International Workshop on Communication Optimizations in HPC (COMHPC) > 63 - 72

2016 First International Workshop on Communication Optimizations in HPC (COMHPC)

The evolution of massively parallel supercomputers make palpable two issues in particular: the load imbalance and the poor management of data locality in applications. Thus, with the increase of the number of cores and the drastic decrease of amount of memory per core, the large performance needs imply to particularly take care of the load-balancing and as much as possible of the locality of data...

chapter

Performance Evaluation of Parallelizing Algorithm Using Spanning Tree for Stream-Based Computing

Guyue Wang, Koichi Wada, Shinichi Yamagiwa

2016 Fourth International Symposium on Computing and Networking (CANDAR) > 497 - 503

2016 Fourth International Symposium on Computing and Networking (CANDAR)

This paper proposes a detailed performance evaluation of an algorithm using spanning tree that automatically exploits the parallelism and determines an execution order of multiple kernel programs in distributed environment. In stream-based computing, efficient parallel execution requires careful scheduling of the invocation of the kernel programs. By mapping a kernel to a node and an I/O stream between...

chapter

Variable preconditioned Krylov subspace method with communication avoiding technique for electromagnetic analysis

Soichiro Ikuno, Gong Chen, Taku Itoh, Susumu Nakata, more

2016 IEEE Conference on Electromagnetic Field Computation (CEFC) > 1

2016 IEEE Conference on Electromagnetic Field Computation (CEFC)

The Variable Preconditioned (VP) Krylov subspace method with communication avoiding (CA) technique is adopted for the solver of a linear system obtained from electromagnetic analysis, and the numerical features are investigated. A massive communication time between processing units or GPU/MIC is the problem for parallelization efficiency that needs to be solved. Although κ-skip Krylov subspace method...

chapter

RNS-Based Data Representation for Handling Multiple-Precision Integers on Parallel Architectures

Konstantin Isupov, Vladimir Knyazkov

2016 International Conference on Engineering and Telecommunication (EnT) > 76 - 79

2016 International Conference on Engineering and Telecommunication (EnT)

In most computer programs and general-purpose computing environments, the precision of any calculation is limited by the word size of the computer. However, for some applications, such as cryptography, this precision is not sufficient. In these cases, it is necessary to use multiple-precision numbers. Operations on such numbers in most computer software are implemented by third party libraries that...

chapter

Automatic configuration of ROS applications for near-optimal performance

Jose Cano, Alejandro Bordallo, Vijay Nagarajan, Subramanian Ramamoorthy, more

2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) > 2217 - 2223

2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

The performance of a ROS application is a function of the individual performance of its constituent nodes. Since ROS nodes are typically configurable (parameterised), the specific parameter values adopted will determine the level of performance generated. In addition, ROS applications may be distributed across multiple computation devices, thus providing different options for node allocation. We address...

INFONA - science communication portal

Search results

Accelerated Processing Unit (APU) potential: N-body simulation case study

Heterogeneous Computing Platform Based on CPU+FPGA and Working Modes

Understanding performance of I/O intensive containerized applications for NVMe SSDs

On Soft Error Reliability of Virtualization Infrastructure

An VM Scheduling Strategy Based on Hierarchy and Load for OpenStack

E³M: An Energy Efficient Emergency Management System using mobile cloud computing

A process-scheduling simulator based on virtual reality technology

Extended Task Queuing: Active Messages for Heterogeneous Systems

Understanding Error Propagation in GPGPU Applications

Efficient implementation of sobel filter based on GPUs cards

GreenLA: Green Linear Algebra Software for GPU-accelerated Heterogeneous Computing

Exploring heterogeneous computing with advanced path tracing algorithms

An Efficient Parallel Implementation of a Light-weight Data Privacy Method for Mobile Cloud Users

A Project-Based HPC Course for Single-Box Computers

Dynamic Load Balancing for High-Performance Graph Processing on Hybrid CPU-GPU Platforms

Topology and Affinity Aware Hierarchical and Distributed Load-Balancing in Charm++

Performance Evaluation of Parallelizing Algorithm Using Spanning Tree for Stream-Based Computing

Variable preconditioned Krylov subspace method with communication avoiding technique for electromagnetic analysis

RNS-Based Data Representation for Handling Multiple-Precision Integers on Parallel Architectures

Automatic configuration of ROS applications for near-optimal performance

Filter options

Publication date

Content availability

Publication type

Publication language

Keywords

Data set

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Publication type

Publication language

Keywords

Data set

Reporting an error / abuse

Sending the report failed

Accessibility options