Most stereoscopic 3D (S3D) image visual discomfort predictors use the Support Vector Regressor (SVR) as the regression model. However, other strong regression models exist, such as Random Forests (RF) and Gradient Boosted Regression Trees (GBRT). Here we study the efficacy of these regression models for S3D image visual discomfort prediction. We deployed several regression models to predict the...
This paper provides a performance evaluation of the PL330 DMA controller in a Zynq SoC-based device. Direct Memory Access (DMA) is a feature that allows computer hardware to access system memory for bulk data movement without CPU intervention. I/O devices operate at a slower speed than the CPU, but with DMA the CPU remains available for other computing tasks while data is transferred, as the CPU has to only initiate...
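The idea the abstract describes — the CPU only sets up and starts a transfer, and the engine moves the data — can be sketched as a toy descriptor-based DMA model. This is an illustration only; the names are invented and it does not use the real PL330 register or instruction interface.

```python
# Toy model of descriptor-based DMA: the CPU fills in a transfer descriptor
# and starts the engine; the copy itself is done "in hardware" (here, a
# single bulk operation standing in for the DMA engine moving data).
from dataclasses import dataclass

@dataclass
class DmaDescriptor:
    src: bytearray   # source buffer
    dst: bytearray   # destination buffer
    length: int      # bytes to move

class DmaEngine:
    def __init__(self):
        self.done = False

    def start(self, desc: DmaDescriptor) -> None:
        """CPU-side initiation: program the descriptor and kick off."""
        # The bulk copy models the engine working without per-byte CPU help.
        desc.dst[:desc.length] = desc.src[:desc.length]
        self.done = True   # models the completion interrupt/status flag

src = bytearray(b"zynq dma demo payload")
dst = bytearray(len(src))
engine = DmaEngine()
engine.start(DmaDescriptor(src, dst, len(src)))
assert dst == src and engine.done
```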
Using multiple streams can improve overall system performance by mitigating the data transfer overhead on heterogeneous systems. Prior work focuses largely on GPUs, but little is known about the performance impact on the Intel Xeon Phi. In this work, we apply multiple streams to six real-world applications on the Phi. We then systematically evaluate the performance benefits of using multiple streams...
The recent advent of stacked memory devices has led to a resurgence of research associated with the fundamental memory hierarchy and associated memory pipeline. The bandwidth advantages provided by stacked logic and DRAM devices have inspired research associated with eliminating the bandwidth bottlenecks associated with many applications in high performance computing. Further, recent efforts have focused...
The increasing programmability, performance, and cost-effectiveness of GPUs have led to widespread use of such many-core architectures to accelerate general-purpose applications. Nevertheless, tuning applications to efficiently exploit the GPU's potential is a very challenging task, especially for inexperienced programmers. This is due to the difficulty of developing a SW application for the specific...
Linux container virtualisation is gaining momentum as a lightweight technology to support cloud and distributed computing. Applications relying on container architectures may at times rely on inter-container communication, and container networking solutions are emerging to address this need. Containers can be networked together as part of an overlay network, or with actual links from the container...
Heterogeneous systems, which marry CPUs and GPUs in a range of configurations, are quickly becoming the design paradigm for today's platforms because of their impressive parallel processing capabilities. However, in many existing heterogeneous systems, the GPU is treated only as an accelerator by the CPU, working as a slave to the CPU master. Recently, however, we are starting to see the introduction...
The classification of an image scene with multiple class labels poses a significant challenge to researchers. A semantic scene may be described by multiple objects or by multiple classes; for example, a beach scene may also contain mountains or buildings in the background. This research work proposes a multi-label scene classification model using Binary Relevance (BR) based one-versus-rest...
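Binary Relevance, as named above, decomposes a multi-label problem into one independent one-versus-rest binary classifier per label and takes the union of positive predictions. A minimal sketch follows; the tiny nearest-centroid "classifier" and the toy feature vectors are stand-ins invented for illustration, not the model used in the paper.

```python
# Binary Relevance (BR): one binary "label present vs. absent" model per label.
def train_centroid(X, y):
    """Fit one binary classifier: (positive centroid, negative centroid)."""
    pos = [x for x, t in zip(X, y) if t]
    neg = [x for x, t in zip(X, y) if not t]
    mean = lambda rows: [sum(col) / len(rows) for col in zip(*rows)]
    return mean(pos), mean(neg)

def predict_one(model, x):
    cpos, cneg = model
    dist = lambda c: sum((a - b) ** 2 for a, b in zip(x, c))
    return dist(cpos) < dist(cneg)

def binary_relevance_fit(X, Y, labels):
    # one independent binary problem per label
    return {lab: train_centroid(X, [lab in ys for ys in Y]) for lab in labels}

def binary_relevance_predict(models, x):
    return {lab for lab, m in models.items() if predict_one(m, x)}

# toy scene features [sand, rock]; each image has a set of labels
X = [[0.9, 0.1], [0.8, 0.7], [0.1, 0.9]]
Y = [{"beach"}, {"beach", "mountain"}, {"mountain"}]
models = binary_relevance_fit(X, Y, {"beach", "mountain"})
assert binary_relevance_predict(models, [0.85, 0.2]) == {"beach"}
assert binary_relevance_predict(models, [0.1, 0.85]) == {"mountain"}
```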
In Linux, sysfs entries are created to let the kernel export information to user-space processes as well as to accept user input. Accesses to these entries go through the file system to locate the show and store functions registered for them. Although this method is a good way to pass input from user space to the kernel while restricting access, it is slower because it has to go through the file system...
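The registration-and-dispatch pattern the abstract describes can be modeled in user space: each entry registers a show (read) and store (write) callback, and a path lookup dispatches to them. All names below are illustrative, not the kernel API.

```python
# User-space model of sysfs: path -> registered show/store callbacks.
class SysfsAttr:
    def __init__(self, show, store):
        self.show, self.store = show, store

_registry = {}   # stands in for the sysfs tree walk through the file system

def register(path, show, store):
    _registry[path] = SysfsAttr(show, store)

def read(path):
    return _registry[path].show()      # like `cat /sys/...`

def write(path, value):
    _registry[path].store(value)       # like `echo ... > /sys/...`

_state = {"brightness": 50}
register("/sys/class/demo/brightness",
         show=lambda: str(_state["brightness"]),
         store=lambda v: _state.__setitem__("brightness", int(v)))

write("/sys/class/demo/brightness", "80")
assert read("/sys/class/demo/brightness") == "80"
```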
A novel scheme for fast browser launch is presented. Our scheme caches the frame buffer data of a launched browser using non-volatile memories, and reuses the cached data when the browser is launched later. Through implementation, we show that our scheme significantly reduces browser launch time.
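The caching idea can be sketched as: on a cold launch, render the frame buffer and save it to a persistent cache; on later launches, reuse the cached bytes instead of re-rendering. A dict stands in for the non-volatile memory here, and all names are invented for illustration.

```python
# Frame-buffer caching model: render once, reuse thereafter.
nvm_cache = {}       # stands in for the non-volatile frame-buffer cache
render_calls = 0

def render_framebuffer(url):
    """Expensive first-launch rendering (simulated)."""
    global render_calls
    render_calls += 1
    return ("pixels for " + url).encode()

def launch(url):
    if url not in nvm_cache:               # cold launch: render and cache
        nvm_cache[url] = render_framebuffer(url)
    return nvm_cache[url]                  # warm launch: reuse cached data

first = launch("https://example.com")
second = launch("https://example.com")
assert first == second and render_calls == 1   # rendered only once
```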
On embedded devices, physical memory is a critical resource. RAM should be used very efficiently without affecting the performance of the device. In-kernel memory swapping is a Linux feature that creates a RAM-based swap area and provides a form of virtual memory compression. It increases performance by using a compressed block device in RAM, instead of disk, for paging. Since in-kernel memory swapping...
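The mechanism above (zram-style swapping) can be modeled in a few lines: instead of writing an evicted page to disk, compress it into a RAM-backed store, and decompress it on swap-in. `zlib` stands in for the kernel's compressor, and the structure is a sketch, not the kernel implementation.

```python
# Toy model of a compressed, RAM-based swap area.
import zlib

ram_swap = {}    # page number -> compressed bytes (the RAM swap area)

def swap_out(page_no, page_bytes):
    ram_swap[page_no] = zlib.compress(page_bytes)

def swap_in(page_no):
    return zlib.decompress(ram_swap.pop(page_no))

page = b"A" * 4096                    # a highly compressible 4 KiB page
swap_out(7, page)
assert len(ram_swap[7]) < len(page)   # compression saves RAM
assert swap_in(7) == page             # swap-in restores the page exactly
```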
According to statistics, traditional servers suffer from low resource utilization and high energy consumption. To reduce costs, more and more companies are beginning to build virtualized servers. Server virtualization implements the mapping from virtual resources to physical resources and deals with resource contention among all VMs. Because of the complexity of virtualized server systems, it is necessary to...
Traditional storage media such as hard disks and NAND flash are relatively high-latency devices, so I/Os from and to such devices are, in most cases, completed asynchronously via interrupts. However, the introduction of ultra-low-latency devices based on next-generation non-volatile memory changes the appropriate way to complete I/O requests. A better way to complete I/O requests is therefore needed...
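The two completion styles contrasted above can be sketched side by side: sleeping until a completion "interrupt" fires, versus busy-polling a completion flag, which avoids the sleep/wake cost when the device finishes almost immediately. The device and function names below are invented for illustration.

```python
# Interrupt-style vs. polling-style I/O completion, modeled with threads.
import threading

class FakeDevice:
    def __init__(self):
        self.complete = threading.Event()

    def submit(self, delay=0.001):
        # the "device" completes the request from another thread
        threading.Timer(delay, self.complete.set).start()

def wait_interrupt(dev):
    dev.complete.wait()    # sleep until woken, like the IRQ-driven path

def wait_polling(dev):
    while not dev.complete.is_set():   # spin: no context switch
        pass

for waiter in (wait_interrupt, wait_polling):
    dev = FakeDevice()
    dev.submit()
    waiter(dev)
    assert dev.complete.is_set()
```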
Streaming dataflow applications, such as video conferencing systems, are often subjected to traffic occurring in bursts. As systems consisting of a CPU and a GPU become ubiquitous, efficient utilization of such platforms for handling bursts of data becomes an interesting problem. For GPUs to be efficient, the chunk size of the data to process must be large. The bursty nature of...
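The large-chunk requirement suggests buffering bursty input until a chunk fills before dispatching it, so the device is always handed big batches. A minimal sketch, with all names invented for illustration:

```python
# Buffer bursty items into fixed-size chunks before "GPU" dispatch.
def chunked_dispatch(stream, chunk_size, process):
    buf, results = [], []
    for item in stream:
        buf.append(item)
        if len(buf) >= chunk_size:    # chunk full: hand it to the device
            results.append(process(buf))
            buf = []
    if buf:                           # flush the final partial chunk
        results.append(process(buf))
    return results

bursty = [1, 2, 3, 4, 5, 6, 7]        # e.g. frames arriving in bursts
batches = chunked_dispatch(bursty, 3, lambda c: sum(c))
assert batches == [6, 15, 7]          # two full chunks plus one flush
```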
Architecture designers tend to integrate both CPU and GPU on the same chip to deliver energy-efficient designs. To effectively leverage the power of both CPUs and GPUs on integrated architectures, researchers have recently put substantial efforts into co-running a single application on both the CPU and the GPU of such architectures. However, few studies have been performed to analyze a wide range...
The functional simulator Simics provides a co-simulation integration path with a SystemC simulation environment to create Virtual Platforms. With increasing complexity of the SystemC models, this platform suffers from performance degradation due to the single threaded nature of the integrated Virtual Platform. In this paper, we present a multi-threaded Simics SystemC platform solution that significantly...
High Performance Computing (HPC) aggregates computing power in order to solve large and complex problems in different knowledge areas. Nowadays, HPC users can utilize virtualized infrastructures as a low-cost alternative for deploying their applications. However, virtualization brings some challenges for HPC, especially with regard to the overhead caused by hypervisors. In this work, our main goal is to analyze...
Kernel fusion is an optimization method in which the code from several kernels is combined to create a new, fused kernel. It can push the performance of kernels beyond the limits of their isolated, unfused forms. In this paper, we introduce a classification of different types of kernel fusion for both data-dependent and data-independent kernels. We study kernel fusion on three types of OpenCL devices:...
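The core of kernel fusion can be illustrated with two elementwise "kernels" run back to back versus one fused kernel that does both operations in a single pass, avoiding the intermediate result. Plain Python stands in for OpenCL kernels here; this is a sketch of the technique, not the paper's implementation.

```python
# Two separate "kernels" vs. one fused kernel over the same data.
def kernel_scale(xs, a):         # kernel 1: y = a * x
    return [a * x for x in xs]

def kernel_offset(xs, b):        # kernel 2: z = y + b
    return [x + b for x in xs]

def fused_kernel(xs, a, b):      # fused: z = a * x + b, one pass, no temp
    return [a * x + b for x in xs]

data = [1.0, 2.0, 3.0]
unfused = kernel_offset(kernel_scale(data, 2.0), 1.0)
fused = fused_kernel(data, 2.0, 1.0)
assert fused == unfused == [3.0, 5.0, 7.0]
```

The fused form touches each element once and keeps no intermediate array, which is exactly the memory-traffic saving that makes fusion attractive on bandwidth-bound devices.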
Heterogeneous systems with different types of compute devices are common nowadays in the field of High Performance Computing (HPC). This heterogeneity is not limited to compute devices, but also includes cluster nodes with different hardware configurations, leading to asymmetric cluster architectures. In such hierarchical systems, OpenCL alone is no longer sufficient: support is required to distribute...
Traditionally, programmers and software tools have focused on mapping a single data-parallel kernel onto a heterogeneous computing system consisting of multiple general-purpose processors (CPUs) and graphics processing units (GPUs). These methodologies break down as application complexity grows to include multiple communicating data-parallel kernels. This paper introduces MKMD, an automatic system...