Search results

chapter

GPU based parallel image processing library for embedded systems

Mustafa Cavus, Hakki Doganer Sumerkan, Osman Seckin Simsek, Hasan Hassan, more

2014 International Conference on Computer Vision Theory and Applications (VISAPP) > 1 > 234 - 241

2014 International Conference on Computer Vision Theory and Applications (VISAPP)

Embedded image processing systems have many challenges, due to large computational requirements and other physical, power, and environmental constraints. However recent contemporary mobile devices include a graphical processing unit (GPU) in order to offer better use interface in terms of graphics. Some of these embedded GPUs also support OpenCL which allows the use of computation capacity of embedded...

chapter

Lightweight virtual memory support for many-core accelerators in heterogeneous embedded SoCs

Pirmin Vogel, Andrea Marongiu, Luca Benini

2015 International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS) > 45 - 54

2015 International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS)

While high-end heterogeneous systems are increasingly supporting heterogeneous uniform memory access (hUMA) as envisioned by the Heterogeneous System Architecture (HSA) foundation, their low-power counterparts targeting the embedded domain still lack basic features like virtual memory support for accelerators. As opposed to simply passing virtual address pointers, explicit data management involving...

chapter

A comparison of windows physical memory acquisition tools

Waqas Ahmed, Baber Aslam

MILCOM 2015 - 2015 IEEE Military Communications Conference > 1292 - 1297

MILCOM 2015 - 2015 IEEE Military Communications Conference

Memory forensics analysis is an important area of digital forensics especially in incident response, malware analysis and behavior analysis (of application and system software) in physical memory. Traditional digital forensics, such as investigating non-volatile storage, cannot be used to establish the current state of the system (including network connections) or for analysis of malwares that use...

chapter

Performance Evaluation of Hypervisors for HPC Applications

David Beserra, Felipe Oliveira, Jean Araujo, Felipe Fernandes, more

2015 IEEE International Conference on Systems, Man, and Cybernetics > 846 - 851

2015 IEEE International Conference on Systems, Man, and Cybernetics (SMC)

High Performance Computing (HPC) aggregates computing power in order to solve large and complex problems in different knowledge areas. Nowadays, HPC users can utilize virtualized infrastructures as a low-cost alternative to deploy their applications. However, virtualization brings some challenges for HPC, specially in regard to overhead caused by hyper visors. In this work, our main goal is to analyze...

chapter

Coarse-grain reconfigurable ASIC through multiplexer based switches

Karen Gettings, Marc Burke, Jeremy Muldavin, Michael Vai

2015 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 4

2015 IEEE High Performance Extreme Computing Conference (HPEC)

We present an ASIC architecture with coarse-grain reconfigurability that uses accelerators to improve performance over fine-grain reconfigurable architectures. A reconfigurable FFT ASIC was built as a proof of concept, and it successfully demonstrated valid switch operation for reconfiguration.

chapter

Scalable Relativistic High-Resolution Shock-Capturing for Heterogeneous Computing

Forrest Wolfgang Glines, Matthew Anderson, David Neilsen

2015 IEEE International Conference on Cluster Computing > 611 - 618

2015 IEEE International Conference on Cluster Computing (CLUSTER)

A shift is underway in high performance computing (HPC) towards heterogeneous parallel architectures that emphasize medium and fine grain thread parallelism. Many scientific computing algorithms, including simple finite-differencing methods, have already been mapped to heterogeneous architectures with order-of-magnitude gains in performance as a result. Recent case studies examining high-resolution...

chapter

Understanding the Propagation of Error Due to a Silent Data Corruption in a Sparse Matrix Vector Multiply

Jon Calhoun, Marc Snir, Luke Olson, Maria Garzaran

2015 IEEE International Conference on Cluster Computing > 541 - 542

2015 IEEE International Conference on Cluster Computing (CLUSTER)

With the rate of errors that silently effect an application's state/output expected to increase in future HPC machines, numerous mitigation schemes have been proposed, but little work has been done investigating why these schemes detect some error while other is masked. This paper investigates how silent data corruption (SDC) propagates through a sparse matrix vector multiply (SpMV), a fundamental...

chapter

Transplantation and realization of VxWorks6.7 porting based on Loongson1B hardware platform

Lu Huihui, Yang Kun

2015 6th IEEE International Conference on Software Engineering and Service Science (ICSESS) > 520 - 524

2015 6th IEEE International Conference on Software Engineering and Service Science (ICSESS)

As the advantages of high performance and low power, the Loongson-1 processor has wide application prospects in industrial control, high-performance embedded, and other fields. Now the Loongson series platforms are mostly based on Linux operating system. However, VxWorks is a better choice for its high real-time performance and high reliability in the field of industrial control and high-performance...

chapter

Efficient memory reclaiming for mitigating sluggish response in mobile devices

Minho Ju, Hyeonggyu Kim, Mincheol Kang, Soontae Kim

2015 IEEE 5th International Conference on Consumer Electronics - Berlin (ICCE-Berlin) > 232 - 236

2015 IEEE 5th International Conference on Consumer Electronics - Berlin (ICCE-Berlin)

Mobile devices based on flash memory have unique hardware characteristics. They have different memory management mechanisms such as no memory swapping and app cache. Android platform adopts new modules such as low memory killer (LMK), activity manager service (AMS) besides kswapd and out of memory killer (OOMK). However, these modules generate many Kernel function calls that incur sluggish responses...

chapter

A noise suppressing filter design for reducing deconvolution error of both-directions downward sloped asymmeric RTN long-tail distributions

Hiroyuki Yamauchi, Worawit Somha

2015 International Workshop on CMOS Variability (VARI) > 51 - 56

2015 International Workshop on CMOS Variability (VARI)

A noise suppressing filter design technique to reduce deconvolution error of both-directions downward sloped asymmetrical long-tail distribution of the Random Telegraph Noise (RTN) is proposed. The filter is used in Lucy-Richardson-deconvolution (LRDec) iteration process. The deconvolution is required for inversely analyzing RTN long tail distribution effects on VLSI time-dependent operating margin...

chapter

Scaling number of cores in GPGPU: A comparative performance analysis

Winnie Thomas, Rohin D. Daruwala

2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 501 - 507

2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI)

The Single Instruction Multiple Thread (SIMT) architecture based, Graphic Processing Units (GPUs) are emerging as more efficient than Multiple Instruction Multiple Data (MIMD) architectures in exploiting parallelism. A GPU has numerous shader cores and thousands of simultaneous finegrained active threads. These threads are grouped into Cooperative Thread Arrays (CTAs). All the threads within a CTA...

chapter

VLSI architecture of Pairwise Linear SVM for facial expression recognition

Sumeet Saurav, Anil K Saini, Sanjay Singh, Ravi Saini, more

2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 521 - 527

2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI)

In this paper, we present VLSI architecture of Pairwise Linear Support Vector Machine (SVM) classifier for multi-classification on FPGA. The objective of this work is to facilitate real time classification of the facial expressions into three categories: neutral, happy and pain, which could be used in a typical patient monitoring system. Thus, the challenge here is to achieve good performance without...

chapter

Enhanced page Reclaim for Android devices

Balakrishnan Jayavel, Subbaramaiah Mandava, Jyoti Johri

2015 Eighth International Conference on Contemporary Computing (IC3) > 459 - 462

2015 Eighth International Conference on Contemporary Computing (IC3)

Smartphones emerge as one of the most coherent companion for humans over past few years. A memory crunch situation makes the user feel sluggishness while accessing the applications. So, Linux starts retrieving memory using kswapd or Direct Reclaim followed by Android Low Memory Killer which identifies victim processes to be killed on the basis of defined criteria until sufficient amount of memory...

chapter

An FPGA Memory Hierarchy for High-level Synthesized OpenCL Kernels

Hsiang-Yu Tseng, Ssu-Ting Liu, Sheng-De Wang

2015 IEEE 17th International Conference on High Performance Computing and Communications, 2015 IEEE 7th International Symposium on Cyberspace Safety and Security, and 2015 IEEE 12th International Conference on Embedded Software and Systems > 1719 - 1724

2015 IEEE 17th International Conference on High Performance Computing and Communications (HPCC), 2015 IEEE 7th International Symposium on Cyberspace Safety and Security (CSS) and 2015 IEEE 12th International Conf on Embedded Software and Systems (ICESS)

In this paper, we propose an FPGA memory hierarchy based on the OpenCL memory model. The memory hierarchy allows application-specific memory optimizations during design compilation using information provided in OpenCL kernels. With the proposed memory hierarchy, FPGA application developers can focus on their designs in OpenCL kernel codes, and their designs can be synthesized into FPGA hardware via...

chapter

Forensic analysis of windows user space applications through heap allocations

Michael Cohen

2015 IEEE Symposium on Computers and Communication (ISCC) > 237 - 244

2015 IEEE Symposium on Computers and Communication (ISCC)

Memory analysis is now used routinely for incident response and forensic applications. Current memory analysis techniques are very effective in finding kernel artifacts of significance to the forensic investigator. However, the analysis of user space applications has not received enough attention so far. We identify the lack of pagefile support in analysis and acquisition as a major hurdle in the...

chapter

Permutation based indexing for high dimensional data on GPU architectures

Martin Krulis, Hasmik Osipyan, Stephane Marchand-Maillet

2015 13th International Workshop on Content-Based Multimedia Indexing (CBMI) > 1 - 6

2015 13th International Workshop on Content-Based Multimedia Indexing (CBMI)

Permutation-based indexing is one of the most popular techniques for the approximate nearest-neighbor search problem in high-dimensional spaces. Due to the exponential increase of multimedia data, the time required to index this data has become a serious constraint of the indexing techniques. One of the possible steps towards faster index construction is utilization of massively parallel platforms...

chapter

An Evaluation of Unified Memory Technology on NVIDIA GPUs

Wenqiang Li, Guanghao Jin, Xuewen Cui, Simon See

2015 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing > 1092 - 1098

2015 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid)

Unified Memory is an emerging technology which is supported by CUDA 6.X. Before CUDA 6.X, the existing CUDA programming model relies on programmers to explicitly manage data between CPU and GPU and hence increases programming complexity. CUDA 6.X provides a new technology which is called as Unified Memory to provide a new programming model that defines CPU and GPU memory space as a single coherent...

chapter

Windows NT pagefile.sys Virtual Memory Analysis

Michael Gruhn

2015 Ninth International Conference on IT Security Incident Management & IT Forensics > 3 - 18

2015 Ninth International Conference on IT Security Incident Management & IT Forensics (IMF)

As hard disk encryption, RAM disks, persistent data avoidance technology and memory resident malware become morewidespread, memory analysis becomes more important. In order to provide more virtual memory than is actually physicalpresent on a system, an operating system may transfer frames of memory to a pagefile on persistent storage. Current memoryanalysis software does not incorporate such pagefiles...

chapter

A study of application performance with non-volatile main memory

Yiying Zhang, Steven Swanson

2015 31st Symposium on Mass Storage Systems and Technologies (MSST) > 1 - 10

2015 31st Symposium on Mass Storage Systems and Technologies (MSST)

Attaching next-generation non-volatile memories (NVMs) to the main memory bus provides low-latency, byte-addressable access to persistent data that should significantly improve performance for a wide range of storage-intensive workloads. We present an analysis of storage application performance with non-volatile main memory (NVMM) using a hardware NVMM emulator that allows fine-grain tuning of NVMM...

chapter

CORDIC architecture based 2-D DCT and IDCT for image compression

Sayali R. Bhaisare, Aniket V. Gokhale, Pravin K. Dakhole

2015 International Conference on Communications and Signal Processing (ICCSP) > 1473 - 1477

2015 International Conference on Communications and Signal Processing (ICCSP)

Generally, 2-D DCT/IDCT (Two dimensional discrete cosine transform and its inverse) are widely used in many image processing systems. In this paper, efficient architectures are proposed. These architectures have parallel and pipelined structures which are used to implement 8×8 DCT/IDCT processors. These processors involve two 8-point DCT/IDCT processors along with a dual-bank of SRAM (128 words) and...

INFONA - science communication portal

Search results

GPU based parallel image processing library for embedded systems

Lightweight virtual memory support for many-core accelerators in heterogeneous embedded SoCs

A comparison of windows physical memory acquisition tools

Performance Evaluation of Hypervisors for HPC Applications

Coarse-grain reconfigurable ASIC through multiplexer based switches

Scalable Relativistic High-Resolution Shock-Capturing for Heterogeneous Computing

Understanding the Propagation of Error Due to a Silent Data Corruption in a Sparse Matrix Vector Multiply

Transplantation and realization of VxWorks6.7 porting based on Loongson1B hardware platform

Efficient memory reclaiming for mitigating sluggish response in mobile devices

A noise suppressing filter design for reducing deconvolution error of both-directions downward sloped asymmeric RTN long-tail distributions

Scaling number of cores in GPGPU: A comparative performance analysis

VLSI architecture of Pairwise Linear SVM for facial expression recognition

Enhanced page Reclaim for Android devices

An FPGA Memory Hierarchy for High-level Synthesized OpenCL Kernels

Forensic analysis of windows user space applications through heap allocations

Permutation based indexing for high dimensional data on GPU architectures

An Evaluation of Unified Memory Technology on NVIDIA GPUs

Windows NT pagefile.sys Virtual Memory Analysis

A study of application performance with non-volatile main memory

CORDIC architecture based 2-D DCT and IDCT for image compression

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options