Since new technologies such as big data and cloud computing require a tremendous number of transactions between processors and memory, a new memory architecture called Processing in Memory (PIM) has been suggested as a solution for these memory-intensive applications. To let software utilize the new architecture, a development environment with a tool chain and debug infrastructure...
With the onset of multi- and many-core chips, the single-core market is closing down. These chips constitute a new challenge for aerospace and safety-critical industries in general. Little is known about the certification of software running on such systems. There is therefore a strong need to develop software architectures that exploit multi-core processors yet remain compliant with safety-criticality...
We propose three methods for reducing power consumption in high-performance FPGAs (field programmable gate arrays). We show that by using continuous hierarchy memory, lightweight checks, and lower chip voltage for near-threshold voltage computation, we can both reduce power consumption and increase reliability without a decrease in throughput. We have implemented these techniques in two different,...
Programming models like CUDA, OpenMP, OpenACC and OpenCL are designed to offload compute-intensive workloads to accelerators efficiently. However, the naive offload model, which copies and executes synchronously in sequence, requires extensive hand-tuning with techniques such as pipelining to overlap computation and communication. Therefore, we propose an easy-to-use, directive-based pipelining extension...
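The pipelining this abstract refers to can be illustrated in a language-agnostic way: while chunk i is being computed, the copy of chunk i+1 is already in flight. The sketch below simulates this with a background "copier" thread; `copy_to_device` and `compute` are illustrative stand-ins for a DMA transfer and a device kernel, not any real runtime's API.

```python
# Sketch of software pipelining for accelerator offload: overlap the
# "copy" of chunk i+1 with the "compute" of chunk i.
# copy_to_device and compute are illustrative stand-ins.
from concurrent.futures import ThreadPoolExecutor

def copy_to_device(chunk):          # stand-in for a host-to-device transfer
    return list(chunk)              # pretend copy

def compute(dev_chunk):             # stand-in for a device kernel
    return sum(x * x for x in dev_chunk)

def pipelined_offload(chunks):
    if not chunks:
        return []
    results = []
    with ThreadPoolExecutor(max_workers=1) as copier:
        pending = copier.submit(copy_to_device, chunks[0])
        for i in range(len(chunks)):
            dev = pending.result()                  # wait for copy of chunk i
            if i + 1 < len(chunks):                 # start copying chunk i+1 now
                pending = copier.submit(copy_to_device, chunks[i + 1])
            results.append(compute(dev))            # compute overlaps the next copy
    return results

print(pipelined_offload([[1, 2], [3, 4], [5, 6]]))  # → [5, 25, 61]
```

The result is identical to a purely sequential copy-then-compute loop; only the schedule changes, which is why such an extension can be exposed as a directive rather than a code rewrite.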
The non-equispaced fast Fourier transform (NFFT) has attracted significant interest for its applications in tomography and remote sensing, where visualization and image reconstruction require non-equispaced data. Here we present an efficient implementation of a high-accuracy NFFT on an NVIDIA GPU (Graphics Processing Unit). We focused on the convolution step in the computation of the NFFT, since it is the most...
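The convolution step mentioned here is often called "gridding": each non-equispaced sample is spread onto a few nearby uniform grid points, weighted by a window function, so that an ordinary FFT can follow. A toy pure-Python sketch with a Gaussian window (the window choice, `spread`, and `sigma` are illustrative assumptions, not the paper's parameters):

```python
import math

def gridding(samples, n_grid, spread=3, sigma=1.0):
    """Convolution (gridding) step of an NFFT-style transform: spread
    each non-equispaced sample (position x in [0,1), value v) onto the
    uniform grid points within `spread` of it, weighted by a Gaussian
    window, with periodic wrap-around. Toy sketch only."""
    grid = [0.0] * n_grid
    for x, v in samples:
        center = int(round(x * n_grid))
        for m in range(center - spread, center + spread + 1):
            d = x * n_grid - m                       # distance in grid units
            w = math.exp(-d * d / (2 * sigma * sigma))
            grid[m % n_grid] += v * w                # accumulate at wrapped index
    return grid

g = gridding([(0.1, 1.0), (0.52, 2.0)], n_grid=16)
```

Each sample touches only O(spread) grid points, which is why this step dominates the runtime and parallelizes well per-sample on a GPU (modulo the atomic accumulation into shared grid cells).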
Despite its popularity, deploying Convolutional Neural Networks (CNNs) on a portable system is still challenging due to large data volume, intensive computation and frequent memory access. Although previous FPGA acceleration schemes generated by high-level synthesis tools (e.g., HLS, OpenCL) have allowed for fast design optimization, hardware inefficiency still exists when allocating FPGA resources...
Memory deduplication improves memory density by merging identical memory pages in multi-tenant clouds. However, memory deduplication is vulnerable to memory disclosure attacks and covert channel attacks. The covert channel is based on the difference in write access time on deduplicated memory pages that are re-created by the copy-on-write (CoW) technique. Prior works have shown that malicious attackers can make...
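The timing difference the abstract describes is the whole channel: a write to a deduplicated page must first break the copy-on-write sharing, so it is much slower than a write to an ordinary page. The receiver's side reduces to classifying write latencies against a threshold. The sketch below uses synthetic latencies and an assumed threshold purely to show the decoding logic; a real attack would time actual page writes.

```python
def decode_bits(write_latencies_ns, threshold_ns=10000):
    """Receiver side of a dedup-based covert channel (conceptual):
    a write that breaks copy-on-write sharing is slow, so a high
    write latency decodes as bit 1 and a low one as bit 0.
    Latencies and threshold here are synthetic/illustrative."""
    return [1 if t > threshold_ns else 0 for t in write_latencies_ns]

# synthetic measurements: CoW-breaking writes take tens of microseconds,
# ordinary writes a few hundred nanoseconds
latencies = [250, 42000, 38000, 300, 41000]
print(decode_bits(latencies))   # → [0, 1, 1, 0, 1]
```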
This paper introduces a software policy for memory management in heterogeneous memory systems in order to improve the trade-offs between performance and power consumption, while attempting to make the best use of different characteristics of the underlying memory technologies. In this policy, the operating system and the application co-schedule page management in order to make informed decisions about...
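One minimal form such an informed decision can take: the application reports per-page access counts, the OS knows the fast tier's capacity, and the policy places the hottest pages in fast memory. The sketch below is an illustrative toy, not the paper's actual policy; all names and the tier labels are assumptions.

```python
def place_pages(access_counts, fast_capacity):
    """Toy heterogeneous-memory placement policy: put the most
    frequently accessed pages in the fast tier (e.g. DRAM/HBM)
    and the remainder in the slow tier (e.g. NVM).
    Illustrative sketch only, not the paper's policy."""
    ranked = sorted(access_counts, key=access_counts.get, reverse=True)
    fast = set(ranked[:fast_capacity])
    slow = set(ranked[fast_capacity:])
    return fast, slow

fast, slow = place_pages({"p0": 90, "p1": 5, "p2": 40, "p3": 1},
                         fast_capacity=2)
# the two hottest pages, p0 and p2, land in the fast tier
```

The point of OS/application co-scheduling is exactly that neither side alone has both inputs: the application knows the access pattern, the OS knows the capacity and power budget.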
In this paper, we advocate the use of code polymorphism as an efficient means to improve security at several levels in electronic devices. We analyse the threats that polymorphism could help thwart, and present the solution that we plan to demonstrate in the scope of a collaborative research project called COGITO. We expect our solution to be effective at improving security, to comply with the computing...
In this paper, we propose an incremental kernel non-negative matrix factorization (IKNMF) to reduce the computing scale in hyperspectral unmixing. Kernel non-negative matrix factorization (KNMF) extends non-negative matrix factorization (NMF) to capture nonlinear dependency features in the data matrix through kernel functions. In the KNMF algorithm, the size of the kernel matrices is closely associated...
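The scale problem is visible from the kernel matrix itself: KNMF factorizes an n-by-n matrix of pairwise kernel evaluations, so its cost grows quadratically with the number of samples, which is what an incremental variant tries to avoid recomputing. A pure-Python sketch with an RBF kernel (the kernel choice and `gamma` are illustrative assumptions):

```python
import math

def rbf_kernel_matrix(X, gamma=0.5):
    """Build the n-by-n RBF kernel matrix
    K[i][j] = exp(-gamma * ||x_i - x_j||^2)
    of the kind KNMF factorizes. Illustrative sketch; the kernel
    and gamma are assumptions, not the paper's choices."""
    n = len(X)
    K = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            d2 = sum((a - b) ** 2 for a, b in zip(X[i], X[j]))
            K[i][j] = math.exp(-gamma * d2)
    return K

K = rbf_kernel_matrix([[0.0, 0.0], [1.0, 0.0], [0.0, 2.0]])
# K is symmetric with unit diagonal; its n*n entries are the cost driver
```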
In the presence of known and unknown vulnerabilities in program code and control flow, virtual-machine-like isolation and sandboxing, which confine a process's malicious behaviour by monitoring and controlling the untrusted application, are an effective strategy. A confined malicious application cannot affect system resources or other applications running on the same operating system. But present...
In the new era of cyber-physical systems, software must adapt itself to ever-changing environmental conditions and situations. This is currently not reflected in the design of embedded operating systems, since they are primarily optimized for fixed usage scenarios with tight resource constraints. We discuss the idea of interpreted operating system kernels, which can form a new foundation for highly...
The survivability of the OS is very important for the whole system because the OS is the foundation of any information or network system. Based on an analysis of the resources, services and functions of the OS, and owing to the particularity of OS survivability, this paper proposes the concept of an integrity running environment (IRE) and then puts forward a new definition, namely that OS survivability is that...
There are multiple approaches to SLAM, but we found that the ones implemented in ROS had problems when a robot drove over small obstacles. This paper proposes making SLAM more robust by running three SLAM methods in parallel and using their information to produce a better estimate of the robot's surroundings. The proposed method defines its output by making the three methods vote for...
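One simple form the voting among three SLAM outputs could take is a per-cell majority vote over occupancy grids: a cell is marked occupied only if at least two of the three methods agree. This is a sketch of the voting idea only; the paper's actual fusion rule may differ.

```python
def vote_maps(map_a, map_b, map_c):
    """Fuse three occupancy grids by per-cell majority vote
    (1 = occupied, 0 = free). Illustrative sketch of the voting
    idea, not necessarily the paper's exact rule."""
    return [1 if a + b + c >= 2 else 0
            for a, b, c in zip(map_a, map_b, map_c)]

# one SLAM method misreads a small obstacle (cell 2 of map_b);
# the majority vote masks the error
fused = vote_maps([1, 0, 1, 0],
                  [1, 0, 0, 0],
                  [1, 1, 1, 0])
print(fused)   # → [1, 0, 1, 0]
```

The appeal of voting is that an error must occur in at least two of the three independent methods before it reaches the fused map.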
The recent advent of stacked memory devices has led to a resurgence of research associated with the fundamental memory hierarchy and the associated memory pipeline. The bandwidth advantages provided by stacked logic and DRAM devices have inspired research associated with eliminating the bandwidth bottlenecks associated with many applications in high performance computing. Further, recent efforts have focused...
High utilization of hardware resources is the key to designing performance- and power-optimized GPU applications. The efficiency of applications and kernels that do not fully utilize the GPU resources can be improved through concurrent execution with independent kernels and/or applications. Hyper-Q enables multiple CPU threads or processes to launch work on a single GPU simultaneously for increased...
In the field of embedded vision systems, meeting the constraints on design criteria such as performance, area, and power consumption can be a real challenge. In fact, to alleviate the well-known "Memory Wall", it is mandatory to provide efficient memory hierarchies so that the system to be designed reaches usable performance when it has to handle non-linear image operations. To address this problem,...
Traditional PC-based operating systems load most of their components during the boot process along with the kernel. This mechanism, though effective for a broad objective, is seldom fully utilized by the majority of users, as they usually perform a specific job that does not require every component of the OS. It has been observed that operating systems designed with the nature of the job in mind,...
With the ever-increasing demand for interactive data analytics, latency becomes more important for big data frameworks. We present our preliminary experience designing and implementing NetSpark, an improved Spark [1] framework that is highly optimized for network latency. Combining optimizations in data serialization and network buffer management with hardware-supported Remote Direct Memory Access (RDMA)...
Effective use of the memory hierarchy is crucial to cloud computing. Platform memory subsystems must be carefully provisioned and configured to minimize overall cost and energy for cloud providers. For cloud subscribers, the diversity of available platforms complicates comparisons and the optimization of performance. To address these needs, we present X-Mem, a new open-source software tool that characterizes...