Search results

chapter

Quantum Accelerators for High-Performance Computing Systems

Keith A. Britt, Fahd A. Mohiyaddin, Travis S. Humble

2017 IEEE International Conference on Rebooting Computing (ICRC) > 1 - 7

2017 IEEE International Conference on Rebooting Computing (ICRC)

We define some of the programming and system-level challenges facing the application of quantum processing to high-performance computing. Alongside barriers to physical integration, prominent differences in the execution of quantum and conventional programs challenges the intersection of these computational models. Following a brief overview of the state of the art, we discuss recent advances in programming...

chapter

Accessing CUDA Features in the OpenGL Rendering Pipeline: A Case Study Using N-Body Simulation

Mario Santos Camillo, Wu Shin-Ting

2017 30th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI) > 315 - 322

2017 30th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI)

The advances of the graphics programing unit (GPU) architecture and its rapidly evolving towards general purpose GPU make a series of applications adopt a general purpose (GPGPU) and a graphics computing interoperability approach in which the first is used for heavy calculations and the second for 3D graphics rendering. Because GPGPU exposes several hardware features, such as shared memory and thread...

chapter

SharP: Towards Programming Extreme-Scale Systems with Hierarchical Heterogeneous Memory

Manjunath Gorentla Venkata, Ferrol Aderholdt, Zachary Parchman

2017 46th International Conference on Parallel Processing Workshops (ICPPW) > 145 - 154

2017 46th International Conference on Parallel Processing Workshops (ICPPW)

The pre-exascale systems are expected to have a significant amount of hierarchical and heterogeneous on-node memory, and this trend of system architecture in extreme-scale systems is expected to continue into the exascale era. Along with hierarchical-heterogeneous memory, the system typically has a high-performing network and a compute accelerator. This system architecture is not only effective for...

chapter

Programming Model to Develop Supercomputer Combinatorial Solvers

Ghaith Tarawneh, Andrey Mokhov, Matthew Naylor, Alex Rast, more

2017 46th International Conference on Parallel Processing Workshops (ICPPW) > 171 - 179

2017 46th International Conference on Parallel Processing Workshops (ICPPW)

Novel architectures for massively parallel machines offer better scalability and the prospect of achieving linear speedup for sizable problems in many domains. The development of suitable programming models and accompanying software tools for these architectures remains one of the biggest challenges towards exploiting their full potential. We present a multi-layer software abstraction model to develop...

article

NARMAX Self-Tuning Controller for Line-of-Sight-Based Waypoint Tracking for an Autonomous Underwater Vehicle

Raja Rout, Bidyadhar Subudhi

IEEE Transactions on Control Systems Technology > 2017 > 25 > 4 > 1529 - 1536

In this brief, a constrained self-tuning controller (CSTC) is developed for the heading and diving motions of an autonomous underwater vehicle (AUV) considering the parameter variation and practical realization of the algorithm. Parameters in the AUV dynamics may vary due to change in payload or physical structure. A Nonlinear Auto-Regressive Moving Average eXogenous (NARMAX) model is designed using...

chapter

Energy Efficiency Optimization of Task-Parallel Codes on Asymmetric Architectures

Luis Costero, Francisco D. Igual, Katzalin Olcoz, Francisco Tirado

2017 International Conference on High Performance Computing & Simulation (HPCS) > 402 - 409

2017 International Conference on High Performance Computing & Simulation (HPCS)

We present a family of policies that, integrated within a runtime task scheduler (Nanox), pursue the goal of improving the energy efficiency of task-parallel executions with no intervention from the programmer. The proposed policies tackle the problem by modifying the core operating frequency via DVFS mechanisms, or by enabling/disabling the mapping of tasks to specific cores at selected execution...

chapter

Self-Aware Context in Smart Home Pervasive Platforms

Philippe Lalanda, Eva Gerber-Gaillard, Stephanie Chollet

2017 IEEE International Conference on Autonomic Computing (ICAC) > 119 - 124

2017 IEEE International Conference on Autonomic Computing (ICAC)

Pervasive computing envisions environments where computers are blended into everyday objects in order to provide added-value services to people. A growing number of advanced embedded systems, extended with computing and communication capabilities, are already appearing around us. However, pervasive applications raise major challenges in terms of software engineering and remain hard to develop, deploy,...

chapter

PFSI.sw: A programming framework for sea ice model algorithms based on Sunway many-core processor

Binyang Li, Bo Li, Depei Qian

2017 IEEE 28th International Conference on Application-specific Systems, Architectures and Processors (ASAP) > 119 - 126

2017 IEEE 28th International Conference on Application-specific Systems, Architectures and Processors (ASAP)

Sea ice model is a typical high performance computing problem. CPU and GPU based parallel method has been proposed to accelerate the simulation process, but it is still hard to meet the large-scale calculation demand due to the compute-intensive nature of the model. Sunway TaihuLight supercomputer use the SW26010 processor as its computing unit and achieves high performance for large-scale scientific...

chapter

Modelling of systems for real

Joachim Fischer, Birger Moller-Pedersen, Andreas Prinz

2016 4th International Conference on Model-Driven Engineering and Software Development (MODELSWARD) > 427 - 434

2016 4th International Conference on Model-Driven Engineering and Software Development (MODELSWARD)

Modelling and Programming are often used together in system development. However, typically there is a large difference between the handling of modelling parts and the handling of programming parts. This leads to the fact that the transition between the two is not easy, and important information is lost as well as extra information has to be provided when combining modelling and programming. This...

chapter

Medical Imaging Processing on a Big Data Platform Using Python: Experiences with Heterogeneous and Homogeneous Architectures

Estefania Serrano, Javier Garcia Blas, Jesus Carretero, Monica Abella, more

2017 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID) > 830 - 837

2017 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGRID)

The apparition of new paradigms, programming models, and languages that offer better programmability and better performance turns the implementation of current scientific applications into a less time-consuming task than years ago. One significant example of this trend is the MapReduce programming model and its implementation using Apache Spark. Nowadays, this programming model is mainly used for...

chapter

One-Way Wave Equation Migration at Scale on GPUs Using Directive Based Programming

Kshitij Mehta, Maxime Hugues, Oscar Hernandez, David E. Bernholdt, more

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS) > 224 - 233

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS)

One-Way Wave Equation Migration (OWEM) is a depth migration algorithm used for seismic imaging. A parallel version of this algorithm is widely implemented using MPI. Heterogenous architectures that use GPUs have become popular in the Top 500 because of their performance/power ratio. In this paper, we discuss the methodology and code transformations used to port OWEM to GPUs using OpenACC, along with...

chapter

Exploring Translation of OpenMP to OpenACC 2.5: Lessons Learned

Sergio Pino, Lori Pollock, Sunita Chandrasekaran

2017 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) > 673 - 682

2017 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)

Scientists who want to exploit the computing power of the latest parallel architectures are faced with a diverse set of architectures and a number of programming languages, models and approaches. Among several such programming techniques are directive-based programming models, OpenMP and OpenACC. This paper explores the similarities and the functionality gaps between both models and presents insights...

chapter

Coordinating Proactive Social Devices in a Mobile Cloud: Lessons Learned and a Way Forward

Niko Makitalo, Timo Aaltonen, Tommi Mikkonen

2016 IEEE/ACM International Conference on Mobile Software Engineering and Systems (MOBILESoft) > 179 - 188

2016 IEEE/ACM International Conference on Mobile Software Engineering and Systems (MOBILESoft)

Recent device shipment trends strongly indicate that the number of Web-enabled devices other than PCs and smart phones are growing rapidly. Marking the end of the dominant era of these two traditional device categories, people will soon commonly use various types of Internet-connected devices in their daily lives, where no single device will dominate. Since today's devices are mostly standalone and...

chapter

User-friendly interface for GPGPU programming

Hasindu Gamaarachchi, Mohamed Fawsan, Fathima Fasna, Dhammika Elkaduwe

2017 6th National Conference on Technology and Management (NCTM) > 99 - 104

2017 6th National Conference on Technology and Management (NCTM)

Compute Unified Device Architecture (CUDA) is an attractive alternative for our ever growing need for high performance computing. However to extract the full potential of CUDA one should, at the least be familiar with the programming model and should have a fair understanding of the memory and the cache architecture. Yet most of the domain experts from domains that warrant high performance computing...

chapter

Programming agent-based demographic models with cross-state and message-exchange dependencies: A study with speculative PDES and automatic load-sharing

Alessandro Pellegrini, Francesco Quaglia, Cristina Montanola-Sales, Josep Casanovas-Garca

2016 Winter Simulation Conference (WSC) > 955 - 966

2016 Winter Simulation Conference (WSC)

Agent-based modeling and simulation is a versatile and promising methodology to capture complex interactions among entities and their surrounding environment. A great advantage is its ability to model phenomena at a macro scale by exploiting simpler descriptions at a micro level. It has been proven effective in many fields, and it is rapidly becoming a de-facto standard in the study of population...

chapter

MOOC architecture model for computer programming courses

Budi Yulianto, Harjanto Prabowo, Raymond Kosala, Manik Hapsara

2016 International Conference on Information Management and Technology (ICIMTech) > 35 - 40

2016 International Conference on Information Management and Technology (ICIMTech)

Computer programming courses are major contributor to non-passing rates and teaching the subject is significantly challenging. MOOC (Massive Open Online Course), a web-based LMS providing online courses, is a new popular learning media expected to facilitate better teaching-learning process for improving learning outcomes. Unfortunately, most of the current MOOCs are intended only for non-vocational...

chapter

Elastic Multi-resource Fairness: Balancing Fairness and Efficiency in Coupled CPU-GPU Architectures

Shanjiang Tang, BingSheng He, Shuhao Zhang, Zhaojie Niu

SC16: International Conference for High Performance Computing, Networking, Storage and Analysis > 875 - 886

SC16: International Conference for High Performance Computing, Networking, Storage and Analysis

Fairness and efficiency are two important concerns for users in a shared computer system, and there tends to be a tradeoff between them. Heterogeneous computing poses new challenging issues on the fair allocation of computational resources among users due to the availability of different kinds of computing devices (e.g., CPU and GPU). Prior work either considers the fair resource allocation separately...

chapter

Towards Achieving Performance Portability Using Directives for Accelerators

M. Graham Lopez, Veronica Vergara Larrea, Wayne Joubert, Oscar Hernandez, more

2016 Third Workshop on Accelerator Programming Using Directives (WACCPD) > 13 - 24

2016 Third Workshop on Accelerator Programming Using Directives (WACCPD)

In this paper we explore the performance portability of directives provided by OpenMP 4 and OpenACC to program various types of node architectures with attached accelerators, both self-hosted multicore and offload multicore/GPU. Our goal is to examine how successful OpenACC and the newer offload features of OpenMP 4.5 are for moving codes between architectures, how much tuning might be required and...

chapter

Automatic Code Generation and Data Management for an Asynchronous Task-Based Runtime

Muthu Baskaran, Benoit Pradelle, Benoit Meister, Athanasios Konstantinidis, more

2016 5th Workshop on Extreme-Scale Programming Tools (ESPT) > 34 - 41

2016 5th Workshop on Extreme-Scale Programming Tools (ESPT)

Hardware scaling and low-power considerations associated with the quest for exascale and extreme scale computing are driving system designers to consider new runtime and execution models such as the event-driven-task (EDT) models that enable more concurrency and reduce the amount of synchronization. Further, for performance, productivity, and code sustainability reasons, there is an increasing demand...

chapter

Rethinking operating systems for rebooted computing

Phil Laplante, Dejan Milojicic

2016 IEEE International Conference on Rebooting Computing (ICRC) > 1 - 8

2016 IEEE International Conference on Rebooting Computing (ICRC)

as the deceleration of processor scaling due to Moore's law accelerates research in new types of computing structures, the need arises for rethinking operating systems paradigms. Traditionally, an operating system is a layer between hardware and applications and its primary function is in managing hardware resources and providing a common abstraction to applications. How does this function apply,...

INFONA - science communication portal

Search results

Quantum Accelerators for High-Performance Computing Systems

Accessing CUDA Features in the OpenGL Rendering Pipeline: A Case Study Using N-Body Simulation

SharP: Towards Programming Extreme-Scale Systems with Hierarchical Heterogeneous Memory

Programming Model to Develop Supercomputer Combinatorial Solvers

NARMAX Self-Tuning Controller for Line-of-Sight-Based Waypoint Tracking for an Autonomous Underwater Vehicle

Energy Efficiency Optimization of Task-Parallel Codes on Asymmetric Architectures

Self-Aware Context in Smart Home Pervasive Platforms

PFSI.sw: A programming framework for sea ice model algorithms based on Sunway many-core processor

Modelling of systems for real

Medical Imaging Processing on a Big Data Platform Using Python: Experiences with Heterogeneous and Homogeneous Architectures

One-Way Wave Equation Migration at Scale on GPUs Using Directive Based Programming

Exploring Translation of OpenMP to OpenACC 2.5: Lessons Learned

Coordinating Proactive Social Devices in a Mobile Cloud: Lessons Learned and a Way Forward

User-friendly interface for GPGPU programming

Programming agent-based demographic models with cross-state and message-exchange dependencies: A study with speculative PDES and automatic load-sharing

MOOC architecture model for computer programming courses

Elastic Multi-resource Fairness: Balancing Fairness and Efficiency in Coupled CPU-GPU Architectures

Towards Achieving Performance Portability Using Directives for Accelerators

Automatic Code Generation and Data Management for an Asynchronous Task-Based Runtime

Rethinking operating systems for rebooted computing

Filter options

Publication date

Publication type

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options