Search results

chapter

Speed Up Weather Prediction on QCT Developer Cloud: A Case Study on Knights Landing Platform

Gong-Do Hwang, Stephen Chang

2017 IEEE 4th International Conference on Cyber Security and Cloud Computing (CSCloud) > 6 - 9

2017 IEEE 4th International Conference on Cyber Security and Cloud Computing (CSCloud)

We present the direct performance measurements of two popular weather forecast models, Weather Research and Forecast Model (WRF) and Models for Predictions Across Scales (MPAS) on Intel's Knight Landing Platform (KNL). WRF is widely evaluated over different platforms while the benchmarks of MPAS are still scarce. In this study we measured the running time of WRF and MPAS on the QCT Developer Cloud,...

chapter

Optimal Algorithms for a Mesh-Connected Computer with Limited Additional Global Bandwidth

Yujie An, Quentin F. Stout

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS) > 937 - 946

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS)

We give efficient algorithms to solve fundamental data movement problems on mesh-connected computers augmented with limited global bandwidth. Adding a small amount of global bandwidth makes a practical design that combines aspects of mesh and fully connected models to achieve the benefits of each. We give algorithms for sorting, finding the median, finding a spanning tree, and determining various...

chapter

Capability Models for Manycore Memory Systems: A Case-Study with Xeon Phi KNL

Sabela Ramos, Torsten Hoefler

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS) > 297 - 306

2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS)

Increasingly complex memory systems and onchip interconnects are developed to mitigate the data movement bottlenecks in manycore processors. One example of such a complex system is the Xeon Phi KNL CPU with three different types of memory, fifteen memory configuration options, and a complex on-chip mesh network connecting up to 72 cores. Users require a detailed understanding of the performance characteristics...

chapter

Polyhedral compilation for energy efficiency

Benoit Pradelle, Muthu Baskaran, Tom Henretty, Benoit Meister, more

2016 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 7

2016 IEEE High Performance Extreme Computing Conference (HPEC)

In the last decade, the scope of software optimizations expanded to encompass energy consumption on top of the classical runtime minimization objective. In that context, several optimizations have been developed to improve the software energy efficiency. However, these optimizations commonly rely on long profiling steps and are often implemented as unstable runtime systems, which limits their applicability...

article

A Recursive Hypergraph Bipartitioning Framework for Reducing Bandwidth and Latency Costs Simultaneously

Oguz Selvitopi, Seher Acer, Cevdet Aykanat

IEEE Transactions on Parallel and Distributed Systems > 2017 > 28 > 2 > 345 - 358

Intelligent partitioning models are commonly used for efficient parallelization of irregular applications on distributed systems. These models usually aim to minimize a single communication cost metric, which is either related to communication volume or message count. However, both volume- and message-related metrics should be taken into account during partitioning for a more efficient parallelization...

chapter

An “on/off” model for energy-efficient scheduling of workflow applications in computational grids

Marek Mika

2015 20th International Conference on Methods and Models in Automation and Robotics (MMAR) > 1006 - 1009

2015 20th International Conference on Methods and Models in Automation and Robotics (MMAR )

A computational grid is a high performance computing system consisting of computer resources distributed over multiple locations and connected via computer network. One of many possible types of applications executed in computational grids is known as workflow applications. These applications consist of multiple computational tasks, which are precedence related, and usually process huge data files...

chapter

Simulation of Asynchronous Iterative Algorithms Using SimGrid

Charles Emile Ramamonjisoa, Lilia Ziane Khodja, David Laiymani, Arnaud Giersch, more

2014 IEEE Intl Conf on High Performance Computing and Communications, 2014 IEEE 6th Intl Symp on Cyberspace Safety and Security, 2014 IEEE 11th Intl Conf on Embedded Software and Syst (HPCC,CSS,ICESS) > 890 - 895

2014 IEEE International Conference on High Performance Computing and Communications (HPCC), 2014 IEEE 6th International Symposium on Cyberspace Safety and Security (CSS) and 2014 IEEE 11th International Conference on Embedded Software and Systems (ICESS)

Synchronous iterative algorithms are often less scalable than asynchronous iterative ones. Performing large scale experiments with different kind of network parameters is not easy because with supercomputers such parameters are fixed. So, one solution consists in using simulations first in order to analyze what parameters could influence or not the behavior of an algorithm. In this paper, we show...

chapter

A Communication-Optimal N-Body Algorithm for Direct Interactions

Michael Driscoll, Evangelos Georganas, Penporn Koanantakool, Edgar Solomonik, more

2013 IEEE 27th International Symposium on Parallel and Distributed Processing > 1075 - 1084

2013 IEEE International Symposium on Parallel & Distributed Processing (IPDPS)

We consider the problem of communication avoidance in computing interactions between a set of particles in scenarios with and without a cutoff radius for interaction. Our strategy, which we show to be optimal in communication, divides the work in the iteration space rather than simply dividing the particles over processors, so more than one processor may be responsible for computing updates to a single...

chapter

Perfect Strong Scaling Using No Additional Energy

James Demmel, Andrew Gearhart, Benjamin Lipshitz, Oded Schwartz

2013 IEEE 27th International Symposium on Parallel and Distributed Processing > 649 - 660

2013 IEEE International Symposium on Parallel & Distributed Processing (IPDPS)

Energy efficiency of computing devices has become a dominant area of research interest in recent years. Most previous work has focused on architectural techniques to improve power and energy efficiency, only a few consider saving energy at the algorithmic level. We prove that a region of perfect strong scaling in energy exists for matrix multiplication (classical and Strassen) and the direct n-body...

chapter

Isomorphic Recursive Splitting: Conflict-Free Memory Accesses for Structured Memory

Jacques Jorda, Abdelaziz M'zoughi

2012 41st International Conference on Parallel Processing Workshops > 574 - 580

2012 41st International Conference on Parallel Processing Workshops (ICPPW)

Data organization for matrices and arrays in memory has been extensively studied since the early 70's and until the mid 90's - the vector computers golden age. But this old SIMD model seems more topical than ever, as shown by the use of GPU in high performance computers or the architecture of the Nec SX-9. Such memory organization should then be considered again in order to access efficiently data...

chapter

Accelerated simulation of complex waveforms in nonlinear amplifiers with memory

George Stantchev, David Chernin, Thomas Antonsen, Baruch Levush

2011 IEEE MTT-S International Microwave Symposium > 1 - 4

2011 IEEE/MTT-S International Microwave Symposium - MTT 2011

We present a framework for efficient, physics-based computer simulation of complex time-dependent waveforms (i.e. wide-band, with large number of frequency components) in nonlinear amplifiers with memory. It is built upon a well established pseudo-spectral, multi-frequency, large-signal code and relies on an adaptive algorithm for signal splitting and splicing in the time domain. Included in the model,...

chapter

Scheduling Parallel Iterative Applications on Volatile Resources

Henri Casanova, Fanny Dufossé, Yves Robert, Frederic Vivien

2011 IEEE International Parallel & Distributed Processing Symposium > 1012 - 1023

2011 IEEE International Parallel & Distributed Processing Symposium (IPDPS)

In this paper we study the execution of iterative applications on volatile processors such as those found on desktop grids. We develop master-worker scheduling schemes that attempt to achieve good trade-offs between worker speed and worker availability. A key feature of our approach is that we consider a communication model where the bandwidth capacity of the master for sending application data to...

chapter

Parallel Optimisation Strategies for Fusion Codes

Adrian Jackson, Fiona Reid, Stephen Booth, Joachim Hein, more

2011 19th International Euromicro Conference on Parallel, Distributed and Network-Based Processing > 357 - 364

19th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP 2011)

We have previously documented the on-going work in the EUFORIA project to parallelise and optimise European fusion simulation codes, see. This involves working with a wide range of codes to try and address any performance and scaling issues that these codes have. However, as no two simulation codes are exactly the same, it is very hard to apply exactly the same approach to optimising a disparate range...

chapter

LogGPH: A Parallel Computational Model with Hierarchical Communication Awareness

Liang Yuan, Yunquan Zhang, Yuxin Tang, Li Rao, more

2010 13th IEEE International Conference on Computational Science and Engineering > 268 - 274

2010 IEEE 13th International Conference on Computational Science and Engineering (CSE 2010)

In large-scale cluster systems, interconnecting thousands of computing nodes increase the complexity of the network topology. Nevertheless, few existing computational models consider the impact of hierarchical communication latencies and bandwidths caused by the network complexity. In this paper we propose a new parallel computational model called LogGPH with a new parameter H incorporated into the...

chapter

An adaptive strategy for scheduling data-intensive applications in Grid environments

Wantao Liu, Rajkumar Kettimuthu, Bo Li, Ian Foster

2010 17th International Conference on Telecommunications > 642 - 649

2010 17th International Conference on Telecommunications (ICT 2010)

Data-intensive applications are becoming increasingly common in Grid environments. These applications require enormous volume of data for the computation. Most conventional meta-scheduling approaches are aimed at computation intensive application and they do not take data requirement of the applications into account, thus leading to poor performance. Efficient scheduling of data-intensive applications...

chapter

Scheduling algorithms for linear workflow optimization

K Agrawal, A Benoit, L Magnan, Y Robert

2010 IEEE International Symposium on Parallel&Distributed Processing (IPDPS) > 1 - 12

2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS)

Pipelined workflows are a popular programming paradigm for parallel applications. In these workflows, the computation is divided into several stages, and these stages are connected to each other through first-in first-out channels. In order to execute these workflows on a parallel machine, we must first determine the mapping of the stages onto the various processors on the machine. After finding the...

chapter

Three-layer optimizations for fast GMM computations on GPU-like parallel processors

K. Gupta, J.D. Owens

2009 IEEE Workshop on Automatic Speech Recognition&Understanding > 146 - 151

2009 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU 2009)

In this paper we focus on optimizing compute and memory-bandwidth-intensive GMM computations for low-end, small-form-factor devices running on GPU-like parallel processors. With special emphasis on tackling the memory bandwidth issue that is exacerbated by a lack of CPU-like caches providing temporal locality on GPU-like parallel processors, we propose modifications to three well-known GMM computation...

chapter

Design and Deployment of a Network-Aware Grid for e-Science Applications

M.A. Marchenko, D. Adami, C. Callegari, S. Giordano, more

2009 IEEE International Conference on Communications > 1 - 5

ICC 2009 - 2009 IEEE International Conference on Communications

In the last years, grid computing has emerged as a valuable service to solve complex computational problems in many scientific and industrial domains. Quality of Service (QoS) provision for these applications is therefore a key challenge for high speed Next Generation Networks and cross-layer mechanisms, enabling the development of network-aware grids, should be introduced. This paper takes into account,...

chapter

Revisiting communication performance models for computational clusters

A. Lastovetsky, V. Rychkov, M. O'Flynn

2009 IEEE International Symposium on Parallel&Distributed Processing > 1 - 11

2009 IEEE International Symposium on Parallel & Distributed Processing (IPDPS)

In this paper, we analyze restrictions of traditional models affecting the accuracy of analytical prediction of the execution time of collective communication operations. In particular, we show that the constant and variable contributions of processors and network are not fully separated in these models. Full separation of the contributions that have different nature and arise from different sources...

chapter

A Network Bandwidth-Aware Job Scheduling with Dynamic Information Model for Grid Resource Brokers

Chao-Tung Yang, Fang-Yie Leu, Wen-Jen Hu

2008 IEEE Asia-Pacific Services Computing Conference > 775 - 780

2008 IEEE Asia-Pacific Services Computing Conference (APSCC 2008)

In this paper, we propose a resource broker, which providing a friendly interface for accessing available and appropriate resources via user credentials, is developed on a platform constructed by employing the Globus toolkit. This broker not only deploys a domain-based network information model and its dynamic version to measure network status by invoking Network Weather Service (NWS) on grid computing...

INFONA - science communication portal

Search results

Speed Up Weather Prediction on QCT Developer Cloud: A Case Study on Knights Landing Platform

Optimal Algorithms for a Mesh-Connected Computer with Limited Additional Global Bandwidth

Capability Models for Manycore Memory Systems: A Case-Study with Xeon Phi KNL

Polyhedral compilation for energy efficiency

A Recursive Hypergraph Bipartitioning Framework for Reducing Bandwidth and Latency Costs Simultaneously

An “on/off” model for energy-efficient scheduling of workflow applications in computational grids

Simulation of Asynchronous Iterative Algorithms Using SimGrid

A Communication-Optimal N-Body Algorithm for Direct Interactions

Perfect Strong Scaling Using No Additional Energy

Isomorphic Recursive Splitting: Conflict-Free Memory Accesses for Structured Memory

Accelerated simulation of complex waveforms in nonlinear amplifiers with memory

Scheduling Parallel Iterative Applications on Volatile Resources

Parallel Optimisation Strategies for Fusion Codes

LogGPH: A Parallel Computational Model with Hierarchical Communication Awareness

An adaptive strategy for scheduling data-intensive applications in Grid environments

Scheduling algorithms for linear workflow optimization

Three-layer optimizations for fast GMM computations on GPU-like parallel processors

Design and Deployment of a Network-Aware Grid for e-Science Applications

Revisiting communication performance models for computational clusters

A Network Bandwidth-Aware Job Scheduling with Dynamic Information Model for Grid Resource Brokers

Filter options

Publication date

Publication type

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options