Search results for: Benoit Meister

Items from 1 to 17 out of 17 results

chapter

Automatic Code Generation and Data Management for an Asynchronous Task-Based Runtime

Muthu Baskaran, Benoit Pradelle, Benoit Meister, Athanasios Konstantinidis, et al.

2016 5th Workshop on Extreme-Scale Programming Tools (ESPT) > 34 - 41

Hardware scaling and low-power considerations associated with the quest for exascale and extreme scale computing are driving system designers to consider new runtime and execution models such as the event-driven-task (EDT) models that enable more concurrency and reduce the amount of synchronization. Further, for performance, productivity, and code sustainability reasons, there is an increasing demand...

chapter

The Open Community Runtime: A runtime system for extreme scale computing

Timothy G. Mattson, Romain Cledat, Vincent Cave, Vivek Sarkar, et al.

2016 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 7

The Open Community Runtime (OCR) is a new runtime system designed to meet the needs of extreme-scale computing. While there is growing support for the idea that future execution models will be based on dynamic tasks, there is little agreement on what else should be included. OCR minimally adds events for synchronization and relocatable data-blocks for data management to form a complete system that...

chapter

Polyhedral compilation for energy efficiency

Benoit Pradelle, Muthu Baskaran, Tom Henretty, Benoit Meister, et al.

2016 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 7

In the last decade, the scope of software optimizations expanded to encompass energy consumption on top of the classical runtime minimization objective. In that context, several optimizations have been developed to improve the software energy efficiency. However, these optimizations commonly rely on long profiling steps and are often implemented as unstable runtime systems, which limits their applicability...

chapter

Scalable Hierarchical Polyhedral Compilation

Benoit Pradelle, Benoit Meister, Muthu Baskaran, Athanasios Konstantinidis, et al.

2016 45th International Conference on Parallel Processing (ICPP) > 432 - 441

Computers across the board, from embedded to future exascale computers, are consistently designed with deeper memory hierarchies. While this opens up exciting opportunities for improving software performance and energy efficiency, it also makes it increasingly difficult to efficiently exploit the hardware. Advanced compilation techniques are a possible solution to this difficult problem and, among...

chapter

An Interactive Visual Tool for Code Optimization and Parallelization Based on the Polyhedral Model

Eric Papenhausen, Klaus Mueller, M. Harper Langston, Benoit Meister, et al.

2016 45th International Conference on Parallel Processing Workshops (ICPPW) > 309 - 318

Writing high performance software requires the programmer to take advantage of multi-core processing. This can be done through tools like OpenMP, which allow the programmer to mark parallel loops. Identifying parallelizable loops, however, is a non-trivial task. Furthermore, transformations can be applied to a loop nest to expose parallelism. Polyhedral compilation has become an increasingly popular...

chapter

PUMA-V: An interactive visual tool for code optimization and parallelization based on the polyhedral model

Eric Papenhausen, Klaus Mueller, Harper Langston, Benoit Meister, et al.

2016 New York Scientific Data Summit (NYSDS) > 1 - 4

Taking advantage of multi-core processing has become crucial in realizing significant performance gains for most applications. When it comes to performance optimization, this has led to a delicate balancing act between parallelism and locality. Furthermore, exposing parallelism can require some non-trivial transformations. Although tools exist to automatically identify good transformations, a user...

chapter

Data Sequence Locality: A Generalization of Temporal Locality

Vincent Loechner, Benoît Meister, Philippe Clauss

Lecture Notes in Computer Science > Euro-Par 2001 Parallel Processing > Topic 4 > 262-272

A significant source for enhancing application performance and for reducing power consumption in embedded processor applications is to improve the usage of the memory hierarchy. Such objective classically translates into optimizing spatial and temporal data locality especially for nested loops. In this paper, we focus on temporal data locality. Unlike many existing methods, our approach pays special...

chapter

Periodic Polyhedra

Benoît Meister

Lecture Notes in Computer Science > Compiler Construction > Loop Analysis > 134-149

This paper presents a new method for computing the integer hull of a parameterized rational polyhedron by introducing the concept of periodic polyhedron. Besides concerning generally parametric combinatorial optimization, the method has many applications for the analysis, optimization and parallelization of loop nests, especially in compilers.

chapter

Automatic cluster parallelization and minimizing communication via selective data replication

Sanket Tavarageri, Benoit Meister, Muthu Baskaran, Benoit Pradelle, et al.

2015 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 7

The technology scaling has initiated two distinct trends that are likely to continue into future: first, the increased parallelism in hardware and second, the increasing performance and energy cost of communication relative to computation. Both of the above trends call for development of compiler and runtime systems to automatically parallelize programs and reduce communication in parallel computations...

chapter

Optimization of symmetric tensor computations

Jonathon Cai, Muthu Baskaran, Benoit Meister, Richard Lethin

2015 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 7

For applications that deal with large amounts of high dimensional multi-aspect data, it is natural to represent such data as tensors or multi-way arrays. Tensor computations, such as tensor decompositions, are increasingly being used to extract and explain properties of such data. An important class of tensors is the symmetric tensor, which shows up in real-world applications such as signal processing,...

chapter

Polyhedral user mapping and assistant visualizer tool for the R-Stream auto-parallelizing compiler

Eric Papenhausen, Bing Wang, M. Harper Langston, Muthu Baskaran, et al.

2015 IEEE 3rd Working Conference on Software Visualization (VISSOFT) > 180 - 184

Existing high-level, source-to-source compilers can accept input programs in a high-level language (e.g., C) and perform complex automatic parallelization and other mappings using various optimizations. These optimizations often require trade-offs and can benefit from the user's involvement in the process. However, because of the inherent complexity, the barrier to entry for new users of these high-level...

chapter

ACDT: Architected Composite Data Types trading-in unfettered data access for improved execution

Andres Marquez, Joseph Manzano, Shuaiwen Leon Song, Benoit Meister, et al.

2014 20th IEEE International Conference on Parallel and Distributed Systems (ICPADS) > 289 - 297

With Exascale performance and its challenges in mind, one ubiquitous concern among architects is energy efficiency. Petascale systems projected to Exascale systems are unsustainable at current power consumption rates. One major contributor to system-wide power consumption is the number of memory operations leading to data movement and management techniques applied by the runtime system. To address...

chapter

Low-overhead load-balanced scheduling for sparse tensor computations

Muthu Baskaran, Benoit Meister, Richard Lethin

2014 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 6

Irregular computations over large-scale sparse data are prevalent in critical data applications and they have significant room for improvement on modern computer systems from the aspects of parallelism and data locality. We introduce new techniques to efficiently map large irregular computations onto modern multi-core systems with non-uniform memory access (NUMA) behavior. Our techniques are broadly...

chapter

Re-Introduction of communication-avoiding FMM-accelerated FFTs with GPU acceleration

M. Harper Langston, Muthu Baskaran, Benoit Meister, Nicolas Vasilache, et al.

2013 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 6

As distributed memory systems grow larger, communication demands have increased. Unfortunately, while the costs of arithmetic operations continue to decrease rapidly, communication costs have not. As a result, there has been a growing interest in communication-avoiding algorithms for some of the classic problems in numerical computing, including communication-avoiding Fast Fourier Transforms (FFTs)...

chapter

Runnemede: An architecture for Ubiquitous High-Performance Computing

Nicholas P. Carter, Aditya Agrawal, Shekhar Borkar, Romain Cledat, et al.

2013 IEEE 19th International Symposium on High Performance Computer Architecture (HPCA) > 198 - 209

DARPA's Ubiquitous High-Performance Computing (UHPC) program asked researchers to develop computing systems capable of achieving energy efficiencies of 50 GOPS/Watt, assuming 2018-era fabrication technologies. This paper describes Runnemede, the research architecture developed by the Intel-led UHPC team. Runnemede is being developed through a co-design process that considers the hardware, the runtime/OS,...

chapter

Efficient and scalable computations with sparse tensors

Muthu Baskaran, Benoit Meister, Nicolas Vasilache, Richard Lethin

2012 IEEE Conference on High Performance Extreme Computing (HPEC) > 1 - 6

For applications that deal with large amounts of high dimensional multi-aspect data, it becomes natural to represent such data as tensors or multi-way arrays. Multi-linear algebraic computations such as tensor decompositions are performed for summarization and analysis of such data. Their use in real-world applications can span across domains such as signal processing, data mining, computer vision,...

article

Precise Data Locality Optimization of Nested Loops

Vincent Loechner, Benoît Meister, Philippe Clauss

The Journal of Supercomputing > 2002 > 21 > 1 > 37-76

A significant source for enhancing application performance and for reducing power consumption in embedded processor applications is to improve the usage of the memory hierarchy. In this paper, a temporal and spatial locality optimization framework of nested loops is proposed, driven by parameterized cost functions. The considered loops can be imperfectly nested. New data layouts are propagated through...
