Hal is a High-level Actor-based Language. Hal supports a number of communication mechanisms, local synchronization constraints, inheritance, and restricted forms of reflection. This paper discusses some issues in compiling Hal. Specifically, we describe three source-level transformations used by the compiler for Hal. Two of the transformations translate RPC-style message sending into asynchronous...
In this paper, we present a concurrent execution semantics for Parallel Program Graphs (PPGs), a general parallel program representation that includes Program Dependence Graphs (PDGs) and sequential programs. We believe that this semantics is natural to the programmer's way of thinking, and that it also provides a suitable execution model for efficient implementation on real architectures. To demonstrate...
Compilers for superscalar and VLIW processors must expose sufficient instruction-level parallelism in order to achieve high performance. Compile-time code transformations which expose instruction-level parallelism typically take into account the constraints imposed by all execution scenarios in the program. However, there are additional opportunities to increase instruction-level parallelism along the...
Hierarchical architectures attempt to provide the benefits of both VLIW/superscalar and MIMD machines by combining multiple VLIW or superscalar processors as parallel, asynchronous processors. An example of these architectures is the Intel Touchstone multicomputer. These machines provide the opportunity to execute a program in parallel at both the machine instruction level and the source statement...
We present a dynamic evaluation of the effects of data dependence analysis on the Perfect Benchmarks. We show that it is possible to measure the optimal parallelism, as defined by our model, and to compare the parallelism obtained with various data dependence tests against the optimal parallelism. We find that a variation of Banerjee's inequalities is sufficient in all cases to obtain more than half...
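For context on the test named above: the classic Banerjee bound test for a single loop can be sketched as below. This is an illustrative reconstruction of the standard test (function and parameter names are ours), not the paper's particular variation.

```python
def pos(x):
    return max(x, 0)

def neg(x):
    return max(-x, 0)

def banerjee_may_depend(a1, a0, b1, b0, L, U):
    # Tests whether A[a1*i + a0] and A[b1*i + b0] may touch the same
    # element for some iterations i, j with L <= i, j <= U.
    # A dependence requires a1*i - b1*j = b0 - a0; over the real
    # interval [L, U] the left side ranges between lo and hi, so a
    # solution can exist only if the constant lies in [lo, hi].
    c = b0 - a0
    lo = (pos(a1) * L - neg(a1) * U) - (pos(b1) * U - neg(b1) * L)
    hi = (pos(a1) * U - neg(a1) * L) - (pos(b1) * L - neg(b1) * U)
    return lo <= c <= hi

# A[i] vs A[i+1] over i in [0, 10]: dependence cannot be ruled out.
close_refs = banerjee_may_depend(1, 0, 1, 1, 0, 10)
# A[i] vs A[i+100] over i in [0, 10]: provably independent.
far_refs = banerjee_may_depend(1, 0, 1, 100, 0, 10)
```

Being a real-valued test, it can prove independence but never dependence; integer-based refinements (such as the GCD test) handle cases like `A[2*i]` vs `A[2*j+1]` that this bound cannot disprove.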
In this paper, we give experimental results of run-time partitioning of arbitrary-order loops with pointer-based structures. Arbitrary-order loops are those loops whose iterations can be executed in any order, typically iterating over a bag, list, or set. Such iterations are often used in algorithm texts for iterating over adjacency lists and other collections, and are similar to Linda's bags.
Many parallel programs require run-time support to implement the communication caused by indirect data references. In previous work, we have developed the inspector-executor paradigm to handle these cases. This paper extends that work by developing a dataflow framework to aid in placing the executor communications calls. Our dataflow analysis determines when it is safe to combine communications statements,...
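The inspector-executor paradigm referenced above can be illustrated with a minimal single-process simulation: an inspector scans the indirection array once to build a communication schedule, and an executor performs the prerecorded fetches before computing. The block-distribution, `owner`, and `ghost` names are illustrative assumptions, not the paper's API.

```python
def owner(j, block=4):
    # Block distribution: global element j lives on process j // block.
    return j // block

def inspector(idx, my_rank):
    # Inspector: scan the indirection array once and record the
    # distinct remote elements this process will need.
    return sorted({j for j in idx if owner(j) != my_rank})

def executor(global_x, idx, my_rank, schedule):
    # Executor: perform the scheduled "communication" (simulated here
    # by direct reads into a ghost buffer), then run the loop using
    # local and ghost data.
    ghost = {j: global_x[j] for j in schedule}
    out = []
    for j in idx:
        if owner(j) == my_rank:
            out.append(global_x[j])   # locally owned access
        else:
            out.append(ghost[j])      # satisfied from ghost buffer
    return out

x = [10, 20, 30, 40, 50, 60, 70, 80]   # global array, block-distributed
idx = [0, 5, 2, 6]                     # indirect references x[idx[i]]
sched = inspector(idx, my_rank=0)      # rank 0 owns elements 0..3
result = executor(x, idx, 0, sched)
```

Because the schedule is computed once, it can be reused across many executions of the same loop, which is what makes combining and hoisting the communication calls (the subject of the dataflow analysis) profitable.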
This paper describes Orca C, the first language built on the Phase Abstractions programming model. The focus is on the syntax and semantics of the data ensembles, a parallel data structuring facility to support machine-independent parallel programs. These features are compared with the data decomposition capabilities of other parallel languages, including Fortran D, Vienna Fortran, and Kali.
A compositional parallel program is a program constructed by composing component programs in parallel, where the composed program inherits properties of its components. In this paper, we describe a small extension of C++ called Compositional C++ or CC++ which is an object-oriented notation that supports compositional parallel programming. CC++ integrates different paradigms of parallel programming:...
Is the owner-computes style of parallelism, captured in a variety of data parallel languages, attractive as a paradigm for designing explicitly parallel codes? This question gives rise to a number of others. Will such use be unwieldy? Will the resulting code run well? What can such an approach offer beyond merely replicating, in a more labor intensive way, the services and coverage of data parallel...
Concurrent object-oriented programming languages are an attractive approach for programming massively-parallel machines. However, exploiting object-level concurrency is problematic as the linkage and communication overhead can overwhelm the benefits of the fine-grained concurrency. Our approach achieves efficient execution by tuning the grain size, matching the execution grain size to that efficiently...
This paper presents a static algorithm, based on standard iterative dataflow techniques, for computing per-process memory references to shared data in coarse-grained parallel programs. The algorithm constructs control flow graphs for families of processes by recognizing predicates used in control statements whose values are invariant relative to any one process, but vary across processes. It is used...
In this paper, we address the problem of supporting SPMD execution of programs that use recursively-defined dynamic data structures on distributed memory machines. The techniques developed for supporting SPMD execution of array-based programs rely on the fact that arrays are statically defined and directly addressable. As a result, these techniques do not apply to recursive data structures, which...
Considerable research on loop parallelization for shared memory multiprocessors has focused upon developing transformations for removing loop-carried dependences. In many loops, more than one such transformation is required, and hence the choice of transformations and the order in which they are applied is critical. In this paper, we present an algorithm for selecting a sequence of transformations...
This paper presents a set of compiler optimizations and their application strategies for a common class of data parallel loop nests. The arrays updated in the body of the loop nests are assumed to be partitioned into blocks (rectangular, rows, or columns) where each block is assigned to a processor. These optimizations are demonstrated in the context of a FORTRAN-90 compiler with very encouraging...
Vienna Fortran is a language extension of Fortran which provides the user with a wide range of facilities for the distribution of data structures across the processors of a distributed-memory multiprocessing machine. In contrast to current programming practice, programs in Vienna Fortran are written using global data references. Thus, the user has the advantages of a shared memory programming paradigm...
This paper presents a methodology for synthesizing parallel programs for block recursive algorithms such as fast Fourier transforms and Strassen's matrix multiplication algorithm. A block recursive algorithm is expressed as a tensor product formula which consists of matrix sums, matrix products, direct sums, tensor products, componentwise matrix operations, and stride permutations. These mathematical...
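As an illustration of how a tensor product formula specifies computation, the sketch below (our own minimal Python, not the paper's synthesis framework) applies a Kronecker product to a vector without materializing the full matrix; the independent `i` and `k` loops map directly to parallel execution.

```python
def kron_apply(A, B, x):
    # Computes y = (A ⊗ B) x directly from the definition of the
    # Kronecker product, without forming A ⊗ B explicitly.
    # A is p×q, B is r×s, x has length q*s, and y has length p*r.
    p, q = len(A), len(A[0])
    r, s = len(B), len(B[0])
    y = [0] * (p * r)
    for i in range(p):           # these two loops are independent,
        for k in range(r):       # hence trivially parallelizable
            y[i * r + k] = sum(A[i][j] * B[k][l] * x[j * s + l]
                               for j in range(q) for l in range(s))
    return y
```

Block recursive algorithms such as the FFT arise when a large transform is factored into such tensor products of small matrices, so the factorization itself encodes the parallel structure.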
In this paper we propose a loop fusion algorithm specifically designed to increase opportunities for array contraction. Array contraction is an optimization that transforms array variables into scalar variables within a loop nest. In contrast to array elements, scalar variables have better cache behavior and can be allocated to registers. In past work we investigated loop interchange and loop reversal...
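The fusion-then-contraction transformation can be shown on a toy example (the arrays and loop bodies are invented for illustration): two loops communicate through a temporary array, and after fusion each element is dead by the end of its iteration, so the array contracts to a scalar.

```python
def before(a):
    # Unfused form: t is a full temporary array carried between loops.
    n = len(a)
    t = [0] * n
    for i in range(n):       # loop 1: produce t
        t[i] = a[i] * 2
    b = [0] * n
    for i in range(n):       # loop 2: consume t
        b[i] = t[i] + 1
    return b

def after(a):
    # Fused form: each t[i] is dead after its own iteration, so the
    # array contracts to a register-allocatable scalar.
    n = len(a)
    b = [0] * n
    for i in range(n):
        t = a[i] * 2         # contracted: scalar instead of t[i]
        b[i] = t + 1
    return b
```

Fusion is legal here because the second loop reads only the value the first loop produced in the same iteration; fusing loops whose dependences cross iterations would require the interchange and reversal transformations mentioned above.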
We have designed and implemented parallel compile-time analysis algorithms based on a family of general-purpose, hybrid algorithms for data flow analysis [MR90]. Recently, we have developed optimizations that improve algorithm performance by reducing the time taken for computation, communication and dynamic scheduling. The practicality of this improved algorithm is evidenced by empirical studies of...
Although “data parallelism” has been shown to be an effective and portable way to express some types of parallel algorithms, there are many other problems for which data parallelism seems awkward and inefficient. For example, recursive decompositions and operations on irregular grids are most readily expressed using control parallelism. The problem is that control parallelism has always been associated...