In this paper a solution is given for the problem of determining the duration of a parallel asynchronous multi-step process in a homogeneous system of combinational logic. In such a system the problem arises if the processors have different speeds and input data. If so, each processor “knows” its own data and the type of the process but in general has no information about the speed of the other processors...
Cristian proposed a probabilistic algorithm for clock synchronization. This algorithm, however, does not adapt to changing system load, and it requires knowledge of the communication delay distribution. In this paper, we present an algorithm similar to Cristian's that accommodates changes in system load. This algorithm is based on statistics of the communication delays.
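As background, the basic Cristian request/reply exchange that the proposed algorithm builds on can be sketched as follows (a minimal illustration with invented names; the paper's statistical refinement of the delay estimate is not reproduced here):

```python
def cristian_estimate(send_time, recv_time, server_time, min_delay=0.0):
    """One exchange of the basic Cristian scheme: the client records the
    round-trip time and assumes the reply travelled for roughly half of it.
    All names are illustrative, not taken from the paper."""
    rtt = recv_time - send_time
    estimated_server_now = server_time + rtt / 2.0   # server clock at reply arrival
    offset = estimated_server_now - recv_time        # correction to the local clock
    error_bound = rtt / 2.0 - min_delay              # worst-case estimation error
    return offset, error_bound
```

For example, a reply carrying server time 100.0 s that arrives 10 ms after the request was sent yields an offset estimate with a 5 ms worst-case error; the statistical variant narrows such bounds by observing many delays.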
This paper addresses parallel processing in 3D medical imaging. A SIMD architecture common for image reconstruction, image processing, and volume visualization is proposed. The ray casting algorithm for volume visualization along with some of its improvements is described. The image-space parallelization of ray casting is given. The parallelization ensures an even load balancing among processors and...
The purpose of this work is to present two novel architectures for inner product computation. The proposed architectures incorporate shift switching into the reconfigurable buses. Given two arrays of N elements, each consisting of m bits, our first architecture achieves a latency of O((log N + log m)t_a + (log N)t_b), using Nm^2 basic shift switches and m^2 adders, assuming that broadcasting on a bus takes...
Based on a novel array processor architecture, consisting of two tightly-coupled mesh-connected processing cells, a number of highly parallelizable sorting algorithms are realized by matching the data flow with the interconnection topology. The sorting algorithms chosen are the odd-even sort, bitonic sort and binary tree sort. Exploiting the modularity of these algorithms, the array implementation of small...
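The first of the chosen algorithms, odd-even (transposition) sort, alternates compare-exchange phases in a pattern that maps naturally onto an array of cells; a sequential sketch of that pattern (illustrative only, not the paper's array implementation):

```python
def odd_even_sort(a):
    """Odd-even transposition sort: n alternating compare-exchange phases.
    Even phases compare pairs (0,1),(2,3),...; odd phases compare
    (1,2),(3,4),.... Each phase is fully parallel across pairs, which is
    why the algorithm suits mesh-connected processing cells."""
    a = list(a)
    n = len(a)
    for phase in range(n):
        start = phase % 2
        for i in range(start, n - 1, 2):
            if a[i] > a[i + 1]:
                a[i], a[i + 1] = a[i + 1], a[i]  # compare-exchange step
    return a
```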
We consider the parallel solution of time-dependent partial differential equations. Because time is a one-way dimension, traditional methods attack this type of equation by solving the resulting sequence of problems in a sequential manner. Parallel solution methods retain this sequential process, obtaining their parallelism by distributing the problem at each discrete time-step. It has...
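The sequential time-marching structure described above can be sketched as follows, with `step` an illustrative placeholder for one discrete-time update (e.g. an explicit finite-difference step); the outer loop carries the one-way dependence and cannot itself be parallelized:

```python
def time_march(u0, step, n_steps):
    """Traditional sequential time-marching: each time level depends on
    the previous one, so parallelism must come from inside `step`
    (spatial distribution), not from the outer loop."""
    u = u0
    history = [u]
    for _ in range(n_steps):
        u = step(u)          # one discrete-time update
        history.append(u)
    return history
```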
In this paper we study the parallel solution of elliptic partial differential equations with the sparse grid combination technique. This new algorithmic concept is based on the independent solution of many problems with reduced size and their linear combination. The resulting algorithm can be used as a solver and within a preconditioner. We will describe the algorithm for two and three-dimensional...
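In two space dimensions the combination technique is commonly written as the following linear combination of independently computed anisotropic full-grid solutions $u_{i,j}$ (with mesh widths $2^{-i}$ and $2^{-j}$); this is a standard textbook form under my notation, not a quotation from the paper:

```latex
u^{c}_{n} \;=\; \sum_{i+j=n+1} u_{i,j} \;-\; \sum_{i+j=n} u_{i,j}, \qquad i,j \ge 1 .
```

Each $u_{i,j}$ solves a reduced-size problem on its own grid, which is what makes the subproblems independent and the method attractive for parallel solution.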
An algorithm for the numerical simulation of the fluid flow in a crystal growth process is presented. The algorithm is implemented on three parallel architectures. The performance analysis shows that special care has to be taken for the efficient realization of communication patterns on massively parallel systems.
A parallel structure of an adaptive algorithm for detecting a coherent pulse packet against a background of Gaussian narrowband noise is discussed. In the synthesis of the considered pulse packet detectors, a vector model of input radar signals is assumed. Special attention is paid to the different possible levels of parallelizing the signal processing. In order to reduce the processing time, a parallel structure...
A processor-efficient systolic algorithm for the dynamic programming approach to the knapsack problem is presented in this paper. The algorithm is implemented on a linear systolic array where the number of cells q, the cell memory storage α and the input/output requirements are design parameters. These are independent of the problem size given by the number of objects m and the knapsack capacity...
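The underlying dynamic-programming recurrence that such systolic arrays pipeline is the classical 0/1 knapsack update; a sequential sketch (illustrative only, not the paper's array mapping):

```python
def knapsack(capacity, weights, values):
    """Classical 0/1 knapsack dynamic programming:
    f[j] = best achievable value with capacity j after each object.
    The double loop over objects and capacities is the recurrence a
    linear systolic array pipelines across its cells."""
    f = [0] * (capacity + 1)
    for w, v in zip(weights, values):
        # Traverse capacities in reverse so each object is used at most once.
        for j in range(capacity, w - 1, -1):
            f[j] = max(f[j], f[j - w] + v)
    return f[capacity]
```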
A space-time mapping that describes a systolic array consists of a scheduling vector that specifies the temporal distribution and an allocation matrix that specifies the spatial distribution. The allocation matrix determines whether a variable is moving or stationary. The space-time mapping provides a complete description of the data flow for moving variables. But it fails to provide any help in handling...
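In common notation (assumed here, not quoted from the paper), writing $\lambda$ for the scheduling vector and $\Sigma$ for the allocation matrix, the space-time mapping sends each index point $v$ of the computation to a time step and a cell:

```latex
t(v) \;=\; \lambda^{\mathsf{T}} v, \qquad p(v) \;=\; \Sigma\, v .
```

A variable is stationary when $\Sigma$ maps all index points along its dependence direction to the same cell, and moving otherwise; it is for the moving case that the mapping fully determines the data flow.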
In this paper, we use a variant of the geometric method to derive efficient modular linear systolic algorithms for the transitive closure and shortest path problems. Furthermore, we show that partially-pipelined modular linear systolic algorithms with an output operation, for matrix multiplication, can be as efficient as fully-pipelined ones and in some cases need fewer cells.
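For reference, the classical dynamic-programming scheme behind systolic shortest-path arrays (and, with Boolean and/or in place of min/+, transitive closure) is the Floyd-Warshall recurrence; a sequential sketch:

```python
def shortest_paths(dist):
    """Floyd-Warshall all-pairs shortest paths:
    d[i][j] = min(d[i][j], d[i][k] + d[k][j]) for each pivot k.
    Sequential sketch of the recurrence that systolic derivations map
    onto linear arrays; `dist` holds edge weights, float('inf') for
    missing edges."""
    n = len(dist)
    d = [row[:] for row in dist]   # do not modify the input in place
    for k in range(n):
        for i in range(n):
            for j in range(n):
                if d[i][k] + d[k][j] < d[i][j]:
                    d[i][j] = d[i][k] + d[k][j]
    return d
```

Replacing (min, +) by (or, and) on a Boolean adjacency matrix yields Warshall's transitive-closure algorithm, which is why the two problems share one systolic design.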
We compare three algorithms for reducing symmetric banded matrices to tridiagonal form and evaluate their performance on the Intel iPSC/860 hypercube parallel computer. Two of these algorithms, the routines BANDR and SBTRD from the EISPACK and LAPACK libraries, respectively, are serial algorithms with little potential for coarse-grain parallelism. The third, called SBTH, is a new parallel algorithm....
The Liverpool Single-transputer library [10] was ported to the i860 as part of the Esprit Genesis project, P2702. There are approximately 250 routines in this library, including the BLAS (a well-known set of subroutines providing functions commonly used in numerical computing; see [8], [3], [2], [1]) and a set of vector routines known as the FLO routines. This strategy is expensive...
In this paper, we consider the parallel implementation of a block Cholesky factorization based on a nested dissection ordering for unstructured problems. We focus on loosely coupled networks of many processors with local memory and message passing mechanism. More precisely, we study a parallel block solver associated with refined partitions from the separator partition; the aim is to find the partition...
In this paper we present a component-wise error analysis of two different algorithms — one sequential and the other parallel — for solving triangular systems. The results show that each of the computed components of the solution vector using the parallel algorithm is an extended sum of slightly perturbed exact terms whose relative error bounds are comparable to those generated by the usual sequential...
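The usual sequential algorithm referred to above is forward (or back) substitution; a minimal sketch for the lower-triangular case, with illustrative naming (the paper's parallel algorithm differs):

```python
def forward_substitution(L, b):
    """Solve the lower-triangular system L x = b by forward substitution:
    each component x[i] is computed from b[i] minus the already-known
    terms, then divided by the diagonal entry. This is the sequential
    baseline of the component-wise error comparison."""
    n = len(b)
    x = [0.0] * n
    for i in range(n):
        s = b[i]
        for j in range(i):
            s -= L[i][j] * x[j]
        x[i] = s / L[i][i]
    return x
```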
The BBN TC2000 is a distributed-memory multiprocessor with up to 512 RISC processor nodes. The originality of the BBN TC2000 comes from its interconnection network (Butterfly switch) and from its globally addressable memory. We evaluate, in this paper, the impact of the memory hierarchy of the TC2000 on the design of algorithms for linear algebra. On shared memory multiprocessor computers, block algorithms...
A parallel homotopy algorithm is presented for finding a few selected eigenvalues (for example those with the largest real part) of Az = λBz with real, large, sparse, and nonsymmetric square matrix A and real, singular, diagonal matrix B. The essence of the homotopy method is that from the eigenpairs of Dz = λBz, we use Euler-Newton continuation to follow the eigenpairs of A(t)z = λBz with A(t) ≡...
We present two parallel algorithms for solving linear recurrence systems R⟨n,m⟩ where m is relatively small, which can be implemented simply on message-passing multiprocessors. Theorems concerning their time complexity are given, together with a criterion for when each of them should be used. If m is O(1), the algorithms are effective.
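A linear recurrence system R⟨n,m⟩ computes each unknown from the m preceding ones; a sequential sketch under that reading (the naming and exact index convention are mine, not the paper's):

```python
def solve_recurrence(c, a):
    """Sequentially evaluate an order-m linear recurrence
        x_k = c[k] + sum_{j=1..min(k,m)} a[k][j-1] * x_{k-j},
    i.e. an R<n,m> system with n = len(c). Each a[k] lists the
    coefficients of the preceding values; the serial dependence on
    x_{k-1},...,x_{k-m} is what the parallel algorithms break up."""
    x = []
    for k in range(len(c)):
        s = c[k]
        for j in range(1, min(k, len(a[k])) + 1):
            s += a[k][j - 1] * x[k - j]
        x.append(s)
    return x
```

For instance, with constant coefficients (1, 1) and start values 1, 1 this reproduces the Fibonacci numbers, an R⟨n,2⟩ system.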
We present a new factorization for band symmetric positive definite (s.p.d.) matrices which is more useful for parallel computations than the classical Cholesky decomposition method. Let A be a band s.p.d. matrix of order n and half bandwidth m, and let p = 2^k be the number of processors. We show how to factor A as A = DDtBC using approximately 4nm^2/p parallel operations, which...
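For comparison, the classical Cholesky baseline mentioned above can be sketched as follows (a dense, sequential sketch with illustrative naming; the paper targets the banded, parallel case):

```python
import math

def cholesky(A):
    """Classical Cholesky factorization A = L L^T of an s.p.d. matrix.
    The strict left-to-right, top-to-bottom dependence of the entries is
    what limits its parallelism and motivates alternative factorizations."""
    n = len(A)
    L = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = sum(L[i][k] * L[j][k] for k in range(j))
            if i == j:
                L[i][j] = math.sqrt(A[i][i] - s)   # diagonal entry
            else:
                L[i][j] = (A[i][j] - s) / L[j][j]  # below-diagonal entry
    return L
```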