Binary Decision Diagrams (BDDs) are used extensively in VLSI CAD for verification, synthesis, logic minimization and testing. Parallel algorithms for Boolean Function Manipulation using BDDs have been proposed and implemented on a Connection Machine (CM-5). Abstractions have been developed to support the design of these algorithms using the message passing model of parallel programming. A Distributed...
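For readers unfamiliar with the data structure behind the abstract above, the following is a minimal C sketch of a standard reduced ordered BDD node representation; it is illustrative only and does not reproduce the paper's parallel algorithms or its Connection Machine implementation.

```c
#include <stdio.h>
#include <stdlib.h>

/* Minimal ROBDD node representation -- an illustrative sketch only, not the
 * data structure used in the paper. Terminals use var == -1 and a 0/1 value. */
typedef struct bdd_node {
    int var;                   /* variable index; -1 for a terminal */
    int value;                 /* 0 or 1, meaningful only for terminals */
    struct bdd_node *low;      /* cofactor with var = 0 */
    struct bdd_node *high;     /* cofactor with var = 1 */
} bdd_node;

static bdd_node bdd_false = { -1, 0, NULL, NULL };
static bdd_node bdd_true  = { -1, 1, NULL, NULL };

/* Create a node, applying the reduction rule that removes redundant tests.
 * A full package would also hash-cons nodes so isomorphic subgraphs are shared. */
static bdd_node *mk(int var, bdd_node *low, bdd_node *high)
{
    if (low == high)
        return low;
    bdd_node *n = malloc(sizeof *n);
    n->var = var; n->value = 0; n->low = low; n->high = high;
    return n;
}

int main(void)
{
    /* BDD for x0 AND x1 with variable order x0 < x1 */
    bdd_node *x1 = mk(1, &bdd_false, &bdd_true);
    bdd_node *f  = mk(0, &bdd_false, x1);
    printf("root tests x%d\n", f->var);
    return 0;
}
```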
S3.mp (Sun's Scalable Shared memory MultiProcessor) is a research project to demonstrate a low overhead, high throughput communication system that is based on cache coherent distributed shared memory (DSM). S3.mp uses distributed directories and point-to-point messages that are sent over a packet switched interconnect fabric to achieve scalability over a wide range of configurations. S3.mp uses a...
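As a generic illustration of the distributed-directory idea mentioned above, a full-map directory tracks, for each memory line, which nodes hold a copy; the sketch below is a conventional textbook form, not the actual S3.mp directory format or protocol.

```c
#include <stdint.h>
#include <stdio.h>

/* Generic full-map directory entry for one memory line -- an illustrative
 * sketch, not the S3.mp directory format. */
#define MAX_NODES 64

typedef enum { DIR_UNCACHED, DIR_SHARED, DIR_EXCLUSIVE } dir_state;

typedef struct {
    dir_state state;    /* coherence state of the line */
    uint64_t  sharers;  /* one presence bit per node holding a copy */
    int       owner;    /* owning node when state == DIR_EXCLUSIVE */
} dir_entry;

/* On a write miss the home node consults the entry and sends a point-to-point
 * invalidation to each node whose presence bit is set; here we just count them. */
static int invalidations_needed(const dir_entry *e, int writer)
{
    int count = 0;
    for (int node = 0; node < MAX_NODES; node++)
        if (((e->sharers >> node) & 1) && node != writer)
            count++;
    return count;
}

int main(void)
{
    dir_entry e = { DIR_SHARED, (1ull << 3) | (1ull << 7) | (1ull << 12), -1 };
    printf("invalidations for a write by node 3: %d\n", invalidations_needed(&e, 3));
    return 0;
}
```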
Directory-based protocols are currently the method of choice for enforcing cache coherence in large-scale shared-memory multiprocessors. The problems associated with these hardware schemes include their lack of scalability (although various suggestions have been made to ameliorate this drawback) and the loss of performance due to false sharing. Software controlled cache coherence (SCCC) is an alternative...
This paper presents a new fast way to simulate large networks of computers. The method uses a frontend EC, which accepts a parallel C program and translates it into a program in an intermediate language for parallel system simulations. An event driven simulator for distributed shared memory systems, DSIM, uses the intermediate language to simulate and obtain efficiency results in networks of thousands...
Fast computer simulation is an essential tool in the design of large parallel computers. We discuss the design and performance of our Fast Accurate Simulation Tool, FAST. We start by summarizing the tradeoffs made in the designs of this and other simulators. The key ideas used in this simulator involve execution driven simulation techniques that modify the object code of the application program being...
The single address space that shared-memory architectures offer simplifies programming, problem partitioning, and dynamic load balancing compared to other programming models for parallel computing systems, such as message passing. Unfortunately, as we scale shared-memory architectures to large configurations, the resulting memory system latencies may limit their performance potential. Finding...
Shared memory architectures often have caches to reduce the number of slow remote memory accesses. The largest possible caches exist in shared memory architectures called Cache-Only Memory Architectures (COMAs). In a COMA all the memory resources are used to implement large caches. Unfortunately, these large caches also have their price. Due to its lack of physically shared memory, COMA may suffer...
Presents two hardware-controlled update-based cache coherence protocols. The authors discuss the two major disadvantages of update protocols: the inefficiency of updates and the mismatch between the granularity of synchronization and that of data transfer. They present two enhancements to the update-based protocols, a write combining scheme and a finer-grain synchronization, to overcome these disadvantages...
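To make the write-combining idea concrete, here is a small software model of a write-combining buffer; it is a sketch of the general technique only, not the hardware enhancement described in the paper. Consecutive stores that hit the same line are merged and shipped as one update message instead of one message per store.

```c
#include <stdint.h>
#include <stdio.h>
#include <string.h>

#define LINE_BYTES 64

typedef struct {
    uint64_t line_addr;          /* address of the line being combined */
    uint8_t  data[LINE_BYTES];   /* combined data */
    uint64_t dirty_mask;         /* one bit per dirty byte in the line */
    int      valid;
} wc_buffer;

/* Stand-in for the network send of an update message. */
static void send_update(uint64_t line_addr, const uint8_t *data, uint64_t mask)
{
    (void)data;
    printf("update line 0x%llx, dirty mask 0x%llx\n",
           (unsigned long long)line_addr, (unsigned long long)mask);
}

static void wc_write(wc_buffer *b, uint64_t addr, uint8_t byte)
{
    uint64_t line = addr & ~(uint64_t)(LINE_BYTES - 1);
    if (b->valid && b->line_addr != line) {       /* new line: flush the old one */
        send_update(b->line_addr, b->data, b->dirty_mask);
        b->valid = 0;
    }
    if (!b->valid) {
        b->line_addr = line;
        b->dirty_mask = 0;
        memset(b->data, 0, LINE_BYTES);
        b->valid = 1;
    }
    b->data[addr & (LINE_BYTES - 1)] = byte;
    b->dirty_mask |= 1ull << (addr & (LINE_BYTES - 1));
}

int main(void)
{
    wc_buffer b = { 0 };
    for (uint64_t a = 0x1000; a < 0x1008; a++)    /* eight stores to one line */
        wc_write(&b, a, (uint8_t)a);
    wc_write(&b, 0x2000, 1);                      /* next line flushes the first */
    return 0;
}
```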
Shared memory multiprocessors generally use caches to improve performance. This introduces the cache coherence problem: multiple copies of the data need to be kept consistent by a suitable mechanism. The paper presents a novel mechanism for organizing the memory modules in order to provide an inexpensive implementation of cache coherence. The interleaved directory scheme uses a unique address...
Compares the performance, in shared-memory multiprocessors, of locating translation-lookaside buffers (TLBs) at processors with that of locating TLBs at memory. The comparison is based on trace-driven simulations of multiprocessors with log N-stage networks interconnecting N processors and N memory modules. For the systems and workloads studied, memory-based TLBs perform noticeably better than processor-based...
Synchronization and remote memory access delays cause staggering inefficiency in most shared memory programs when run on thousands of processors. The authors introduce efficient lock synchronization using the combination of group write consistency, which guarantees write ordering within groups of processors, and eager sharing distributed memory, which sends newly written data values over a fast network...
Performance in large-scale shared-memory multiprocessors depends on finding a scalable solution to the memory-latency problem. The author shows that protect consistency (PRC) relaxes previous consistency models with two distinct performance benefits. First, PRC can be used to expose and exploit more parallelism in the computation, giving better support for latency tolerance. Second, assuming that visible...
We outline an approach to compiling for distributed-memory multiprocessors that is inherited from compiler technologies for shared-memory multiprocessors. We believe this approach to compiling for distributed-memory machines is promising because it is a logical extension of the shared-memory parallel programming model, a model that is easier for programmers to work with and that has been studied...
This paper addresses a purely software-based solution to the multiprocessor cache coherence problem by structuring an operating system to provide for the coherence of its own data while exporting coherent memory to user processes. Also covered are the results of a proof-of-concept port of Mach 3.0, using the principles in this paper, to a prototype of the IBM Shared Memory System POWER/4, a Shared Memory...
Parallel computing on a network of workstations can saturate the communication network leading to excessive message delays and consequently poor application performance. We examine empirically the consequences of integrating a flow control protocol, called Warp control, into Mermera, a software shared memory system that supports parallel computing on distributed systems. For an asynchronous iterative...
Architecture neutrality, reliability, and support for reactive programs are the primary goals of the coordination and programming language model TSD (Transactions on Shared Data). The basic execution units are transactions that communicate through shared data. Data assigned to variables through unification are immutable; the presence or absence of data is used for synchronization. Transactions report...
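Synchronization by the presence or absence of data resembles a single-assignment variable: readers block until a value has been bound, and a bound value never changes. The C sketch below illustrates that general idea only; it is not TSD's implementation, and the names are invented.

```c
#include <pthread.h>
#include <stdio.h>

/* Illustrative single-assignment cell (loosely analogous to presence/absence
 * synchronization; not taken from TSD). */
typedef struct {
    pthread_mutex_t lock;
    pthread_cond_t  bound;
    int             present;   /* 0 = absent, 1 = a value has been assigned */
    long            value;
} sa_cell;

static void sa_init(sa_cell *c)
{
    pthread_mutex_init(&c->lock, NULL);
    pthread_cond_init(&c->bound, NULL);
    c->present = 0;
}

static int sa_bind(sa_cell *c, long v)   /* 0 on success, -1 if already bound */
{
    pthread_mutex_lock(&c->lock);
    if (c->present) { pthread_mutex_unlock(&c->lock); return -1; }
    c->value = v;
    c->present = 1;
    pthread_cond_broadcast(&c->bound);
    pthread_mutex_unlock(&c->lock);
    return 0;
}

static long sa_read(sa_cell *c)          /* blocks until the cell is bound */
{
    pthread_mutex_lock(&c->lock);
    while (!c->present)
        pthread_cond_wait(&c->bound, &c->lock);
    long v = c->value;
    pthread_mutex_unlock(&c->lock);
    return v;
}

int main(void)
{
    sa_cell c;
    sa_init(&c);
    sa_bind(&c, 42);
    printf("value: %ld\n", sa_read(&c));
    return 0;
}
```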
In conventional parallel processing, the main objective of scheduling is to reduce the processors' idle time. However, in Time Warp (TW), which is an optimistic parallel discrete event simulation approach, keeping the processors busy does not necessarily lead to good performance, since the processors may be performing erroneous computations that must eventually be rolled back. Hence, the existing...
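The rollback behavior that makes TW scheduling unusual can be pictured schematically as below; the structure and names are invented for illustration and are not the scheduling scheme proposed by the authors. An event arriving in the past of local virtual time (a straggler) means the optimistic computation went too far and must be undone.

```c
#include <stdio.h>

typedef struct {
    double lvt;    /* local virtual time of this logical process */
    /* state queue, input queue, and output queue omitted */
} lp_t;

typedef struct {
    double timestamp;
    /* event payload omitted */
} event_t;

static void restore_state_before(lp_t *lp, double t)
{
    (void)lp; (void)t;           /* stub: reload the last state saved before t */
}

static void send_antimessages_after(lp_t *lp, double t)
{
    (void)lp; (void)t;           /* stub: cancel messages sent with timestamps > t */
}

static void on_event(lp_t *lp, const event_t *ev)
{
    if (ev->timestamp < lp->lvt) {                /* straggler: roll back */
        restore_state_before(lp, ev->timestamp);
        send_antimessages_after(lp, ev->timestamp);
        printf("rollback from lvt %.1f to %.1f\n", lp->lvt, ev->timestamp);
    }
    lp->lvt = ev->timestamp;                      /* process the event */
}

int main(void)
{
    lp_t lp = { 0.0 };
    event_t e1 = { 10.0 }, e2 = { 5.0 };
    on_event(&lp, &e1);
    on_event(&lp, &e2);                           /* arrives in the past: rollback */
    return 0;
}
```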
We present a linear time algorithm for scheduling iterations of a loop that has no loop-carried dependences. The algorithm is optimal in the sense that any p consecutive iterations in the schedule can be executed simultaneously without any possibility of false sharing, where p is the number of processors, and the algorithm uses at most two wait synchronizations per iteration. Our algorithm is asynchronous...
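One plausible way to see how a schedule can rule out false sharing (a hedged sketch, not necessarily the authors' algorithm): give each processor a contiguous block of iterations whose output elements are aligned to cache-line boundaries, so no two concurrently running iterations write into the same line. The element size and line size below are assumptions for the example.

```c
#include <stdio.h>

#define LINE_BYTES     64
#define ELEM_BYTES      8                   /* assuming 8-byte array elements */
#define ELEMS_PER_LINE (LINE_BYTES / ELEM_BYTES)

/* Compute processor proc's half-open iteration range [*lo, *hi) out of n
 * iterations, with block boundaries aligned to cache lines of the output array. */
static void block_schedule(int n, int nprocs, int proc, int *lo, int *hi)
{
    int lines = (n + ELEMS_PER_LINE - 1) / ELEMS_PER_LINE;   /* lines touched */
    int lines_per_proc = (lines + nprocs - 1) / nprocs;      /* ceiling division */
    *lo = proc * lines_per_proc * ELEMS_PER_LINE;
    *hi = (proc + 1) * lines_per_proc * ELEMS_PER_LINE;
    if (*lo > n) *lo = n;
    if (*hi > n) *hi = n;
}

int main(void)
{
    int lo, hi;
    for (int p = 0; p < 4; p++) {
        block_schedule(1000, 4, p, &lo, &hi);
        printf("proc %d: iterations [%d, %d)\n", p, lo, hi);
    }
    return 0;
}
```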
In writing a highly-portable parallel program, we developed a library of parallel primitives to shield our application code from the various multiprocessors. Rather than adapt the different multiprocessors to a program-specific library via extensive implementation, we chose to first find a common 'intersection' of the virtual machine models provided by each vendor, define a common interface to that...
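The "common intersection" approach can be pictured as a small machine-independent interface that each vendor's library is wrapped behind; the header below is a hypothetical sketch with invented names, not the authors' actual primitive library. Each target multiprocessor would supply a thin implementation file mapping these calls onto its own threads, locks, and barriers.

```c
/* Hypothetical machine-independent parallel primitives (illustrative names). */
#ifndef PAR_PRIMS_H
#define PAR_PRIMS_H

typedef struct par_lock    par_lock;
typedef struct par_barrier par_barrier;

int  par_nprocs(void);                     /* number of processors in use */
int  par_self(void);                       /* this processor's id, 0..n-1 */

par_lock *par_lock_create(void);
void      par_lock_acquire(par_lock *l);
void      par_lock_release(par_lock *l);

par_barrier *par_barrier_create(int nprocs);
void         par_barrier_wait(par_barrier *b);

/* Fork nprocs workers running fn(arg) and wait for all of them to finish. */
void par_run(int nprocs, void (*fn)(void *arg), void *arg);

#endif /* PAR_PRIMS_H */
```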