Many multiprocessor list scheduling heuristics that account for interprocessor communication delay have been proposed in recent years. However, no uniform comparative study of published heuristics has been performed in almost 20 years. This paper presents the results of a large quantitative study using random, but program-like input graphs. We found differences in the performance of the various heuristics...
New field-programmable gate array (FPGA) technologies have increased the industrial interest in tools which map a DSP application and a set of performance constraints to a specific VLSI architecture. This paper presents an optimization methodology for mapping a DSP application and a set of performance constraints into an architecture targeted for FPGA technologies with user-programmable RAM blocks...
Directory-based protocols are currently the method of choice for enforcing cache coherence in large-scale shared-memory multiprocessors. These hardware schemes suffer from two main problems: limited scalability, although various suggestions have been made to ameliorate this drawback, and performance loss due to false sharing. Software-controlled cache coherence (SCCC) is an alternative...
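The directory mechanism the abstract refers to can be pictured with a textbook full-map sketch: the directory records which processors hold a copy of a line and invalidates sharers on a write. This is a generic MSI-style illustration with hypothetical names, not the SCCC scheme or any protocol from the paper:

```python
class Directory:
    """Minimal full-map directory for one cache line (textbook sketch)."""

    def __init__(self):
        self.sharers = set()   # processors holding a read-only copy
        self.owner = None      # processor holding an exclusive (modified) copy

    def read(self, pid):
        """A processor reads the line; a dirty copy must be written back first."""
        msgs = []
        if self.owner is not None and self.owner != pid:
            msgs.append(f"writeback from P{self.owner}")
            self.sharers.add(self.owner)
            self.owner = None
        self.sharers.add(pid)
        return msgs

    def write(self, pid):
        """A processor writes the line; all other copies are invalidated."""
        msgs = [f"invalidate P{p}" for p in sorted(self.sharers - {pid})]
        if self.owner is not None and self.owner != pid:
            msgs.append(f"invalidate P{self.owner}")
        self.sharers = set()
        self.owner = pid
        return msgs
```

The invalidation messages produced on each write are exactly the traffic that false sharing inflates: two processors touching disjoint words of the same line still invalidate each other.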
Presents an adaptive unicast and multicast routing algorithm, called adaptive-cast, for 3D mesh networks with wormhole routing and virtual channel flow control. The unique feature of adaptive-cast is that it remains valid when messages with a single destination (unicast) and with multiple destinations (multicast) are mixed together, which drastically simplifies the implementation of the router...
Write buffers have a significant impact on performance, especially in wide-issue superscalar systems with write-through caching. We develop fast, efficient simulation methods for evaluating multiple write-buffer configurations together in a single pass. Our results are also applicable to the simulation of other buffer structures. We first consider simulating non-coalescing write buffers. We show that...
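As a concrete picture of why buffer depth matters, here is a minimal cycle-level model of a non-coalescing FIFO write buffer. This is an illustrative sketch with assumed parameters (one serial memory port, fixed retire latency), not the single-pass method the paper develops:

```python
def write_buffer_stalls(write_times, depth, retire_latency):
    """Count processor stall cycles for a non-coalescing FIFO write buffer.

    write_times: cycles at which the processor issues writes (non-decreasing)
    depth: number of buffer entries
    retire_latency: cycles for one buffered write to retire to memory
    """
    buffer = []   # completion (retire) times of in-flight writes, oldest first
    stall = 0
    for t in write_times:
        t += stall                              # earlier stalls delay this write
        buffer = [c for c in buffer if c > t]   # drop entries already retired
        if len(buffer) == depth:                # buffer full: stall until the
            stall += buffer[0] - t              # oldest entry drains
            t = buffer[0]
            buffer.pop(0)
        last = buffer[-1] if buffer else t      # serial port: retire after the
        buffer.append(last + retire_latency)    # previous buffered write
    return stall
```

With a burst of back-to-back writes, deepening the buffer monotonically reduces stalls until the burst fits entirely, e.g. `write_buffer_stalls([0, 1, 2], 1, 5)` stalls far more than the same trace with depth 3.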
A new, improved version of the classic binary non-restoring division algorithm is presented. It is implemented on a systolic on-line architecture targeted at digital signal processing applications. The overall goal is to implement DSP algorithms using redundant data representations throughout the algorithm, and to obtain a balanced architecture according to the specifications of the application...
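For reference, the classic binary non-restoring algorithm that the paper improves upon works by choosing a quotient digit in {-1, +1} from the sign of the partial remainder at each step, with a single correction at the end. The integer formulation and function name below are illustrative; the paper's systolic, redundant-representation on-line variant is not reproduced here:

```python
def nonrestoring_divide(dividend, divisor, n):
    """Classic binary non-restoring integer division over n iterations."""
    assert divisor > 0 and 0 <= dividend < (divisor << n)
    r = dividend      # partial remainder
    q = 0             # quotient digits encoded as bits: 1 -> +1, 0 -> -1
    for i in range(n - 1, -1, -1):
        if r >= 0:    # remainder non-negative: subtract, digit +1
            q = (q << 1) | 1
            r -= divisor << i
        else:         # remainder negative: add back, digit -1
            q = (q << 1)
            r += divisor << i
    # convert the {-1, +1} digit string to an ordinary binary quotient:
    # Q = 2B - (2^n - 1), where B is the bit-encoded digit string
    q = (q << 1) + 1 - (1 << n)
    if r < 0:         # single final correction (no per-step restoring)
        r += divisor
        q -= 1
    return q, r
```

Unlike restoring division, a wrong subtraction is never undone immediately; the sign simply flips the next operation, which is what makes the recurrence attractive for pipelined and systolic hardware.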
We present a new scalar processor for high-speed vector processing, and its evaluation. The proposed processor can hide long main-memory access latency by introducing slide-windowed floating-point registers with a data-preloading feature, together with a pipelined memory. Owing to the slide-window structure, the proposed processor can utilize more floating-point registers while keeping upward compatibility with existing...
Extracting fine-grain parallelism is essential to further increases in processor performance. This paper investigates an extension of the VLIW architecture, called V++, which retains the ability of VLIW architectures to effectively exploit fine-grain parallelism while introducing facilities for restructuring very long instruction words dynamically. V++ adopts two types of restructuring methods:...
Advances in technology and computer design are resulting in impressive increases in raw processor power. Currently, new processor implementations are showing almost a doubling in clock frequency. Moreover, with each new generation, processor designers are incorporating more advanced architectural techniques, such as instruction-level parallelism, into these implementations. Memory technology also continues...
The increasing disparity between processor speed and main-memory speed makes way for multi-level cache hierarchies in almost all of today's computer systems; specifically, second-level (L2) caches, with larger capacity but longer access time than first-level (L1) caches, have been adopted to reduce this memory gap. In this study an enhanced one-pass trace-driven simulation technique is used...
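One-pass trace-driven simulation in the classic sense of Mattson et al. can be sketched as follows: a single LRU stack walk over the reference trace records each reference's stack distance, from which the hit count of every fully associative LRU cache size falls out at once. This is a generic illustration of the single-pass idea, not the enhanced L2 technique of this study:

```python
def lru_stack_distances(trace):
    """One pass over the trace: histogram of LRU stack distances.

    A distance of None marks a cold (first-reference) miss.
    """
    stack = []       # block addresses, most recently used at index 0
    hist = {}        # stack distance -> number of references
    for block in trace:
        if block in stack:
            d = stack.index(block)   # 0 = re-reference of the MRU block
            stack.pop(d)
        else:
            d = None                 # never seen before
        stack.insert(0, block)
        hist[d] = hist.get(d, 0) + 1
    return hist

def hits_for_size(hist, size):
    """A reference hits in an LRU cache of `size` blocks iff distance < size."""
    return sum(n for d, n in hist.items() if d is not None and d < size)
```

The inclusion property of LRU is what makes this work: one simulation run answers "how many hits?" for all cache sizes simultaneously, instead of re-running the trace per configuration.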
As microprocessor speeds increase, memory bandwidth is rapidly becoming the performance bottleneck in the execution of vector-like algorithms. Although caching provides adequate performance for many problems, caching alone is an insufficient solution for vector applications with poor temporal and spatial locality. Moreover, the nature of memories themselves has changed. Current DRAM components should...
We examine the impact of using flash memory as a second-level file system buffer cache to reduce power consumption and file access latency on a mobile computer. We use trace-driven simulation to evaluate the impact of what we call a FLASHCACHE. We relate the power consumption and access latency of the storage sub-system to the characteristics of the FLASHCACHE: its size, the unit of erasure, and access...
We propose that optical packet switching networks are better implemented through space division switching (SDS) approaches than through wavelength division multiplexing (WDM) approaches. We show that active optical networks designed from optically controlled nonblocking networks can provide higher efficiency and lower latency than other approaches. Our self-routing optical crossbar...
The single address space that shared-memory architectures offer simplifies programming, problem partitioning, and dynamic load balancing compared to other programming models for parallel computing systems, such as message passing. Unfortunately, as we scale shared-memory architectures to large configurations, the resulting memory-system latencies may limit their performance potential. Finding...
Shared memory architectures often have caches to reduce the number of slow remote memory accesses. The largest possible caches exist in shared memory architectures called Cache-Only Memory Architectures (COMAs). In a COMA all the memory resources are used to implement large caches. Unfortunately, these large caches also have their price. Due to its lack of physically shared memory, COMA may suffer...
Presents two hardware-controlled update-based cache coherence protocols. The authors discuss the two major disadvantages of update protocols: the inefficiency of updates and the mismatch between the granularity of synchronization and that of data transfer. They present two enhancements to the update-based protocols, a write-combining scheme and finer-grain synchronization, to overcome these disadvantages...
Shared memory multiprocessors generally use caches to improve the performance. This introduces the cache coherence problem. Multiple copies of the data need to be kept consistent by using a suitable mechanism. The paper presents a novel mechanism for organizing the memory modules in order to provide an inexpensive implementation for cache coherence. The interleaved directory scheme uses a unique address...
As the number of processors increases, so does communication latency and the probability of component failure. A technique that addresses these problems is data replication, which provides faster access and greater availability. Its drawback is that the replicas must be kept consistent. The author describes a family of fault-tolerant algorithms for maintaining the consistency of cacheable data. Processors...
Synchronization and remote memory access delays cause staggering inefficiency in most shared-memory programs when run on thousands of processors. The authors introduce efficient lock synchronization using the combination of group write consistency, which guarantees write ordering within groups of processors, and eagersharing distributed memory, which sends newly written data values over a fast network...
Performance in large-scale shared-memory multiprocessors depends on finding a scalable solution to the memory-latency problem. The author shows that protect consistency (PRC) relaxes previous consistency models with two distinct performance benefits. First, PRC is used to expose and exploit more parallelism in the computation, giving better support to latency tolerance. Second, assuming that visible...