Proceedings. Innovative Architecture for Future Generation High-Performance Processors and Systems

Items from 1 to 18 out of 18 results

chapter

Implementation Details and Evaluation of a New Exact and Fast Test for Array Data Dependence Analysis Based on Simplex Method

M. Mineo, S. Saito, T. Uehara, Y. Kunieda

Innovative Architecture for Future Generation High-Performance Processors and Systems (IWIA'4) > 89 - 100

Proceedings. Innovative Architecture for Future Generation High-Performance Processors and Systems

Data dependence analysis (DDA) is essential for any automatic parallelizing compiler to determine parallelizability of given portions of programs. Several techniques and tests to analyze data dependence between array elements have already been proposed. It is clear that when one examines these conventional DDA techniques, there exists a trade-off between their analysis speed and exactness of their...

chapter

Highly Functional Memory Architecture for Large-Scale Data Applications

K. Tanaka, T. Fukawa

Innovative Architecture for Future Generation High-Performance Processors and Systems (IWIA'4) > 109 - 118

Proceedings. Innovative Architecture for Future Generation High-Performance Processors and Systems

Response time in database systems is not getting small as a processor speed is accelerating because of a growing gap between speed of the processor and that of a memory, and increase in data size. A conventional memory controller and caches in a processor cannot provide enough bandwidth of data transfer between a processor and memory. For fast processing with large data, it is effective to equip a...

chapter

Memory Management for Data Localization on OSCAR Chip Multiprocessor

H. Nakano, T. Kodaka, K. Kimura, H. Kasahara

Innovative Architecture for Future Generation High-Performance Processors and Systems (IWIA'4) > 82 - 88

Proceedings. Innovative Architecture for Future Generation High-Performance Processors and Systems

Chip multiprocessor (CMP) architecture has attracting much attention as a next-generation microprocessor architecture and many kinds of CMP are widely being researched. However, CMP architectures several difficulties for effective use of memory, especially cache or local memory near a processor core. The authors have proposed OSCAR CMP architecture, which cooperatively works with multigrain parallelizing...

chapter

Array Data Dependence Testing with the Chains of Recurrences Algebra

R.A. van Engelen, J. Birch, K.A. Gallivan

Innovative Architecture for Future Generation High-Performance Processors and Systems (IWIA'4) > 70 - 81

Proceedings. Innovative Architecture for Future Generation High-Performance Processors and Systems

This paper presents a new approach to dependence testing in the presence of nonlinear and non-closed array index expressions and pointer references. The chains of recurrences formalism and algebra is used to analyze the recurrence relations of induction variables, and for constructing recurrence forms of array index expressions and pointer references. We use these recurrence forms to determine if...

chapter

GXP : An Interactive Shell for the Grid Environment

K. Taura

Innovative Architecture for Future Generation High-Performance Processors and Systems (IWIA'4) > 59 - 67

Proceedings. Innovative Architecture for Future Generation High-Performance Processors and Systems

We describe GXP, a shell for distributed multi-cluster environments. With GXP, users can quickly submit a command to many nodes simultaneously (approximately 600 milliseconds on over 300 nodes spread across five local-area networks). It therefore brings an interactive and instantaneous response to many cluster/network operations, such as trouble diagnosis, parallel program invocation, installation...

chapter

YAWARA: A Meta-Level Optimizing Computer System

T. Baba, T. Yokota, K. Ootsu, F. Furukawa, more

Innovative Architecture for Future Generation High-Performance Processors and Systems (IWIA'4) > 148 - 153

Proceedings. Innovative Architecture for Future Generation High-Performance Processors and Systems

This paper proposes a new, autonomous and dynamic optimization framework, called a meta-level computation. In this framework, a meta-level processor acquires the execution profile of a base-level processor, i.e. a conventional von Neumann machine, produces the optimized base-level configuration and performs the reconfiguration. We define the meta-level computation model based on the considerations...

chapter

Custom-Enabled System Architectures for High End Computing

T. Sterling, P. Kogge

Innovative Architecture for Future Generation High-Performance Processors and Systems (IWIA'4) > 30 - 39

Proceedings. Innovative Architecture for Future Generation High-Performance Processors and Systems

The US Federal Government has convened a major committee to determine future directions for government sponsored high end computing system acquisitions and enabling research. The High End Computing Revitalization Task Force was inaugurated in 2003 involving all Federal agencies for which high end computing is critical to meeting mission goals. As part of the HECRTF agenda, a multi-day community wide...

chapter

Direct Instruction Wakeup for Out-of-Order Processors

M.A. Ramirez, A. Cristal, A.V. Veidenbaum, L. Villa, more

Innovative Architecture for Future Generation High-Performance Processors and Systems (IWIA'4) > 2 - 9

Proceedings. Innovative Architecture for Future Generation High-Performance Processors and Systems

Instruction queues consume a significant amount of power in high-performance processors, primarily due to instruction wakeup logic access to the queue structures. The wakeup logic delay is also a critical timing parameter. This paper proposes a new queue organization using a small number of successor pointers plus a small number of dynamically allocated full successor bit vectors for cases with a...

chapter

A Super Instruction-Flow Architecture for High Performance and Low Power Processors

K. Kise, T. Katagiri, H. Honda, T. Yuba

Innovative Architecture for Future Generation High-Performance Processors and Systems (IWIA'4) > 10 - 19

Proceedings. Innovative Architecture for Future Generation High-Performance Processors and Systems

Microprocessor performance has improved at about 55% per year for the past three decades. To maintain this performance growth rate, next generation processors must achieve higher levels of instruction level parallelism. However, it is known that a conditional branch poses serious performance problems in modern processors. In addition, as an instruction pipeline becomes deep and the issue width becomes...

chapter

Title Page

Innovative Architecture for Future Generation High-Performance Processors and Systems (IWIA'4) > i - iv

Proceedings. Innovative Architecture for Future Generation High-Performance Processors and Systems

The following topics are dealt with: processors and systems, programming quality, computer architecture, compilation, and high-performance computing

chapter

A New Memory Module for COTS-Based Personal Supercomputing

N. Tanabe, M. Nakatake, H. Hakozaki, Y. Dohi, more

Innovative Architecture for Future Generation High-Performance Processors and Systems (IWIA'4) > 40 - 48

Proceedings. Innovative Architecture for Future Generation High-Performance Processors and Systems

This paper presents how to make inexpensive personal supercomputers getting the merit of commercial-off-the-shelf (COTS) continuously after the death of vector super-computer venders. It is designed to realize this goal without any modification on CPU, bridge chips on motherboard and memory chips. Only plugging a new memory module with vector load/store function make an inexpensive home-use personal...

chapter

Fault-Tolerant Adaptive Deadlock-Recovery Routing for k-ary n-cube Networks

T. Yoshinaga, H. Hosogoshi, M. Sowa

Innovative Architecture for Future Generation High-Performance Processors and Systems (IWIA'4) > 49 - 58

Proceedings. Innovative Architecture for Future Generation High-Performance Processors and Systems

This paper proposes a fault-tolerant fully adaptive deadlock-recovery routing algorithm for k-ary n-cube networks. We intend to consider both the adaptability for faults and the communication performance by integrating regular and irregular network routing. Our algorithm tolerates any number or shape of faults without disabling fault-free nodes by maintaining routing tables that are configured based...

chapter

Power-Aware Register Renaming in High-Performance Processors Power-Aware Register Renaming in High-Performance Processors

J.L. Ayala, M. Lopez-Vallejo, A. Veidenbaum

Innovative Architecture for Future Generation High-Performance Processors and Systems (IWIA'4) > 20 - 27

Proceedings. Innovative Architecture for Future Generation High-Performance Processors and Systems

This work presents an efficient multi-banked architecture of the register file, and a low-power compiler support which reduces energy consumption in this device by more than a 78%. The key idea of this work is based on a quasi-deterministic interpretation of the register assignment task, and the use of the voltage scaling techniques

chapter

Large-Scale 3-D Fluid Simulations for Implosion Hydrodynamics on the Earth Simulator

H. Sakagami, H. Murai

Innovative Architecture for Future Generation High-Performance Processors and Systems (IWIA'4) > 102 - 108

Proceedings. Innovative Architecture for Future Generation High-Performance Processors and Systems

A three-dimensional fluid code, IMPACT-3D has been parallelized with high performance Fortran (HPF) on the Earth Simulator. IMPACT-3D is an implosion analysis code using TVD scheme, which performs three-dimensional compressible and inviscid Eulerian fluid computation with the explicit 5-point stencil scheme for spatial differentiation and the fractional time step for time integration. The third dimension...

chapter

Of Piglets and Threadlets: Architectures for Self-Contained, Mobile, Memory Programming

P.M. Kogge

Innovative Architecture for Future Generation High-Performance Processors and Systems (IWIA'4) > 130 - 138

Proceedings. Innovative Architecture for Future Generation High-Performance Processors and Systems

Virtually all of the discussion on "commodity" vs. "custom" architectures, especially for highly parallel systems, has focused on the high-glamor, high complexity processor core. This paper takes a different tack - it explores the potential for directly attacking the memory wall by programming the classically "dumb" memory interface. Several related but separable techniques...

chapter

Impact of Dynamic Allocation of Physical Register Banks for an SMT Processor

N. Kato, M. Yamato, O. Tujimoto, M. Sato, more

Innovative Architecture for Future Generation High-Performance Processors and Systems (IWIA'4) > 139 - 147

Proceedings. Innovative Architecture for Future Generation High-Performance Processors and Systems

In an SMT processor, the increase of the register contexts of a thread requires a large number of physical registers. Moreover, a physical register file in an SMT processor requires more ports for the execution units, which cause significant growth of the area, access time and power consumption of the register file. These problems are critical hurdles to implement a large scale SMT processor. Especially,...

chapter

Parallel Processing using Data Localization for MPEG2 Encoding on OSCAR Chip Multiprocessor

T. Kodaka, H. Nakano, K. Kimura, H. Kasahara

Innovative Architecture for Future Generation High-Performance Processors and Systems (IWIA'4) > 119 - 127

Proceedings. Innovative Architecture for Future Generation High-Performance Processors and Systems

Currently, many people are enjoying multimedia applications with image and audio processing on PCs, PDAs, mobile phones and so on. With the popularization of the multimedia applications, needs for low cost, low power consumption and high performance processors has been increasing. To this end, chip multiprocessor architectures which allow us to attain scalable performance improvement by using multigrain...

book

Innovative Architecture for Future Generation High-Performance Processors and Systems, 2004. Proceedings

IEEE

Proceedings. Innovative Architecture for Future Generation High-Performance Processors and Systems

Filter options

Publication date

Set your own date range

Keywords

MULTIPROCESSING SYSTEMS (8)
PARALLEL ARCHITECTURES (8)
PARALLEL PROCESSING (7)
INSTRUCTION SETS (5)
MICROPROCESSOR CHIPS (5)
MEMORY ARCHITECTURE (4)
PARALLELISING COMPILERS (4)
LOW-POWER ELECTRONICS (3)
MULTI-THREADING (3)
DATA LOCALIZATION (2)
DATA STRUCTURES (2)
EARTH SIMULATOR (2)
HIGH-PERFORMANCE PROCESSORS (2)
INSTRUCTION LEVEL PARALLELISM (2)
OSCAR CHIP MULTIPROCESSOR (2)
PARALLEL SYSTEMS (2)
PROGRAM COMPILERS (2)
SYSTEM RECOVERY (2)
12X TYPE SWITCHES (1)
3D FLUID SIMULATIONS (1)
70 NM (1)
ACCESS PACKET (1)
ACCESS TIME (1)
ADAPTIVE DEADLOCK-RECOVERY ROUTING (1)
ADAPTIVE ROUTER (1)
AOTF (1)
ARRAY DATA DEPENDENCE ANALYSIS (1)
ARRAY DATA DEPENDENCE TESTING (1)
ARRAY ELEMENTS (1)
AUDIO PROCESSING (1)
AUTONOMOUS OPTIMIZATION (1)
BANDWIDTH ANALYSIS (1)
BANERJEE TEST (1)
BASELINE INSTRUCTION QUEUE POWER (1)
BOTF (1)
BRANCH INSTRUCTIONS (1)
BRIDGE CHIPS (1)
CACHE (1)
CACHE STORAGE (1)
CACHES (1)
CAM (1)
CENTRALIZED SHARED MEMORY (1)
CHECKPOINTING (1)
CHIP MULTIPROCESSOR ARCHITECTURES (1)
CLUSTER OPERATIONS (1)
CLUSTERS (1)
COMMUNICATION OPTIMIZATION (1)
COMMUNICATION PERFORMANCE (1)
COMPILATION (1)
COMPILER OPTIMIZATION (1)
COMPUTATIONAL FLUID DYNAMICS (1)
COMPUTER ARCHITECTURE (1)
COMPUTING SYSTEM ACQUISITIONS (1)
COTS INFINIBAND 4X TYPE (1)
COTS SO-DIMMS (1)
CPU PENTIUM4 PC (1)
CRITICAL TIMING PARAMETER (1)
CUSTOM SYSTEM ARCHITECTURE (1)
CUSTOM-ENABLED SYSTEM ARCHITECTURES (1)
DATA ANALYSIS (1)
DATA LOCALITY OPTIMIZATION (1)
DATA STRUCTURE (1)
DEAD PROCESS CLEANUP (1)
DEADLOCK RECOVERY (1)
DEBUGGING (1)
DIGITAL SIMULATION (1)
DIRECT WAKEUP (1)
DISTRIBUTED MULTICLUSTER ENVIRONMENTS (1)
DISTRIBUTED SHARED MEMORY (1)
DISTRIBUTED SHARED MEMORY SYSTEMS (1)
DRAM (1)
DUMB MEMORY INTERFACE (1)
DYNAMIC ALLOCATION (1)
DYNAMIC OPTIMIZATION (1)
DYNAMIC RECONFIGURATION (1)
DYNAMIC TASK SCHEDULING (1)
EULERIAN FLUID COMPUTATION (1)
EXPLOSIONS (1)
FAULT TOLERANCE (1)
FAULT TOLERANT COMPUTING (1)
FAULT-FREE TORUS NETWORK (1)
FEEDBACK-DIRECTED RESOURCE CONTROL (1)
FLUID CODE (1)
FORTRAN (1)
FUNCTIONAL MEMORY ARCHITECTURE (1)
GRID COMPUTING (1)
GRID ENVIRONMENT (1)
GXP (1)
HARDWARE MECHANISMS (1)
HARDWARE RECONFIGURATION (1)
HARDWARE SYSTEM (1)
HETEROGENEOUS ARCHITECTURE (1)
HIGH END COMPUTING (1)
HIGH END COMPUTING REVITALIZATION TASK FORCE (1)
HIGH PERFORMANCE FORTRAN (1)
HIGH-PERFORMANCE COMPUTING (1)
HOMOGENEOUS ARCHITECTURE (1)
HPF-JA EXTENSIONS (1)
HYDRODYNAMICS (1)
HYPERCUBE NETWORKS (1)
more

INFONA - science communication portal

Proceedings. Innovative Architecture for Future Generation High-Performance Processors and Systems $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

Proceedings. Innovative Architecture for Future Generation High-Performance Processors and Systems