Search results

Items from 101 to 120 out of 567 results

chapter

Design and Implementation of a Uniform Platform to Support Multigenerational GPU Architectures for High Performance Stream-Based Computing

S Yamagiwa, M Arai, K Wada

2010 First International Conference on Networking and Computing > 249 - 255

2010 First International Conference on Networking and Computing (ICNC 2010)

GPU-based computing has become one of the popular high performance computing fields. The field is called GPGPU. This paper is focused on the design and implementation of a uniform GPGPU application that is optimized for both the legacy and the recent GPU architectures. As a typical example of such a GPGPU application, this paper will discuss the uniform implementation of the Caravela platform. Especially...

chapter

Multiprocessor Architectures Specialized for Multi-agent Simulation

Christian Schack, Wolfgang Heenes, Rolf Hoffmann

2010 First International Conference on Networking and Computing > 232 - 236

2010 First International Conference on Networking and Computing (ICNC 2010)

Two new multiprocessor architectures to accelerate the simulation of multi-agent worlds based on the massively parallel GCA (Global Cellular Automata) model are presented. The GCA model is suited to describe and simulate different multi-agent worlds. The designed and implemented architectures mainly consist of a set of processors (NIOS II) and a network. The multiprocessor systems allow the implementation...

chapter

Smart Core System for Dependable Many-Core Processor with Multifunction Routers

S Takamaeda, S Sato, T Miyoshi, K Kise

2010 First International Conference on Networking and Computing > 133 - 139

2010 First International Conference on Networking and Computing (ICNC 2010)

Dependability of many-core processors is a very important topic. To improve the dependability, we propose the Smart Core system, which is a smart many-core system with redundant cores and multifunction routers. The multifunction router has three functions: copying packets, changing the destinations of packets, and rendezvousing and comparing two packets from different nodes. Using these additional...

chapter

Blue Gene/Q resource management architecture

Tom Budnik, Brant Knudson, Mark Megerian, Sam Miller, more

2010 3rd Workshop on Many-Task Computing on Grids and Supercomputers > 1 - 5

2010 3rd Workshop on Many-Task Computing on Grids and Supercomputers (MTAGS 2010)

As supercomputers scale to a million processor cores and beyond, the underlying resource management architecture needs to provide a flexible mechanism to manage the wide variety of workloads executing on the machine. In this paper we describe the novel approach of the Blue Gene/Q (BG/Q) supercomputer in addressing these workload requirements by providing resource management services that support both...

chapter

An evaluation of parallel optimization for OpenSolaris® network stack

Hongbo Zou, Wenji Wu, Xian-He Sun, P DeMar, more

IEEE Local Computer Network Conference > 296 - 299

2010 IEEE 35th Conference on Local Computer Networks (LCN 2010)

Computing is now shifting towards multiprocessing. The fundamental goal of multiprocessing is improved performance through the introduction of additional hardware threads or cores (referred to as “cores” for simplicity). Modern network stacks can exploit parallel cores to allow either message-based parallelism or connection-based parallelism as a means to enhance performance. OpenSolaris has redesigned...

chapter

Design and implementation of embedded multiprocessor architecture using FPGA

M H Salih, M R Arshad

2010 IEEE Symposium on Industrial Electronics and Applications (ISIEA) > 579 - 584

2010 IEEE Symposium on Industrial Electronics and Applications (ISIEA 2010)

Modern embedded multiprocessors are complex systems that often require years to design and verify. A significant factor is that engineers must allocate a disproportionate share of their effort to ensuring that modern FPGA chip architectures behave correctly. This paper proposes the design and creation of an embedded multiprocessor architecture system, focusing on its design area and performance. Embedded...

chapter

An in-memory monitoring database for self adaptive MP²SoCs

E Faure, G M Almeida, M Benabdenbi, P Benoit, more

2010 Conference on Design and Architectures for Signal and Image Processing (DASIP) > 97 - 104

2010 Conference on Design and Architectures for Signal and Image Processing (DASIP 2010)

The complexity of MP²SoC architectures to come is such that many issues arise simultaneously, such as multicore programming, system performance, reliability, scalability, etc. The key to solving these issues is self-adaptability: the chips to come have to integrate the required software and hardware means to monitor and self-react to the various kinds of events that are likely to occur during chip's...

chapter

Reconfigurable parallel computing

Dietmar Tutsch

2010 First International Conference On Parallel, Distributed and Grid Computing (PDGC 2010) > 5

2010 1st International Conference on Parallel, Distributed and Grid Computing (PDGC 2010)

Summary form only given. The dynamic reconfiguration of hardware stands for the change of hardware while the system is operating. Its benefit is the adaption to different computing requirements. For instance, an improved use of communication networks can be achieved: Many networks reveal the characteristic that connections between specific communication partners show a smaller latency than others...

chapter

Performance evaluation of a novel Dimension Order Routing algorithm for Mesh-of-tree based Network-on-Chip architecture

K Manna, S Chattopadhyay, I S Gupta

2010 First International Conference On Parallel, Distributed and Grid Computing (PDGC 2010) > 135 - 139

2010 1st International Conference on Parallel, Distributed and Grid Computing (PDGC 2010)

This paper presents a new dimension-oriented routing algorithm for Mesh-of-tree (MoT) based Network-on-Chip (NoC) architecture. The addressing scheme is considerably simplified, which enables us to reduce the minimum flit size to 16 bits, compared to 32 bits in the previously reported works. The same level of throughput and average latency could be achieved with a 43.86% reduction in area and 43% reduction...

chapter

The microprocessor of 2020: Why you should care, and what you can do about it

Yale Patt, Ernest Cockrell

2010 First International Conference On Parallel, Distributed and Grid Computing (PDGC 2010) > 1

2010 1st International Conference on Parallel, Distributed and Grid Computing (PDGC 2010)

The microprocessor of the year 2020 will have 1000 cores on it, and unless you get involved, it will either just be an array of cores thrown over the transom for you to figure out what to do with, or it will be easy to use but run like a turtle, compared to what it could do. These two extremes are not unlikely, unless those with applications get involved. Most of the gurus of computer architecture...

chapter

Fussli: A portable framework for exploiting hybrid task, data and pipeline parallelism on multi-cores

Xiaoye Wang, Ting Zhang

2010 International Conference on Computer Application and System Modeling (ICCASM 2010) > 11 > V11-88 - V11-95

2010 International Conference on Computer Application and System Modeling (ICCASM 2010)

Parallelism is the most important means to exploit the computation potential of multi-core processors. Real applications, particularly commercial applications, often have strong dependences that have to be respected. In order to achieve reasonably good performance, hybrid parallelism schemes usually need to be applied in these applications. Furthermore, parallel applications with task and pipeline parallelism...

chapter

Logic associative multiprocessor for information analysis

V I Hahanov, W Gharibi, E I Litvinova, N C Umerah

2010 12th Biennial Baltic Electronics Conference > 169 - 172

2010 12th Biennial Baltic Electronics Conference (BEC 2010)

This article describes a high-speed logic associative multiprocessor for concurrently analyzing information represented in analytic, graph, and table forms of associative relations to search, recognize and make a decision in n-dimensional vector discrete space. Vector-logical process models of actual applications, for which the quality of solution is estimated by the proposed integral non-arithmetical...

chapter

Accelerating BP Neural Network-Based Image Compression by CPU and GPU Cooperation

Jinxian Lin, Jianghong Lin

2010 International Conference on Multimedia Technology > 1 - 4

2010 International Conference on Multimedia Technology (ICMT)

Recently, the GPU has evolved into a highly parallel, multithreaded, many-core processor with tremendous computational capability and very high memory bandwidth. At the same time, multi-core CPU evolution has continued, and today's CPUs have 4-8 cores, which offer dramatically increased performance and power-saving characteristics. We are aware of very few works that consider both devices cooperating to solve...

chapter

Flexible Error Protection for Energy Efficient Reliable Architectures

T Miller, N Surapaneni, R Teodorescu

2010 22nd International Symposium on Computer Architecture and High Performance Computing > 1 - 8

2010 22nd International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD 2010)

Technology scaling is having an increasingly detrimental effect on microprocessor reliability, with increased variability and higher susceptibility to errors. At the same time, as integration of chip multiprocessors increases, power consumption is becoming a significant bottleneck that could threaten their growth. To deal with these competing trends, energy-efficient solutions are needed to deal with...

chapter

High Level Power and Energy Exploration Using ArchC

T Gupta, C Bertolini, O Heron, N Ventroux, more

2010 22nd International Symposium on Computer Architecture and High Performance Computing > 25 - 32

2010 22nd International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD 2010)

With the increase in the design complexity of MPSoC architectures, estimating power consumption is very complex and time consuming at lower levels of abstraction. We propose a methodology using ArchC named Power-ArchC for a fast high-level estimation of processor power consumption. Power values are obtained by an instruction level power characterization at gate level. The requirements for power evaluation...

chapter

Adaptive TDMA bus allocation and elastic scheduling: A unified approach for enhancing robustness in multi-core RT systems

Paolo Burgio, Martino Ruggiero, Francesco Esposito, Mauro Marinoni, more

2010 IEEE International Conference on Computer Design > 187 - 194

2010 IEEE International Conference on Computer Design (ICCD 2010)

Next-generation real-time systems will be increasingly based on heterogeneous MPSoC design paradigms, where predictability and performance will be key issues to deal with. Such issues can be tackled both at the hardware level, by embedding technologies such as TDMA busses, and at the OS level, where suitable scheduling techniques can improve performance and reduce energy consumption. Among these,...

chapter

Scenario-based design space exploration of MPSoCs

P van Stralen, A Pimentel

2010 IEEE International Conference on Computer Design > 305 - 312

2010 IEEE International Conference on Computer Design (ICCD 2010)

Early design space exploration (DSE) is a key ingredient in system-level design of MPSoC-based embedded systems. The state of the art in this field typically still explores systems under a single, fixed application workload. In reality, however, the applications are concurrently executing and contending for system resources in such systems. As a result, the intensity and nature of application demands...

chapter

A fine-grained link-level fault-tolerant mechanism for networks-on-chip

Arseniy Vitkovskiy, Vassos Soteriou, Chrysostomos Nicopoulos

2010 IEEE International Conference on Computer Design > 447 - 454

2010 IEEE International Conference on Computer Design (ICCD 2010)

Silicon technology scaling is continuously enabling denser integration capabilities. However, this comes at the expense of higher variability and susceptibility to wear-out. With an escalating number of on-chip components expected to be defective in near-future chips, modern parallel systems, such as Chip Multi-Processors (CMP), become especially vulnerable to these faults. Just a single link failure...

chapter

Table of contents

2010 IEEE International Conference on Computer Design > 1 - 8

2010 IEEE International Conference on Computer Design (ICCD 2010)

The following topics are dealt with: high performance architecture; synchronous interfaces; cache architecture; cryptography; real-time systems; signal processing; multiprocessor systems; and networks-on-chip.

chapter

Parallel Computational Modelling of Inelastic Neutron Scattering in Multi-node and Multi-core Architectures

M T Garba, Horacio González-Vélez, D L Roach

2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC) > 509 - 514

2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC 2010)

This paper examines the initial parallel implementation of SCATTER, a computationally intensive inelastic neutron scattering routine with polycrystalline averaging capability, for the General Utility Lattice Program (GULP). Of particular importance to structural investigation on the atomic scale, this work identifies the computational features of SCATTER relevant to a parallel implementation and presents...

Data set: ieee
Keywords: COMPUTER ARCHITECTURE; MULTIPROCESSING SYSTEMS

Content availability

Available (555)
None (12)

Publication type

book (471)
article (96)

Keywords

HARDWARE (157)
PROGRAM PROCESSORS (122)
MICROPROCESSOR CHIPS (107)
SYSTEM-ON-CHIP (94)
PARALLEL PROCESSING (85)
FIELD PROGRAMMABLE GATE ARRAYS (83)
COMPUTATIONAL MODELING (80)
PARALLEL ARCHITECTURES (72)
EMBEDDED SYSTEMS (65)
MICROPROCESSORS (62)
MAGNETIC CORES (61)
REGISTERS (60)
SOFTWARE (58)
NETWORK-ON-CHIP (54)
PERFORMANCE EVALUATION (52)
INSTRUCTION SETS (50)
BENCHMARK TESTING (49)
MULTICORE PROCESSING (49)
MULTI-THREADING (48)
SYSTEM-ON-A-CHIP (48)
ALGORITHM DESIGN AND ANALYSIS (42)
BANDWIDTH (42)
PROCESSOR SCHEDULING (40)
FPGA (38)
KERNEL (36)
SCHEDULING (35)
CACHE STORAGE (34)
RECONFIGURABLE ARCHITECTURES (34)
OPTIMIZATION (33)
PARALLEL PROGRAMMING (33)
PROTOCOLS (33)
SYNCHRONIZATION (33)
CLOCKS (32)
DATA MINING (32)
REAL TIME SYSTEMS (32)
MPSOC (31)
SWITCHES (31)
COPROCESSORS (30)
LOGIC DESIGN (29)
PIPELINES (28)
PROGRAMMING (28)
DECODING (27)
DELAY (27)
INTEGRATED CIRCUIT DESIGN (27)
PIPELINE PROCESSING (27)
RESOURCE MANAGEMENT (27)
THROUGHPUT (27)
LIBRARIES (26)
RANDOM ACCESS MEMORY (26)
RESOURCE ALLOCATION (25)
ROUTING (24)
COMPUTERS (23)
MULTICORE ARCHITECTURE (23)
MEMORY ARCHITECTURE (22)
MULTICORE PROCESSOR (22)
OPERATING SYSTEMS (21)
PROGRAM COMPILERS (21)
RUNTIME (21)
YARN (21)
ENERGY CONSUMPTION (20)
MULTI-CORE (20)
MULTICORE ARCHITECTURES (20)
COMPLEXITY THEORY (19)
CRYPTOGRAPHY (19)
DIGITAL SIGNAL PROCESSING (19)
GRAPHICS PROCESSING UNIT (19)
TILES (18)
MULTICORE (17)
MULTICORE PROCESSORS (17)
MULTIPROCESSOR INTERCONNECTION NETWORKS (17)
SPACE EXPLORATION (17)
DESIGN SPACE EXPLORATION (16)
EMBEDDED SYSTEM (16)
HARDWARE-SOFTWARE CODESIGN (16)
MEMORY MANAGEMENT (16)
MULTIPROCESSOR SYSTEM-ON-CHIP (16)
POWER AWARE COMPUTING (16)
POWER DEMAND (16)
TIMING (16)
ACCELERATION (15)
ANALYTICAL MODELS (15)
APPLICATION PROGRAM INTERFACES (15)
CONCURRENT COMPUTING (15)
ENGINES (15)
OPERATING SYSTEMS (COMPUTERS) (15)
POWER CONSUMPTION (15)
SCHEDULES (15)
SERVERS (15)
APPLICATION SOFTWARE (14)
LINUX (14)
PARALLEL ALGORITHMS (14)
PARALLEL MACHINES (14)
PROCESS CONTROL (14)
VLIW (14)
DESIGN METHODOLOGY (13)
ENERGY EFFICIENCY (13)
FAULT TOLERANCE (13)
HIGH PERFORMANCE COMPUTING (13)

INFONA - science communication portal
