Search results

Items from 1 to 9 out of 9 results

chapter

Communication on the Fly for Hierarchical Systems of Chip Multi-processors

M Tudruj, L Masko

2011 Sixth International Symposium on Parallel Computing in Electrical Engineering > 19 - 24

2011 6th International Symposium on Parallel Computing in Electrical Engineering (PARELEC 2011)

Systems based on many Chip Multi-Processor (CMP) modules interconnected by global networks constitute now a feasible solution, which brings back to life challenges of massively parallel systems. The paper presents new methods for data communication inside CMP modules and for inter-CMP-module data communication. Inside CMP modules data communication through shared variables is improved by the use of...

chapter

Design and implementation of a NoC supporting priority-based communications for many-core SoCs

Kuei-Chung Chang, Ing-Ming Liao, Bo-Yi Shiu

2010 International Computer Symposium (ICS2010) > 483 - 488

2010 International Computer Symposium (ICS 2010)

As technology scaling enables the integration of billions of transistors on a chip, economies of scale are prompting the move toward parallel chip architectures with application-specific systems-on-a-chip (SoC) leveraging multiple cores on a single chip for better performance at manageable design costs. The demand for communicating capability of many-core SoC will definitely increase because the traffic...

chapter

Inter-socket victim cacheing for platform power reduction

Subhra Mazumdar, Dean M Tullsen, Justin Song

2010 IEEE International Conference on Computer Design > 509 - 514

2010 IEEE International Conference on Computer Design (ICCD 2010)

On a multi-socket architecture with load below peak, as is often the case in a server installation, it is common to consolidate load onto fewer sockets to save processor power. However, this can increase main memory power consumption due to the decreased total cache space. This paper describes inter-socket victim cacheing, a technique that enables such a system to do both load consolidation and cache...

chapter

The POWER7™ processor SoC

Dieter Wendel, Ronald Kalla, Joshua Friedrich, James Kahle, et al.

2010 IEEE International Conference on Integrated Circuit Design and Technology > 71 - 73

2010 IEEE International Conference on IC Design & Technology (ICICDT)

Introducing POWER7™ the latest member of the IBM POWER™ processor family. A 567 mm² chip implemented in 45nm SOI technology, holding eight quad threaded cores, a 32MB shared eDRAM L3, two memory controllers and high bandwidth SMP interfaces. The new out of order, shallow pipeline core with 12 execution units, multiport L1 caches and a private 256 kB L2 offers the efficiency to support 4× the number...

chapter

Pthreads Performance Characteristics on Shared Cache CMP, Private Cache CMP and SMP

Ian K T Tan, Ian Chai, Poo Kuan Hoong

2010 Second International Conference on Computer Engineering and Applications > 1 > 186 - 191

2010 Second International Conference on Computer Engineering and Applications (ICCEA 2010)

With the wide availability of chip multi-processing (CMP), software developers are now facing the task of effectively parallelizing their software code. Once they have identified the areas of parallelization, they will need to know the level of code granularity needed to ensure profitable execution. Furthermore, this problem multiplies itself with different hardware available. In this paper, we present...

chapter

Stacking SRAM banks for ultra low power standby mode operation

Adam C Cabe, Zhenyu Qi, Mircea R Stan

Design Automation Conference > 699 - 704

2010 47th ACM/EDAC/IEEE Design Automation Conference (DAC 2010)

On-chip SRAM caches have come to dominate the total chip area and leakage power consumed in state-of-the-art microprocessor designs. Such large memories are necessary to attain high performance, however it is critical to minimize the idle currents drawn while these SRAM banks are inactive. This work proposes a novel voltage reduction technique to reduce SRAM leakage power during the standby mode....

chapter

An Enhanced HyperTransport Controller with Cache Coherence Support for Multiple-CMP

Huandong Wang, Dan Tang, Xiang Gao, Yunji Chen

2009 IEEE International Conference on Networking, Architecture, and Storage > 215 - 218

2009 IEEE International Conference on Networking, Architecture, and Storage (NAS)

HyperTransport link is a high performance IO interface for system connection. In this paper, the architecture of a HyperTransport interface is introduced. This HyperTransport interface realizes efficient HT-AXI bidirectional transformation, where AXI is a popular bus protocol in SOC architectures. Furthermore, this HyperTransport interface provides dedicated hardware support for cache coherence protocol...

chapter

A Power-Scalable Switch-Based Multi-processor FFT

B.J. Mohd, E.E. Swartzlander

2009 20th IEEE International Conference on Application-specific Systems, Architectures and Processors > 114 - 120

2009 20th IEEE International Conference on Application-specific Systems, Architectures and Processors

This paper examines the architecture, algorithm and implementation of a switch-based multi-processor realization of the fast Fourier transform (FFT). The architecture employs M processing elements (PEs), and provides a speedup of M compared with systems that use a single PE. An algorithm is provided to detect and resolve memory conflicts. A CMOS implementation of a four-PE processor is presented....

chapter

A Quantitative Study of the On-Chip Network and Memory Hierarchy Design for Many-Core Processor

Xu Wang, Ge Gan, J. Manzano, Dongrui Fan, et al.

2008 14th IEEE International Conference on Parallel and Distributed Systems > 689 - 696

2008 14th IEEE International Conference on Parallel and Distributed Systems

In this paper, we will study the on-chip network and memory hierarchy design of the Godson-T - a homogeneous many-core processor. Godson-T has 64 cores (with private L1 cache), and 16 global L2 cache banks. All these on-chip units are connected by a 2D 8 × 8 mesh network. Our study reveals that: (a) Global on-chip L2 cache can effectively alleviate the memory pressure caused by the data-thirsty on-chip...

