2009 12th Euromicro Conference on Digital System Design, Architectures, Methods and Tools

Items from 1 to 18 out of 18 results

chapter

An Effective Replacement Strategy of Cache Memory for an SMT Processor

Y. Ogasawara, H. Nakajo

2009 12th Euromicro Conference on Digital System Design, Architectures, Methods and Tools > 19 - 25

2009 12th EUROMICRO Conference on Digital System Design, Architectures, Methods and Tools (DSD 2009)

An SMT processor is designed to execute multiple threads simultaneously in order to gain higher performance with sharing resources such as ALUs and cache memory among several threads. However, sharing cache memory may cause thread conflict misses which degrades its performance. In this paper, an effective replacement strategy in which conflicts miss ratio among threads is controlled by limiting the...

chapter

Soft Error Tolerant Asynchronous Circuits Based on Dual Redundant Four State Logic

W. Friesenbichler, A. Steininger

2009 12th Euromicro Conference on Digital System Design, Architectures, Methods and Tools > 100 - 107

2009 12th EUROMICRO Conference on Digital System Design, Architectures, Methods and Tools (DSD 2009)

The continuing downscaling of integrated circuits makes modern devices more susceptible to soft errors. This paper investigates the possibility of using Four-State Logic (FSL) to improve the fault tolerance of digital circuits. FSL is a possible implementation of asynchronous Quasi Delay Insensitive (QDI) logic using a more efficient encoding and handshake protocol. The behavior of asynchronous circuits...

chapter

GridRT: A Massively Parallel Architecture for Ray-Tracing Using Uniform Grids

A.S. Nery, N. Nedjah, F. Frana

2009 12th Euromicro Conference on Digital System Design, Architectures, Methods and Tools > 211 - 216

2009 12th EUROMICRO Conference on Digital System Design, Architectures, Methods and Tools (DSD 2009)

In this paper, we propose an architecture, which we call GridRT, capable of dealing with the main features, such as shadows and reflections effects, of Ray Tracing used for rendering three-dimensional scenes. This architecture achieves an efficient overall performance yet using a simple and compact massively parallel design. The design exploits the usage of Xilinx^?? Floating Point Operator IP Core...

chapter

Double-precision Gauss-Jordan Algorithm with Partial Pivoting on FPGAs

R. Duarte, H. Neto, M. Vestias

2009 12th Euromicro Conference on Digital System Design, Architectures, Methods and Tools > 273 - 280

2009 12th EUROMICRO Conference on Digital System Design, Architectures, Methods and Tools (DSD 2009)

This work presents an architecture to compute matrix inversions in a reconfigurable digital system, benefiting from embedded processing elements present in FPGAs, and using double precision floating point representation. The main module of this system is the processing component for the Gauss-Jordan elimination. This component consists of other smaller arithmetic units, organized in pipeline. These...

chapter

Compilation Technique for Loop Overhead Minimization

N. Kroupis, P. Raghavan, M. Jayapala, F. Catthoor, more

2009 12th Euromicro Conference on Digital System Design, Architectures, Methods and Tools > 419 - 426

2009 12th EUROMICRO Conference on Digital System Design, Architectures, Methods and Tools (DSD 2009)

Modern handheld embedded systems operate under stringent power and real-time constraints. These systems run highly data-dominated applications from multimedia and wireless domains. Most of these applications spend significant amount of execution time in nested-loops. In order to reduce the loop control overhead several loop controller architectures have been proposed in the past. In this paper we...

chapter

SIMD Architectural Enhancements to Improve the Performance of the 2D Discrete Wavelet Transform

A. Shahbahrami, B. Juurlink

2009 12th Euromicro Conference on Digital System Design, Architectures, Methods and Tools > 497 - 504

2009 12th EUROMICRO Conference on Digital System Design, Architectures, Methods and Tools (DSD 2009)

The 2D Discrete Wavelet Transform (DWT) is a time-consuming kernel in many multimedia applications such as JPEG2000 and MPEG-4. The 2D DWT consists of horizontal filtering along the rows followed by vertical filtering along the columns. The vertical filtering is easy to vectorize (assuming row-major order), but to vectorize the horizontal filtering many overhead instructions are required. In this...

chapter

Simultaneous Multithreading VLIW DSP Architecture with Dynamic Dispatch Mechanism

Zheng Shen, Hu He, Yihe Sun

2009 12th Euromicro Conference on Digital System Design, Architectures, Methods and Tools > 505 - 512

2009 12th EUROMICRO Conference on Digital System Design, Architectures, Methods and Tools (DSD 2009)

This paper presents a novel simultaneous multithreading (SMT) VLIW DSP architecture with dynamic dispatch mechanism to address the challenge of the underutilization of computing resources in the non-unit assumed latency (NUAL) VLIW DSPs. The SMT technology exploits the unused instruction slots by converting the thread-level parallelism to the instruction-level parallelism, improving the efficiency...

chapter

Iterative Algorithm for Compound Instruction Selection with Register Coalescing

Minwook Ahn, J.M. Youn, Youngkyu Choi, Doosan Cho, more

2009 12th Euromicro Conference on Digital System Design, Architectures, Methods and Tools > 513 - 520

2009 12th EUROMICRO Conference on Digital System Design, Architectures, Methods and Tools (DSD 2009)

A compound instruction, encoding several ALU or memory operations within an instruction word, has been regarded as an efficient way of improving performance. In the compiler for embedded processors, the code generation algorithm for compound instructions has been built by dealing mainly with instruction selection which is a crucial phase of code generation. In this paper, we propose an iterative code...

chapter

An on Chip Network inside a FPGA for Run-Time Reconfigurable Low Latency Grid Communication

J. Strunk, T. Volkmer, W. Rehm, H. Schick

2009 12th Euromicro Conference on Digital System Design, Architectures, Methods and Tools > 539 - 546

2009 12th EUROMICRO Conference on Digital System Design, Architectures, Methods and Tools (DSD 2009)

In this paper a low latency, on chip communication network (NoC) for a run-time reconfigurable (RTR) grid inside dynamically and partially reconfigurable (DPR) FPGAs is proposed, which supports the arbitrary placement of run-time reconfigurable modules (RTRM) inside the grid. The dedicated, fully meshed, silicon network should support the arrangement of communication channels between the RTRMs within...

chapter

A Synthesisable Quasi-Delay Insensitive Result Forwarding Unit for an Asynchronous Processor

L.A. Tarazona, D.A. Edwards, L.A. Plana

2009 12th Euromicro Conference on Digital System Design, Architectures, Methods and Tools > 627 - 634

2009 12th EUROMICRO Conference on Digital System Design, Architectures, Methods and Tools (DSD 2009)

The implementation of an efficient result forwarding unit for asynchronous processors faces the problem of the inherent lack of synchronisation between result producer and consumer units. An efficient, full-custom solution to this problem has been proposed and implemented before (in the AMULET3 asynchronous processor) with the consequent limitations on design-space exploration and technology portability...

chapter

An Efficient Low-Complexity Alternative to the ROB for Out-of-Order Retirement of Instructions

S. Petit, R. Ubal, J. Sahuquillo, P. Lopez, more

2009 12th Euromicro Conference on Digital System Design, Architectures, Methods and Tools > 635 - 642

2009 12th EUROMICRO Conference on Digital System Design, Architectures, Methods and Tools (DSD 2009)

Current superscalar processors use a reorder buffer (ROB) to support speculation, precise exceptions, and register reclamation. Instructions are retired from this structure in program order, which may lead to significant performance degradation if a long latency operation blocks the ROB head. In this paper, a checkpoint-free out-of-order commit architecture is proposed, which replaces the ROB with...

chapter

Energy and Performance Model of a SPARC Leon3 Processor

S. Penolazzi, L. Bolognino, A. Hemani

2009 12th Euromicro Conference on Digital System Design, Architectures, Methods and Tools > 651 - 656

2009 12th EUROMICRO Conference on Digital System Design, Architectures, Methods and Tools (DSD 2009)

We present a general methodology to implement a processor energy model, based on instruction-level characterization, and we apply it to a SPARC-based Leon3 processor. The model is characterized by simulating back-annotated gate-level netlist and has two levels of accuracy: a coarse-grain estimation based on characterizing each single instruction and a fine-grain estimation accounting for the impact...

chapter

Acceleration of MELP Algorithm Using DSP Coprocessor with Extended Registers

Lu Gao, Li Guo, Canxing Lu

2009 12th Euromicro Conference on Digital System Design, Architectures, Methods and Tools > 659 - 666

2009 12th EUROMICRO Conference on Digital System Design, Architectures, Methods and Tools (DSD 2009)

Configurable coprocessors have been an active area for some time. The limitation of word length of instruction set and the number of operands in a single instruction have become a potential performance bottleneck for traditional SIMD extension. In this paper, we use LEON-2 as the host platform and present a novel low-cost architecture with extended shadow_f registers. In each extended instruction,...

chapter

xMAML: A Modeling Language for Dynamically Reconfigurable Architectures

J. Lallet, S. Pillement, O. Sentieys

2009 12th Euromicro Conference on Digital System Design, Architectures, Methods and Tools > 680 - 687

2009 12th EUROMICRO Conference on Digital System Design, Architectures, Methods and Tools (DSD 2009)

Constant evolution of norms and applications, usually implemented on system-on-chip (SOC), increases architecture performance and flexibility requirements. Current architectures are consequently becoming more complex and difficult to develop. One of the solutions is to develop design frameworks based on high-level architecture description languages (ADL). These ADLs are useful for a rapid description...

chapter

A Reconfigurable Frame Interpolation Hardware Architecture for High Definition Video

O. Tasdizen, I. Hamzaoglu

2009 12th Euromicro Conference on Digital System Design, Architectures, Methods and Tools > 714 - 719

2009 12th EUROMICRO Conference on Digital System Design, Architectures, Methods and Tools (DSD 2009)

Since Frame Rate Up-Conversion (FRC) is started to be used in recent consumer electronics products like High Definition TV, real-time and low cost implementation of FRC algorithms has become very important. Therefore, in this paper, we propose a low cost hardware architecture for realtime implementation of frame interpolation algorithms. The proposed hardware architecture is reconfigurable and it...

chapter

Representation of Incompletely Specified Index Generation Functions Using Minimal Number of Compound Variables

T. Sasao, T. Nakamura, M. Matsuura

2009 12th Euromicro Conference on Digital System Design, Architectures, Methods and Tools > 765 - 772

2009 12th EUROMICRO Conference on Digital System Design, Architectures, Methods and Tools (DSD 2009)

This paper shows a method to reduce the number of input variables to represent incompletely specified index generation functions. A compound variable is generated by EXORing the original input variables. By using both original and compound variables, incompletely specified index generation functions can be represented by fewer variables. As a means to select variables, a heuristic method using information...

chapter

FPGA Implementations of SHA-3 Candidates: CubeHash, Grøstl, LANE, Shabal and Spectral Hash

B. Baldwin, A. Byrne, M. Hamilton, N. Hanley, more

2009 12th Euromicro Conference on Digital System Design, Architectures, Methods and Tools > 783 - 790

2009 12th EUROMICRO Conference on Digital System Design, Architectures, Methods and Tools (DSD 2009)

Hash functions are widely used in, and form an important part of many cryptographic protocols. Currently, a public competition is underway to find a new hash algorithm(s) for inclusion in the NIST Secure Hash Standard (SHA-3). Computational efficiency of the algorithms in hardware will form one of the evaluation criteria. In this paper, we focus on five of these candidate algorithms, namely CubeHash,...

chapter

Low-Power Low-Energy Prime-Field ECC Processor Based on Montgomery Modular Inverse Algorithm

H.R. Ahmadi, A. Afzali-Kusha

2009 12th Euromicro Conference on Digital System Design, Architectures, Methods and Tools > 817 - 822

2009 12th EUROMICRO Conference on Digital System Design, Architectures, Methods and Tools (DSD 2009)

In this paper, we present a fast low-power low-energy standard public-key cryptography processor for use in power/energy-limited applications. The proposed prime-field elliptic-curve cryptography hardware uses a modified Montgomery modular inverse algorithm to minimize the total calculation time and is completely flexible in terms of the field and curve parameters. The power consumption is minimized...

Filter options

Keywords:
REGISTERS

Publication date

Set your own date range

Keywords

DATA MINING (9)
COMPUTER ARCHITECTURE (7)
HARDWARE (7)
CLOCKS (5)
FIELD PROGRAMMABLE GATE ARRAYS (5)
FPGA (5)
ENCODING (4)
MICROPROCESSOR CHIPS (4)
ALGORITHM DESIGN AND ANALYSIS (3)
EMBEDDED SYSTEMS (3)
FLOATING POINT ARITHMETIC (3)
INSTRUCTION SETS (3)
PIPELINES (3)
RECONFIGURABLE ARCHITECTURES (3)
ASYNCHRONOUS CIRCUITS (2)
COMPOUNDS (2)
COPROCESSORS (2)
DELAY (2)
DIGITAL SIGNAL PROCESSING (2)
DIGITAL SIGNAL PROCESSING CHIPS (2)
GRID COMPUTING (2)
LOGIC GATES (2)
MULTI-THREADING (2)
PARALLEL ARCHITECTURES (2)
PARALLEL PROCESSING (2)
PIXEL (2)
PROBABILITY DENSITY FUNCTION (2)
PROTOCOLS (2)
SYNCHRONIZATION (2)
SYSTEM-ON-CHIP (2)
VIRTEX-5 (2)
192-BIT SCALAR MULTIPLICATION (1)
2D DISCRETE WAVELET TRANSFORM (1)
32-BIT SINGLE-PRECISION FLOATING-POINT MULTIPLICATIONS (1)
ACCELERATION (1)
ACCURACY (1)
ADAPTIVE SELECTION (1)
ADL (1)
ALU (1)
ALU ENCODING (1)
ARCHITECTURE (1)
ARITHMETIC UNITS (1)
ASYNCHRONOUS DESIGN (1)
ASYNCHRONOUS LOGIC (1)
ASYNCHRONOUS PROCESSOR (1)
ASYNCHRONOUS QUASIDELAY INSENSITIVE LOGIC (1)
AUTOMATIC SYNTHESIS (1)
BACK-ANNOTATED GATE-LEVEL NETLIST (1)
BAND PASS FILTERS (1)
BI-COMPOUND VARIABLES (1)
BUFFER CIRCUITS (1)
CACHE MEMORY (1)
CACHE MISS LATENCY (1)
CACHE STORAGE (1)
CHECKPOINT-FREE OUT-OF-ORDER COMMIT ARCHITECTURE (1)
CIRCUIT FAULTS (1)
CMOS INTEGRATED CIRCUITS (1)
CMOS TECHNOLOGY (1)
COARSE-GRAIN ESTIMATION (1)
CODE DIVISION MULTIPLE ACCESS (1)
COMPILATION TECHNIQUE (1)
COMPILER (1)
COMPLEXITY THEORY (1)
COMPOUND INSTRUCTION (1)
COMPOUND INSTRUCTION SELECTION (1)
COMPOUND VARIABLES (1)
COMPUTATIONAL MODELING (1)
COMPUTER GRAPHICS (1)
COMPUTING RESOURCE UNDERUTILIZATION (1)
CONFIGURABLE COPROCESSORS (1)
CONTEXT (1)
COPROCESSOR (1)
CORRELATION (1)
CRYPTO-PROCESSOR (1)
CRYPTOGRAPHIC PROTOCOLS (1)
CUBEHASH ALGORITHM (1)
DATA FLOW GRAPH (1)
DATA FLOW GRAPHS (1)
DATA MODELS (1)
DATA REARRANGEMENT INSTRUCTIONS (1)
DATA REGISTERS (1)
DATA SWITCHING ACTIVITY (1)
DATA TRANSFER (1)
DAUB-4 TRANSFORM (1)
DECODING (1)
DEDICATED SILICON NETWORK (1)
DESIGN SPACE EXPLORATION (1)
DETRIMENTAL IMPACT (1)
DIGITAL ARITHMETIC (1)
DIGITAL CIRCUITS (1)
DIGITAL SIGNAL PROCESSOR (1)
DISCRETE WAVELET TRANSFORMS (1)
DIVISION OPERATIONS (1)
DOUBLE PRECISION FLOATING POINT REPRESENTATION (1)
DOUBLE-PRECISION GAUSS-JORDAN ALGORITHM (1)
DSP (1)
DSP COPROCESSOR (1)
DUAL REDUNDANCY (1)
DUAL REDUNDANT FOUR STATE LOGIC (1)
more

INFONA - science communication portal

2009 12th Euromicro Conference on Digital System Design, Architectures, Methods and Tools $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2009 12th Euromicro Conference on Digital System Design, Architectures, Methods and Tools