Embedded Computer Systems (SAMOS), 2011 International Conference on

chapter

Dedicated hardware accelerators for the epistatic analysis of human genetic data

Fabio Cancare, Alessandro Marin, Donatella Sciuto

2011 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation > 102 - 109

2011 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS XI)

The recent advances in genomic microarrays design provide the possibility to retrieve hundreds of thousands of significative genetic features from patients at affordable costs. Understanding if non-linear interactions (epistatic relationships) between these features determine or not the arising of complex common multifactorial genetic diseases is a critical task for human geneticists. The algorithms...

chapter

Vector processor customization for FFT

Bogdan Spinean, Georgi Kuzmanov, Georgi Gaydadjiev

2011 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation > 110 - 117

2011 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS XI)

Processors and memory systems suffer from a growing performance gap between them. Each technology generation increases the on-chip performance capabilities however, memory bandwidth increases at a much slower pace. Therefore, overall performance improvements are constrained by the available memory bandwidth. In this paper, we address the memory bandwidth problem of vector processors by introducing...

chapter

FPGA based application specific processing for sensor nodes

Teemu Nylanden, Janne Janhunen, Jari Hannuksela, Olli Silven

2011 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation > 118 - 123

2011 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS XI)

Energy efficient sensor nodes are among the rapidly expanding applications for embedded systems technology. Typically, the processing resources in sensor nodes are based on programmable micro-controllers and digital signal processors, and the same processing architecture is used regardless of the actual task of the node. This regularly results in at least an order of magnitude over-provisioning of...

chapter

Parametrized hardware architectures for the Lucas primality test

Adrien Le Masle, Wayne Luk, Csaba Andras Moritz

2011 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation > 124 - 131

2011 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS XI)

We present our parametric hardware architecture of the NIST approved Lucas probabilistic primality test. To our knowledge, our work is the first hardware architecture for the Lucas test. Our main contributions are a hardware architecture for calculating the Jacobi symbol based on the binary Jacobi algorithm, a pipelined modular add-shift module for calculating the Lucas sequences, methods for dependence...

chapter

Distributed resource management for concurrent execution of multimedia applications on MPSoC platforms

Ahsan Shabbir, Akash Kumar, Bart Mesman, Henk Corporaal

2011 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation > 132 - 139

2011 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS XI)

The last decade a trend can be observed towards multi-processor Systems-on-Chip (MPSoC) platforms for satisfying the high computational requirements of modern multimedia applications. The research community has mainly focused on communication issues (e.g. bus vs. networks-on-chip). Real-time operating systems for MPSoCs however, have gotten very little attention. Existing techniques like rate-monotonic...

chapter

High level quantitative hardware prediction modeling using statistical methods

Roel Meeuws, Carlo Galuzzi, Koen Bertels

2011 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation > 140 - 149

2011 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS XI)

With the increasing proliferation of heterogeneous and reconfigurable computing, it has become essential to have efficient prediction models to drive early HW-SW partitioning and co-design. In this paper, we present a high level quantitative prediction modeling approach that accurately models the relation between hardware and software metrics, based on several statistical techniques. The proposed...

chapter

Removal of unnecessary context switches from the systemc simulation kernel for fast VP simulation

Kun Lu, Daniel Muller-Gritschneder, Ulf Schlichtmann

2011 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation > 150 - 156

2011 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS XI)

Virtual prototypes are widely employed in today's development of embedded hardware and software. To model and simulate the VPs, SystemC has been adopted as a standard language tool. With SystemC, hardware modules and software codes can be modeled as processes. To model concurrency, one process can be suspended and then the SystemC scheduler selects the next process to resume. This is also known as...

chapter

A novel ADL-based compiler-centric software framework for reconfigurable mixed-ISA processors

Timo Stripf, Ralf Koenig, Juergen Becker

2011 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation > 157 - 164

2011 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS XI)

Reconfigurable processor architectures can dynamically switch their instruction set and instruction format at run time. They offer a new flexibility for adapting to changing applications' requirements in order to optimize performance and enable resource-awareness. While programmability is a key issue of such architectures, today's software toolchains are limited to static ISA architectures and must...

chapter

ADL-based specification of implementation styles for functional simulators

David A. Penry, Kurtis D. Cahill

2011 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation > 165 - 173

2011 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS XI)

Functional simulators find widespread use as sub-systems within microarchitectural simulators. The speed of functional simulators is strongly influenced by the implementation style of the functional simulator, e.g. interpreted vs. binary-translated simulation. Speed is also strongly influenced by the level of detail of the interface the functional simulator presents to the rest of the timing simulator...

chapter

A performance estimation flow for embedded systems with mixed software/hardware modeling

Joffrey Kriegel, Alain Pegatoquet, Michel Auguin, Florian Broekaert

2011 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation > 174 - 181

2011 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS XI)

This paper introduces an Y-chart methodology for performance estimation based on high level models for both application and architecture. As embedded devices are more and more complex, the choice of the best suited architecture not only in terms of processing power but also in power consumption becomes a tedious task. In this context, estimation tools are key components in architecture choice methodology...

chapter

Calibration and validation of software performance models for pedestrian detection systems

Rainer Kiesel, Martin Streubuhr, Christian Haubelt, Otto Lohlein, more

2011 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation > 182 - 189

2011 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS XI)

In recent years, road vehicles have seen a tremendous increase on driver assistance systems like lane departure warning, traffic sign recognition, or pedestrian detection. The development of efficient and cost-effective electronic control units that meet the necessary real-time performance for these systems is a complex challenge. Often, Electronic System-Level design tackles the challenge by simulation-based...

chapter

Scalable multi-core simulation using parallel dynamic binary translation

Oscar Almer, Igor Bohm, Tobias Edler von Koch, Bjorn Franke, more

2011 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation > 190 - 199

2011 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS XI)

In recent years multi-core processors have seen broad adoption in application domains ranging from embedded systems through general-purpose computing to large-scale data centres. Simulation technology for multi-core systems, however, lags behind and does not provide the simulation speed required to effectively support design space exploration and parallel software development. While state-of-the-art...

chapter

Fully-automatic derivation of exact program-flow constraints for a tighter worst-case execution-time analysis

Amine Marref

2011 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation > 200 - 208

2011 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS XI)

Obtaining tight worst-case execution-time (WCET) estimations of real-time tasks is crucial since overly-pessimistic estimations are deemed impractical. One way of making WCET estimations tighter is to incorporate more program-flow information e.g., context-sensitive loop bounds, infeasible-path and same-path information, etc. In this paper we present and evaluate a completely automatic analysis that...

chapter

A hardware accelerated configurable ASIP architecture for embedded real-time video-based driver assistance applications

Gregor Schewior, Holger Flatt, Carsten Dolar, Christian Banz, more

2011 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation > 209 - 216

2011 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS XI)

In this paper, a flexible HW architecture for video-based driver assistance applications is presented. It comprises a customizable and extensible processor template and several task-specific HW accelerators. The proposed heterogeneous architecture allows utilization of the programmable processor core for control and low data rate tasks. For the acceleration of computationally intensive tasks of the...

chapter

Task-based parallel H.264 video encoding for explicit communication architectures

Michail Alvanos, George Tzenakis, Dimitrios S. Nikolopoulos, Angelos Bilas

2011 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation > 217 - 224

2011 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS XI)

Future multi-core processors will necessitate exploitation of fine-grain, architecture-independent parallelism from applications to utilize many cores with relatively small local memories. We use c264, an end-to-end H.264 video encoder for the Cell processor based on ×264, to show that exploiting fine-grain parallelism remains challenging and requires significant advancement in runtime support. Our...

chapter

High throughput and scalable architecture for unified transform coding in embedded H.264/AVC video coding systems

Tiago Dias, Sebastian Lopez, Nuno Roma, Leonel Sousa

2011 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation > 225 - 232

2011 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS XI)

An innovative high throughput and scalable multi-transform architecture for H.264/AVC is presented in this paper. This structure can be used as a hardware accelerator in modern embedded systems to efficiently compute the 4×4 forward/inverse integer DCT, as well as the 2-D 4×4 / 2×2 Hadamard transforms. Moreover, its highly flexible design and hardware efficiency allows it to be easily scaled in terms...

chapter

Scalable ASIP implementation and parallelization of a MIMO sphere detector

Esther P. Adeva, Bjorn Mennenga, Gerhard Fettweis

2011 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation > 233 - 241

2011 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS XI)

High detection complexity is known to be one of the major challenges in MIMO communications based on spatial multiplexing. Tuple Search Detector (TSD) was recently introduced, significantly reducing detection complexity in comparison to conventional algorithms while achieving close to full max-log-APP BER performance. Besides high computational complexity, irregular control flow and sequential nature...

chapter

Using SDRAMs for two-dimensional accesses of long 2ⁿ × 2^m-point FFTs and transposing

Stefan Langemeyer, Peter Pirsch, Holger Blume

2011 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation > 242 - 248

2011 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS XI)

When transposing large matrices using SDRAM memories, typically a control overhead significantly reduces the data throughput. In this paper, a new address mapping scheme is introduced, taking advantage of multiple banks and burst capabilities of modern SDRAMs. Other address mapping strategies minimize the total number of SDRAM page-opens while traversing the two-dimensional index-space in row or column...

chapter

On-chip network resource management design and validation

Francesco Bruschi, Antonio Miele, Vincenzo Rana

2011 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation > 249 - 254

2011 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS XI)

Designing interconnection networks for systems on-a-chip is getting more complex due to the increasing number and heterogeneity of elements they connect, the variety of technologies adopted to transmit and route information, the performance and cost requirements and constraints they have to satisfy. The complexity of such transmission fabrics gets then closer to that of telecommunication networks...

chapter

Breaking the bandwidth wall in chip multiprocessors

Augusto Vega, Felipe Cabarcas, Alex Ramirez, Mateo Valero

2011 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation > 255 - 262

2011 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS XI)

In throughput-aware CMPs like GPUs and DSPs, software-managed streaming memory systems are an effective way to tolerate high latencies. E.g., the Cell/B.E. incorporates local memories, and data transfers to/from those memories are overlapped with computation using DMAs. In such designs, the latency of the memory system has little impact on performance; instead, memory bandwidth becomes critical. With...

INFONA - science communication portal

2011 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation

Dedicated hardware accelerators for the epistatic analysis of human genetic data

Vector processor customization for FFT

FPGA based application specific processing for sensor nodes

Parametrized hardware architectures for the Lucas primality test

Distributed resource management for concurrent execution of multimedia applications on MPSoC platforms

High level quantitative hardware prediction modeling using statistical methods

Removal of unnecessary context switches from the systemc simulation kernel for fast VP simulation

A novel ADL-based compiler-centric software framework for reconfigurable mixed-ISA processors

ADL-based specification of implementation styles for functional simulators

A performance estimation flow for embedded systems with mixed software/hardware modeling

Calibration and validation of software performance models for pedestrian detection systems

Scalable multi-core simulation using parallel dynamic binary translation

Fully-automatic derivation of exact program-flow constraints for a tighter worst-case execution-time analysis

A hardware accelerated configurable ASIP architecture for embedded real-time video-based driver assistance applications

Task-based parallel H.264 video encoding for explicit communication architectures

High throughput and scalable architecture for unified transform coding in embedded H.264/AVC video coding systems

Scalable ASIP implementation and parallelization of a MIMO sphere detector

Using SDRAMs for two-dimensional accesses of long 2ⁿ × 2^m-point FFTs and transposing

On-chip network resource management design and validation

Breaking the bandwidth wall in chip multiprocessors

Filter options

Publication date

Keywords

INFONA - science communication portal

2011 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

2011 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation