Search results for: Luca Benini

Items from 1 to 11 out of 11 results

chapter

Lightweight virtual memory support for many-core accelerators in heterogeneous embedded SoCs

Pirmin Vogel, Andrea Marongiu, Luca Benini

2015 International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS) > 45 - 54

2015 International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS)

While high-end heterogeneous systems are increasingly supporting heterogeneous uniform memory access (hUMA) as envisioned by the Heterogeneous System Architecture (HSA) foundation, their low-power counterparts targeting the embedded domain still lack basic features like virtual memory support for accelerators. As opposed to simply passing virtual address pointers, explicit data management involving...

chapter

ANTAREX -- AutoTuning and Adaptivity appRoach for Energy Efficient eXascale HPC Systems

Cristina Silvano, Giovanni Agosta, Andrea Bartolini, Andrea Beccari, more

2015 IEEE 18th International Conference on Computational Science and Engineering > 343 - 346

2015 IEEE 18th International Conference on Computational Science and Engineering (CSE)

The main goal of the ANTAREX project is to express by a Domain Specific Language (DSL) the application self-adaptivity and to runtime manage and autotune applications for green and heterogeneous High Performance Computing (HPC) systems up to the Exascale level. Key innovations of the project include the introduction of a separation of concerns between self-adaptivity strategies and application functionalities...

chapter

A HLS-Based Toolflow to Design Next-Generation Heterogeneous Many-Core Platforms with Shared Memory

Paolo Burgio, Andrea Marongiu, Philippe Coussy, Luca Benini

2014 12th IEEE International Conference on Embedded and Ubiquitous Computing > 130 - 137

2014 12th IEEE International Conference on Embedded and Ubiquitous Computing (EUC)

This work describes how we use High-Level Synthesis to support design space exploration (DSE) of heterogeneous many-core systems. Modern embedded systems increasingly couple hardware accelerators and processing cores on the same chip, to trade specialization of the platform to an application domain for increased performance and energy efficiency. However, the process of designing such a platform is...

chapter

Tightly-coupled hardware support to dynamic parallelism acceleration in embedded shared memory clusters

Paolo Burgio, Giuseppe Tagliavini, Francesco Conti, Andrea Marongiu, more

2014 Design, Automation & Test in Europe Conference & Exhibition (DATE) > 1 - 6

2014 Design, Automation & Test in Europe Conference & Exhibition (DATE)

Modern designs for embedded systems are increasingly embracing cluster-based architectures, where small sets of cores communicate through tightly-coupled shared memory banks and high-performance interconnections. At the same time, the complexity of modern applications requires new programming abstractions to exploit dynamic and/or irregular parallelism on such platforms. Supporting dynamic parallelism...

chapter

A tightly-coupled hardware controller to improve scalability and programmability of shared-memory heterogeneous clusters

Paolo Burgio, Robin Danilo, Andrea Marongiu, Philippe Coussy, more

2014 Design, Automation & Test in Europe Conference & Exhibition (DATE) > 1 - 4

2014 Design, Automation & Test in Europe Conference & Exhibition (DATE)

Modern designs for embedded many-core systems increasingly include application-specific units to accelerate key computational kernels with orders-of-magnitude higher execution speed and energy efficiency compared to software counterparts. A promising architectural template is based on heterogeneous clusters, where simple RISC cores and specialized HW units (HWPU) communicate in a tightly-coupled manner...

chapter

OpenMP-based Synergistic Parallelization and HW Acceleration for On-Chip Shared-Memory Clusters

Paolo Burgio, Andrea Marongiu, Dominique Heller, Cyrille Chavet, more

2012 15th Euromicro Conference on Digital System Design > 751 - 758

2012 15th Euromicro Conference on Digital System Design (DSD)

Modern embedded MPSoC designs increasingly couple hardware accelerators to processing cores to trade between energy efficiency and platform specialization. To assist effective design of such systems there is the need on one hand for clear methodologies to streamline accelerator definition and instantiation, on the other for architectural templates and run-time techniques that minimize processors-to-accelerator...

chapter

Fast and lightweight support for nested parallelism on cluster-based embedded many-cores

Andrea Marongiu, Paolo Burgio, Luca Benini

2012 Design, Automation & Test in Europe Conference & Exhibition (DATE) > 105 - 110

2012 Design, Automation & Test in Europe Conference & Exhibition (DATE 2012)

Several recent many-core accelerators have been architected as fabrics of tightly-coupled shared memory clusters. A hierarchical interconnection system is used - with a crossbar-like medium inside each cluster and a network-on-chip (NoC) at the global level - which makes memory operations non-uniform (NUMA). Nested parallelism represents a powerful programming abstraction for these architectures, where...

chapter

Platform 2012, a many-core computing accelerator for embedded SoCs: Performance evaluation of visual analytics applications

Diego Melpignano, Luca Benini, Eric Flamand, Bruno Jego, more

DAC Design Automation Conference 2012 > 1137 - 1142

2012 49th ACM/EDAC/IEEE Design Automation Conference (DAC)

P2012 is an area- and power-efficient many-core computing accelerator based on multiple globally asynchronous, locally synchronous processor clusters. Each cluster features up to 16 processors with independent instruction streams sharing a multi-banked one-cycle access L1 data memory, a multi-channel DMA engine and specialized hardware for synchronization and aggressive power management. P2012 is...

chapter

P2012: Building an ecosystem for a scalable, modular and high-efficiency embedded computing accelerator

Luca Benini, Eric Flamand, Didier Fuin, Diego Melpignano

2012 Design, Automation & Test in Europe Conference & Exhibition (DATE) > 983 - 987

2012 Design, Automation & Test in Europe Conference & Exhibition (DATE 2012)

P2012 is an area- and power-efficient many-core computing fabric based on multiple globally asynchronous, locally synchronous (GALS) clusters supporting aggressive fine-grained power, reliability and variability management. Clusters feature up to 16 processors and one control processor with independent instruction streams sharing a multi-banked L1 data memory, a multi-channel DMA engine, and specialized...

chapter

Efficient OpenMP data mapping for multicore platforms with vertically stacked memory

Andrea Marongiu, Martino Ruggiero, Luca Benini

2010 Design, Automation & Test in Europe Conference & Exhibition (DATE 2010) > 105 - 110

2010 Design, Automation & Test in Europe Conference & Exhibition (DATE 2010)

Emerging TSV-based 3D integration technologies have shown great promise to overcome scalability limitations in 2D designs by stacking multiple memory dies on top of a many-core die. Application software developers need programming models and tools to fully exploit the potential of vertically stacked memory. In this work, we focus on efficient data mapping for SPMD parallel applications on an explicitly...

chapter

An efficient and complete approach for throughput-maximal SDF allocation and scheduling on multi-core platforms

Alessio Bonfietti, Luca Benini, Michele Lombardi, Michela Milano

2010 Design, Automation & Test in Europe Conference & Exhibition (DATE 2010) > 897 - 902

2010 Design, Automation & Test in Europe Conference & Exhibition (DATE 2010)

Our work focuses on allocating and scheduling a synchronous data-flow (SDF) graph onto a multi-core platform subject to a minimum throughput requirement. This problem has traditionally been tackled by incomplete approaches based on problem decomposition and local search, which could not guarantee optimality. Exact algorithms used to be considered reasonable only for small problem instances. We propose...

Filter options

Keywords:
PROGRAMMING
Publication type:
book

Keywords

HARDWARE (7)
COMPUTER ARCHITECTURE (5)
PROGRAM PROCESSORS (5)
ACCELERATION (4)
PARALLEL PROCESSING (3)
ARRAYS (2)
FABRICS (2)
KERNEL (2)
OPENMP (2)
RANDOM ACCESS MEMORY (2)
SOFTWARE (2)
SYNCHRONIZATION (2)
SYSTEM-ON-A-CHIP (2)
3D STACKED MEMORY HIERARCHY (1)
3D STACKING (1)
ADAPTIVITY (1)
AUTOTUNING (1)
CLUSTERED ARCHITECTURES (1)
COMPLEXITY THEORY (1)
COMPUTATIONAL MODELING (1)
COMPUTER VISION (1)
CONSTRAINT HANDLING (1)
CONSTRAINT PROGRAMMING (1)
DATA FLOW GRAPHS (1)
DATA HANDLING (1)
DESIGN FLOW (1)
DESIGN SPACE EXPLORATION (1)
DSL (1)
DYNAMIC SCHEDULING (1)
EMBEDDED SYSTEMS (1)
FEATURE EXTRACTION (1)
HETEROGENEOUS ARCHITECTURES (1)
HETEROGENEOUS EMBEDDED SYSTEMS ON CHIP (1)
HIGH PERFORMANCE COMPUTING (1)
HLS (1)
HW ACCELERATION (1)
INSTRUCTION SETS (1)
INTEGRATED CIRCUIT INTERCONNECTIONS (1)
LOW-POWER (1)
MANY-CORE (1)
MANY-CORE SYSTEMS (1)
MEMORY MANAGEMENT (1)
MICROPROCESSOR CHIPS (1)
MONITORING (1)
MPSOCS (1)
MULTICORE PLATFORM (1)
MULTICORE PLATFORMS SCHEDULING (1)
MULTIPLE VERTICAL MEMORY STACK (1)
MULTIPROCESSING SYSTEMS (1)
NONTRIVIAL INSTANCES (1)
OPENMP DATA MAPPING (1)
OPTIMAL SCHEDULING (1)
OPTIMISATION (1)
OPTIMIZATION (1)
PORTS (COMPUTERS) (1)
PREFETCHING (1)
PROBLEM DECOMPOSITION (1)
PROCESS AWARE (1)
PROCESSOR SCHEDULING (1)
PROGRAMMABLE MANY-CORE ACCELERATORS (1)
PROGRAMMING FRAMEWORK (1)
REGISTERS (1)
RESOURCE MANAGEMENT (1)
RUNTIME (1)
SDF ALLOCATION (1)
SHARED MEMORY CLUSTERED ARCHITECTURES (1)
SHARED-MEMORY SYSTEMS (1)
SOC (1)
SPMD PARALLEL APPLICATION (1)
STORAGE MANAGEMENT (1)
SUPERCOMPUTERS (1)
SYNCHRONOUS DATAFLOW (1)
SYSTEM-ON-CHIP (1)
THREE DIMENSIONAL DISPLAYS (1)
THROUGHPUT (1)
TILES (1)
TSV BASED 3D INTEGRATION TECHNOLOGY (1)
UPPER BOUND (1)
VERTICALLY STACKED MEMORY (1)
VIRTUAL SHARED MEMORY (1)
YARN (1)

INFONA - science communication portal
