Search results for: Luca Benini

Items from 1 to 14 out of 14 results

chapter

Deep structured features for semantic segmentation

Michael Tschannen, Lukas Cavigelli, Fabian Mentzer, Thomas Wiatowski, more

2017 25th European Signal Processing Conference (EUSIPCO) > 61 - 65

2017 25th European Signal Processing Conference (EUSIPCO)

We propose a highly structured neural network architecture for semantic segmentation with an extremely small model size, suitable for low-power embedded and mobile platforms. Specifically, our architecture combines i) a Haar wavelet-based tree-like convolutional neural network (CNN), ii) a random layer realizing a radial basis function kernel approximation, and iii) a linear classifier. While stages...

chapter

A 142MOPS/mW integrated programmable array accelerator for smart visual processing

Satyajit Das, Davide Rossi, Kevin J. M. Martin, Philippe Coussy, more

2017 IEEE International Symposium on Circuits and Systems (ISCAS) > 1 - 4

2017 IEEE International Symposium on Circuits and Systems (ISCAS)

Due to increasing demand of low power computing, and diminishing returns from technology scaling, industry and academia are turning with renewed interest toward energy-efficient programmable accelerators. This paper proposes an Integrated Programmable-Array accelerator (IPA) architecture based on an innovative execution model, targeted to accelerate both data and control-flow parts of deeply embedded...

chapter

Ultra low-power visual odometry for nano-scale unmanned aerial vehicles

Daniele Palossi, Andrea Marongiu, Luca Benini

Design, Automation & Test in Europe Conference & Exhibition (DATE), 2017 > 1647 - 1650

2017 Design, Automation & Test in Europe Conference & Exhibition (DATE)

One of the fundamental functionalities for autonomous navigation of Unmanned Aerial Vehicles (UAVs) is the hovering capability. State-of-the-art techniques for implementing hovering on standard-size UAVs process camera stream to determine position and orientation (visual odometry). Similar techniques are considered unaffordable in the context of nano-scale UAVs (i.e. few centimeters of diameter),...

chapter

Always-on motion detection with application-level error control on a near-threshold approximate computing platform

Giuseppe Tagliavini, Andrea Marongiu, Davide Rossi, Luca Benini

2016 IEEE International Conference on Electronics, Circuits and Systems (ICECS) > 552 - 555

2016 IEEE International Conference on Electronics, Circuits and Systems (ICECS)

Pushing supply voltages in the near-threshold region is today one of the main avenues to minimize power consumption in digital integrated circuits. This works well with logic units, but memory operations on standard six-transistor static RAM (6T-SRAM) cells become unreliable at low voltages. Standard cell memory (SCM) works fully reliably at near-threshold voltages, but has much lower area density...

chapter

YodaNN: An Ultra-Low Power Convolutional Neural Network Accelerator Based on Binary Weights

Renzo Andri, Lukas Cavigelli, Davide Rossi, Luca Benini

2016 IEEE Computer Society Annual Symposium on VLSI (ISVLSI) > 236 - 241

2016 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)

Convolutional Neural Networks (CNNs) have revolutionized the world of image classification over the last few years, pushing the computer vision close beyond human accuracy. The required computational effort of CNNs today requires power-hungry parallel processors and GP-GPUs. Recent efforts in designing CNN Application-Specific Integrated Circuits (ASICs) and accelerators for System-On-Chip (SoC) integration...

chapter

High-efficiency logarithmic number unit design based on an improved cotransformation scheme

Youri Popoff, Florian Scheidegger, Michael Schaffner, Michael Gautschi, more

2016 Design, Automation & Test in Europe Conference & Exhibition (DATE) > 1387 - 1392

2016 Design, Automation & Test in Europe Conference & Exhibition (DATE)

The logarithmic number system (LNS) has always been an interesting alternative for floating point calculations since the implementation of several arithmetic operations such as divisions, exponentiations and square-roots, which are required for computationally intensive nonlinear functions, is greatly simplified in the logarithmic space. However, additions and subtractions become nonlinear operations...

chapter

Enabling the heterogeneous accelerator model on ultra-low power microcontroller platforms

Francesco Conti, Daniele Palossi, Andrea Marongiu, Davide Rossi, more

2016 Design, Automation & Test in Europe Conference & Exhibition (DATE) > 1201 - 1206

2016 Design, Automation & Test in Europe Conference & Exhibition (DATE)

The stringent power constraints of complex microcontroller based devices (e.g. smart sensors for the IoT) represent an obstacle to the introduction of sophisticated functionality. Programmable accelerators would be extremely beneficial to provide the flexibility and energy efficiency required by fast-evolving IoT applications; however, the integration complexity and sub-10mW power budgets have been...

chapter

4.6 A 65nm CMOS 6.4-to-29.2pJ/FLOP@0.8V shared logarithmic floating point unit for acceleration of nonlinear function kernels in a tightly coupled processor cluster

Michael Gautschi, Michael Schaffner, Frank K. Gurkaynak, Luca Benini

2016 IEEE International Solid-State Circuits Conference (ISSCC) > 82 - 83

2016 IEEE International Solid-State Circuits Conference (ISSCC)

Energy-efficient computing and ultra-low-power operation are requirements for many application areas, such as IoT and wearables. While for some applications, integer and fixed-point processor instructions suffice, others (e.g. simultaneous localization and mapping - SLAM, stereo vision, nonlinear regression and classification) require a larger dynamic range, typically obtained using single/double-precision...

chapter

Lightweight virtual memory support for many-core accelerators in heterogeneous embedded SoCs

Pirmin Vogel, Andrea Marongiu, Luca Benini

2015 International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS) > 45 - 54

2015 International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS)

While high-end heterogeneous systems are increasingly supporting heterogeneous uniform memory access (hUMA) as envisioned by the Heterogeneous System Architecture (HSA) foundation, their low-power counterparts targeting the embedded domain still lack basic features like virtual memory support for accelerators. As opposed to simply passing virtual address pointers, explicit data management involving...

chapter

Energy-efficient vision on the PULP platform for ultra-low power parallel computing

Francesco Conti, Davide Rossi, Antonio Pullini, Igor Loi, more

2014 IEEE Workshop on Signal Processing Systems (SiPS) > 1 - 6

2014 IEEE Workshop on Signal Processing Systems (SiPS)

Many-core architectures structured as fabrics of tightly-coupled clusters have shown promising results on embedded computer vision benchmarks, providing state-of-art performance with a reduced power budget. We propose PULP (Parallel processing Ultra-Low Power platform), an architecture built on clusters of tightly-coupled OpenRISC ISA cores, with advanced techniques for fast performance and energy...

chapter

A Linux-governor based Dynamic Reliability Manager for android mobile devices

Pietro Mercati, Andrea Bartolini, Francesco Paterna, Tajana Simunic Rosing, more

2014 Design, Automation & Test in Europe Conference & Exhibition (DATE) > 1 - 4

2014 Design, Automation & Test in Europe Conference & Exhibition (DATE)

Reliability is a major concern in multiprocessors. Dynamic Reliability Management (DRM) aims at trading off processor performance with lifetime. The state-of-the-art publications study only the theory supported by simulation. This paper presents the first complete software implementation, working on a real hardware, of a low-overhead, Android-compatible workload-aware DRM Governor for mobile multiprocessors...

chapter

Aging-aware compiler-directed VLIW assignment for GPGPU architectures

Abbas Rahimi, Luca Benini, Rajesh K. Gupta

2013 50th ACM/EDAC/IEEE Design Automation Conference (DAC) > 1 - 6

2013 50th ACM/EDAC/IEEE Design Automation Conference (DAC)

Negative bias temperature instability (NBTI) adversely affects the reliability of a processor by introducing new delay-induced faults. However, the effect of these delay variations is not uniformly spread across functional units and instructions: some are affected more (hence less reliable) than others. This paper proposes a NBTI-aware compiler-directed very long instruction word (VLIW) assignment...

chapter

OpenMP-based Synergistic Parallelization and HW Acceleration for On-Chip Shared-Memory Clusters

Paolo Burgio, Andrea Marongiu, Dominique Heller, Cyrille Chavet, more

2012 15th Euromicro Conference on Digital System Design > 751 - 758

2012 15th Euromicro Conference on Digital System Design (DSD)

Modern embedded MPSoC designs increasingly couple hardware accelerators to processing cores to trade between energy efficiency and platform specialization. To assist effective design of such systems there is the need on one hand for clear methodologies to streamline accelerator definition and instantiation, on the other for architectural templates and run-time techniques that minimize processors-to-accelerator...

chapter

Scalable instruction set simulator for thousand-core architectures running on GPGPUs

Shivani Raghav, Martino Ruggiero, David Atienza, Christian Pinto, more

2010 International Conference on High Performance Computing&Simulation > 459 - 466

2010 International Conference on High Performance Computing & Simulation (HPCS 2010)

Simulators are still the primary tools for development and performance evaluation of applications running on massively parallel architectures. However, current virtual platforms are not able to tackle the complexity issues introduced by 1000-core future scenarios. We present a fast and accurate simulation framework targeting extremely large parallel systems by specifically taking advantage of the...

Filter options

Keywords:
KERNEL
Publication type:
book

Publication date

Set your own date range

Keywords

COMPUTER ARCHITECTURE (6)
HARDWARE (4)
BENCHMARK TESTING (3)
COMPUTATIONAL MODELING (3)
CONVOLUTION (3)
DEGRADATION (2)
FIELD PROGRAMMABLE GATE ARRAYS (2)
GPGPU (2)
PARALLEL PROCESSING (2)
PERFORMANCE EVALUATION (2)
PROGRAM PROCESSORS (2)
PROGRAMMING (2)
RANDOM ACCESS MEMORY (2)
REGISTERS (2)
TABLE LOOKUP (2)
VISUALIZATION (2)
ACCELERATION (1)
ADAPTIVE KERNEL (1)
AGING (1)
AGING-AWARE COMPILATION (1)
ALGEBRA (1)
ARRAYS (1)
BINARYCONNECT (1)
CLOCKS (1)
CMOS INTEGRATED CIRCUITS (1)
COMPLEXITY THEORY (1)
COMPUTER GRAPHIC EQUIPMENT (1)
CONTEXT (1)
CONVOLUTION NEURAL NETWORKS ACCELERATOR (1)
CONVOLUTIONAL CODES (1)
COPROCESSORS (1)
CUDA (1)
DECODING (1)
DECONVOLUTION (1)
DESIGN FLOW (1)
DYNAMIC BINARY OPTIMIZER (1)
ENCODING (1)
ENERGY CONSUMPTION (1)
ENERGY EFFICIENCY (1)
ENGINES (1)
EUROPE (1)
FEATURE EXTRACTION (1)
GPGPUS (1)
HETEROGENEOUS EMBEDDED SYSTEMS ON CHIP (1)
HW ACCELERATION (1)
IMAGE SEGMENTATION (1)
INSTRUCTION SETS (1)
INTERPOLATION (1)
ISS (1)
LINUX (1)
MANYCORE (1)
MEASUREMENT (1)
MEMORY MANAGEMENT (1)
MOTION DETECTION (1)
MPSOCS (1)
MULTICORE PROCESSING (1)
NAVIGATION (1)
NBTI (1)
NEURAL NETWORKS (1)
OPENMP (1)
PARALLEL ARCHITECTURES (1)
PARALLEL SYSTEMS (1)
PREFETCHING (1)
PROGRAMMABLE MANY-CORE ACCELERATORS (1)
PROTOTYPES (1)
QUANTIZATION (SIGNAL) (1)
RELIABILITY (1)
RUNTIME (1)
SCALABLE INSTRUCTION SET SIMULATOR (1)
SEMANTICS (1)
SENSORS (1)
SHARED MEMORY CLUSTERED ARCHITECTURES (1)
SOFTWARE ALGORITHMS (1)
STANDARDS (1)
STRESS (1)
SUPPORT VECTOR MACHINES (1)
SYSTEM-ON-CHIP (1)
TEMPERATURE MEASUREMENT (1)
TEMPERATURE SENSORS (1)
THOUSAND-CORE ARCHITECTURES (1)
VIRTUAL PLATFORMS (1)
VIRTUAL SHARED MEMORY (1)
VLIW (1)
more

INFONA - science communication portal

Search results for: Luca Benini

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options