Search results for: Nasim Farahini

Items from 1 to 13 out of 13 results

chapter

SiLago-CoG: Coarse-Grained Grid-Based Design for Near Tape-Out Power Estimation Accuracy at High Level

Syed Mohammad Asad Hassan Jafri, Nasim Farahini, Ahmed Hemani

2017 IEEE Computer Society Annual Symposium on VLSI (ISVLSI) > 25 - 31

2017 IEEE Computer Society Annual Symposium on VLSI (ISVLSI)

It is well known that ASICs have orders of magnitude higher power efficiency than general propose processors. However, due to the high engineering and manufacturing cost only handful of companies can afford to design ASICs. To reduce this cost numerous high-level synthesis tools have emerged since last 2-3 decades. In spite of these tools, ASIC design is still considered expensive because they fail...

chapter

Atomic stream computation unit based on micro-thread level parallelism

Nasim Farahini, Ahmed Hemani

2015 IEEE 26th International Conference on Application-specific Systems, Architectures and Processors (ASAP) > 25 - 29

2015 IEEE 26th International Conference on Application-specific Systems, Architectures and Processors (ASAP)

The increasing demand for higher resolution of images and communication bandwidth requires the streaming applications to deal with ever increasing size of datasets. Further, with technology scaling the cost of moving data is reducing at a slower pace compared to the cost of computing. These trends have motivated the proposed micro-architectural reorganization of stream processors by dividing the stream...

chapter

Physical design aware system level synthesis of hardware

Nasim Farahini, Ahmed Hemani, Hassan Sohofi, Shuo Li

2015 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS) > 141 - 148

2015 International Conference on Embedded Computer Systems: Architectures, Modeling, and Simulation (SAMOS)

In spite of decades of research, only a small percentage of hardware is designed using high-level synthesis because of the large gap between the abstraction levels of standard cells and algorithmic level. We propose a grid-based regular physical design platform composed of large grain hardened building blocks called SiLago blocks. This platform is divided into regions which are specialized for different...

article

Parallel distributed scalable runtime address generation scheme for a coarse grain reconfigurable computation and storage fabric

Nasim Farahini, Ahmed Hemani, Hassan Sohofi, Syed M.A.H. Jafri, more

Microprocessors and Microsystems > 2014 > 38 > 8 Part A > 788-802

This paper presents a hardware based solution for a scalable runtime address generation scheme for DSP applications mapped to a parallel distributed coarse grain reconfigurable computation and storage fabric. The scheme can also deal with non-affine functions of multiple variables that typically correspond to multiple nested loops. The key innovation is the judicious use of two categories of address...

chapter

Customization methodology of a Coarse Grained Reconfigurable architecture

Siavoosh Payandeh Azad, Nasim Farahini, Ahmed Hemani

2014 NORCHIP > 1 - 4

2014 NORCHIP

Mapping algorithms on CGRAs can lead to an inefficient implementation and hardware under-utilization if there is a mismatch between the granularity of reconfigurable processing unit and the algorithm. In this paper, we introduce a tool that takes the hardware configuration of a set of applications, identifies the unused parts of the CGRA, and let the user sweep the design space from fully programmable...

chapter

A scalable custom simulation machine for the Bayesian Confidence Propagation Neural Network model of the brain

Nasim Farahini, Ahmed Hemani, Anders Lansner, Fabian Clermidy, more

2014 19th Asia and South Pacific Design Automation Conference (ASP-DAC) > 578 - 585

2014 19th Asia and South Pacific Design Automation Conference (ASP-DAC)

A multi-chip custom digital super-computer called eBrain for simulating Bayesian Confidence Propagation Neural Network (BCPNN) model of the human brain has been proposed. It uses Hybrid Memory Cube (HMC), the 3D stacked DRAM memories for storing synaptic weights that are integrated with a custom designed logic chip that implements the BCPNN model. In 22nm node, eBrain executes BCPNN in real time with...

chapter

Spiking brain models: Computation, memory and communication constraints for custom hardware implementation

Anders Lansner, Ahmed Hemani, Nasim Farahini

2014 19th Asia and South Pacific Design Automation Conference (ASP-DAC) > 556 - 562

2014 19th Asia and South Pacific Design Automation Conference (ASP-DAC)

We estimate the computational capacity required to simulate in real time the neural information processing in the human brain. We show that the computational demands of a detailed implementation are beyond reach of current technology, but that some biologically plausible reductions of problem complexity can give performance gains between two and six orders of magnitude, which put implementations within...

chapter

Distributed Runtime Computation of Constraints for Multiple Inner Loops

Nasim Farahini, Ahmed Hemani, Kolin Paul

2013 Euromicro Conference on Digital System Design > 389 - 395

2013 Euromicro Conference on Digital System Design (DSD)

This paper presents hardware solution for runtime computation of loop constraints and synchronizing delays for multiple inner loops in parallel distributed implementation of digital signal processing sub-systems. Methods to map and generate the runtime computation code for loop constraints and synchronizing delays are also presented. Compared to the traditional methods, the proposed solution achieves...

chapter

System level synthesis of hardware for DSP applications using pre-characterized function implementations

Shuo Li, Nasim Farahini, Ahmed Hemani, Kathrin Rosvall, more

2013 International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS) > 1 - 10

2013 International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS)

SYLVA is a system level synthesis framework that transforms DSP sub-systems modeled as synchronous data flow into hardware implementations in ASIC, FPGAs or CGRAs. SYLVA synthesizes in terms of pre-characterized function implementations (FTMPs). It explores the design space in three dimensions, number of FTMPs, type of FTMPs and pipeline parallelism between the producing and consuming FTMPs. We introduce...

chapter

A conceptual custom super-computer design for real-time simulation of human brain

Nasim Farahini, Ahmed Hemani

2013 21st Iranian Conference on Electrical Engineering (ICEE) > 1 - 6

2013 21st Iranian Conference on Electrical Engineering (ICEE)

In this paper, we introduce BRIC, a novel custom multi-chip digital computer architecture for simulating in realtime a model of human brain in form of a spiking Bayesian Confidence Propagation Neural Network (BCPNN). The design is conceptually dimensioned for available technology in 2015–2020 with the estimated size of a pizza box, consuming less than 3 kWs of power, delivering 800 Teraflops/sec (single...

chapter

39.9 GOPs/watt multi-mode CGRA accelerator for a multi-standard basestation

Nasim Farahini, Shuo Li, Muhammad Adeel Tajammul, Muhammad Ali Shami, more

2013 IEEE International Symposium on Circuits and Systems (ISCAS2013) > 1448 - 1451

2013 IEEE International Symposium on Circuits and Systems (ISCAS)

This paper presents an industrial case study of using a Coarse Grain Reconfigurable Architecture (CGRA) for a multi-mode accelerator for two kernels: FFT for the LTE standard and the Correlation Pool for the UMTS standard to be executed in a mutually exclusive manner. The CGRA multi-mode accelerator achieved computational efficiency of 39.94 GOPS/watt (OP is multiply-add) and silicon efficiency of...

chapter

Global Control and Storage Synthesis for a System Level Synthesis Approach

Shuo Li, Nasim Farahini, Ahmed Hemani

2013 IEEE 21st Annual International Symposium on Field-Programmable Custom Computing Machines > 239

IEEE 21st Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM 2013)

SYLVA is a System Level Architectural Synthesis Framework that translates Synchronous Data Flow (SDF) models of DSP sub-systems like modems and codecs into hardware implementation in ASIC/Standard Cells, FPGAs or CGRAs (Coarse Grain Reconfigurable Fabric).

chapter

Energy-aware coarse-grained reconfigurable architectures using dynamically reconfigurable isolation cells

Syed. M. A. H. Jafri, Ozan Bag, Ahmed Hemani, Nasim Farahini, more

International Symposium on Quality Electronic Design (ISQED) > 104 - 111

2013 14th International Symposium on Quality Electronic Design (ISQED)

This paper presents a self adaptive architecture to enhance the energy efficiency of coarse-grained reconfigurable architectures (CGRAs). Today, platforms host multiple applications, with arbitrary inter-application communication and concurrency patterns. Each application itself can have multiple versions (implementations with different degree of parallelism) and the optimal version can only be determined...

Filter options

Publication date

Set your own date range

INFONA - science communication portal

Search results for: Nasim Farahini

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Data set

Reporting an error / abuse

Sending the report failed

Accessibility options