Search results for: Guy G.F. Lemieux

Items from 1 to 10 out of 10 results

chapter

Real-time object detection in software with custom vector instructions and algorithm changes

Joe Edwards, Guy G.F. Lemieux

2017 IEEE 28th International Conference on Application-specific Systems, Architectures and Processors (ASAP) > 75 - 82

2017 IEEE 28th International Conference on Application-specific Systems, Architectures and Processors (ASAP)

Real-time vision applications place stringent performance requirements on embedded systems. To meet performance requirements, embedded systems often require hardware implementations. This approach is unfavorable as hardware development can be difficult to debug and time-consuming, and can require extensive skill. This paper presents a case study of accelerating face detection, often part of a complex image...

chapter

Modular SRAM-Based Binary Content-Addressable Memories

Ameer M.S. Abdelhadi, Guy G.F. Lemieux

2015 IEEE 23rd Annual International Symposium on Field-Programmable Custom Computing Machines > 207 - 214

2015 IEEE 23rd Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM)

Binary Content Addressable Memories (BCAMs), also known as associative memories, are hardware-based search engines. BCAMs employ a massively parallel exhaustive search of the entire memory space, and are capable of matching specific data within a single cycle. Networking, memory management, pattern matching, data compression, DSP, and other applications utilize CAMs as single-cycle associative search...

chapter

Rapid Overlay Builder for Xilinx FPGAs

Michael Xi Yue, Dirk Koch, Guy G.F. Lemieux

2015 IEEE 23rd Annual International Symposium on Field-Programmable Custom Computing Machines > 17 - 20

2015 IEEE 23rd Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM)

Overlays are emerging as useful design patterns for solving reconfigurable computing problems. Overlays consist of compiler-like tools and an architecture written in RTL, making it easier for users to quickly compile high-level languages into FPGAs. Despite a high degree of regularity and repetition present in most overlays, it takes a long time for FPGA tools to generate the configuration bit stream...

chapter

Embedded supercomputing in FPGAs with the VectorBlox MXP Matrix Processor

Aaron Severance, Guy G.F. Lemieux

2013 International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS) > 1 - 10

2013 International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS)

Embedded systems frequently use FPGAs to perform highly parallel data processing tasks. However, building such a system usually requires specialized hardware design skills with VHDL or Verilog. Instead, this paper presents the VectorBlox MXP Matrix Processor, an FPGA-based soft processor capable of highly parallel execution. Programmed entirely in C, the MXP is capable of executing data-parallel software...

chapter

TputCache: High-frequency, multi-way cache for high-throughput FPGA applications

Aaron Severance, Guy G.F. Lemieux

2013 23rd International Conference on Field Programmable Logic and Applications > 1 - 6

2013 23rd International Conference on Field Programmable Logic and Applications (FPL)

Throughput processing involves using many different contexts or threads to solve multiple problems or subproblems in parallel, where the size of the problem is large enough that latency can be tolerated. Bandwidth is required to support multiple concurrent executions, however, and utilizing multiple external memory channels is costly. For small working sets, FPGA designers can use on-chip BRAMs to achieve...

chapter

Safe Overclocking of Tightly Coupled CGRAs and Processor Arrays using Razor

Alexander Brant, Ameer Abdelhadi, Douglas H.H. Sim, Shao Lin Tang, et al.

2013 IEEE 21st Annual International Symposium on Field-Programmable Custom Computing Machines > 37 - 44

IEEE 21st Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM 2013)

Overclocking a CPU is a common practice among home-built PC enthusiasts where the CPU is operated at a higher frequency than its speed rating. This practice is unsafe because timing errors cannot be detected by modern CPUs and they can be practically undetectable by the end user. Using a timing speculation technique such as Razor, it is possible to detect timing errors in CPUs. To date, Razor has...

chapter

Pipeline frequency boosting: Hiding dual-ported block RAM latency using intentional clock skew

Alexander Brant, Ameer Abdelhadi, Aaron Severance, Guy G.F. Lemieux

2012 International Conference on Field-Programmable Technology > 235 - 238

2012 International Conference on Field-Programmable Technology (FPT)

FPGAs are increasingly being used to implement many new applications, including pipelined processor designs. Designers often employ memories to communicate and pass data between these pipeline stages. However, one-cycle communication between sender and receiver is often required. To implement this read-immediately-after-write functionality, bypass registers are needed by most FPGA memory blocks. Read...

chapter

ZUMA: An Open FPGA Overlay Architecture

Alexander Brant, Guy G.F. Lemieux

2012 IEEE 20th International Symposium on Field-Programmable Custom Computing Machines > 93 - 96

2012 IEEE 20th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM)

This paper presents the ZUMA open FPGA overlay architecture. It is an open-source, cross-compatible embedded FPGA architecture that is intended to overlay on top of an existing FPGA, in essence an "FPGA-on-an-FPGA." This approach has a number of benefits, including bit stream compatibility between different vendors and parts, compatibility with open FPGA tool flows, and the ability to embed...

chapter

Configuration Bitstream Reduction for SRAM-based FPGAs by Enumerating LUT Input Permutations

Ameer Abdelhadi, Guy G.F. Lemieux

2011 International Conference on Reconfigurable Computing and FPGAs > 20 - 26

2011 International Conference on Reconfigurable Computing and FPGAs (ReConFig 2011)

SRAM-based Field-Programmable Gate Arrays (FPGAs) are configured from off-chip memory through a serial link. Hence, a large configuration bit stream adversely increases off-chip memory size as well as bit stream loading time. The following work proposes a novel method to reduce the number of programming bits required for look-up tables (LUT), thereby reducing overall configuration bit stream size...

chapter

Deterministic Timing-Driven Parallel Placement by Simulated Annealing Using Half-Box Window Decomposition

Jeffrey B. Goeders, Guy G.F. Lemieux, Steven J.E. Wilton

2011 International Conference on Reconfigurable Computing and FPGAs > 41 - 48

2011 International Conference on Reconfigurable Computing and FPGAs (ReConFig 2011)

As each generation of FPGAs grows in size, the run time of the associated CAD tools is rapidly increasing. Many past efforts have aimed at improving the CAD run time through parallelization of the placement algorithm. Wang and Lemieux presented an algorithm that is scalable, deterministic, timing-driven and achieves speedup over VPR [Wang and Lemieux FPGA'11]. This paper provides two significant alterations...

Keywords

FIELD PROGRAMMABLE GATE ARRAYS (5)
ANNEALING (1)
ASSOCIATIVE ARRAY (1)
ASSOCIATIVE MEMORY (1)
BITSTREAM COMPRESSION (1)
CAD (1)
CATALOG MEMORY (1)
CGRA (1)
CLOCKS (1)
COMPONENT-BASED DESIGN (1)
CONTENT ADDRESSABLE MEMORY (1)
DATA ADDRESSABLE MEMORY (1)
DEGRADATION (1)
DESIGN AND APPLICATIONS (1)
DESIGN AUTOMATION (1)
ENGINES (1)
FACE (1)
FACE DETECTION (1)
FIELD-PROGRAMMABLE GATE ARRAY (FPGA) (1)
FINITE ELEMENT METHODS (1)
FPGA (1)
HARDWARE (1)
LOADING (1)
LOGIC FUNCTIONS (1)
LOGIC GATES (1)
LUT OPTIMIZATION (1)
MEMORY ARCHITECTURES (1)
MODULE RELOCATION (1)
MODULE STITCHING (1)
MODULE VARIANTS (1)
OBJECT DETECTION (1)
OVERLAYS (1)
PARALLEL PLACEMENT (1)
PIPELINE PROCESSING (1)
PIPELINES (1)
PRODUCTIVITY (1)
PROGRAM PROCESSORS (1)
RANDOM ACCESS MEMORY (1)
RECONFIGURABLE ARCHITECTURES (1)
RECONFIGURABLE COMPUTING (1)
REDUNDANCY (1)
REGISTERS (1)
SCHEDULES (1)
SYNCHRONIZATION (1)
TABLE LOOKUP (1)
TIMING (1)
TIMING OPTIMIZATION (1)

INFONA - science communication portal
