Search results

Items from 1 to 20 out of 59 results

chapter

On accelerating pair-HMM computations in programmable hardware

Subho S. Banerjee, Mohamed el-Hadedy, Ching Y. Tan, Zbigniew T. Kalbarczyk, more

2017 27th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 8

2017 27th International Conference on Field Programmable Logic and Applications (FPL)

This paper explores hardware acceleration to significantly improve the runtime of computing the forward algorithm on Pair-HMM models, a crucial step in analyzing mutations in sequenced genomes. We describe 1) the design and evaluation of a novel accelerator architecture that can efficiently process real sequence data without performing wasteful work; and 2) aggressive memoization techniques that can...

chapter

Reconfigurable acceleration of genetic sequence alignment: A survey of two decades of efforts

Ho-Cheung Ng, Shuanglong Liu, Wayne Luk

2017 27th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 8

2017 27th International Conference on Field Programmable Logic and Applications (FPL)

Genetic sequence alignment has always been a computational challenge in bioinformatics. Depending on the problem size, software-based aligners can take multiple CPU-days to process the sequence data, creating a bottleneck point in bioinformatic analysis flow. Reconfigurable accelerator can achieve high performance for such computation by providing massive parallelism, but at the expense of programming...

chapter

Ultrasonic flaw detection based on temporal and subband signals applied to neural network

Boyang Wang, Jafar Saniie

2017 IEEE International Ultrasonics Symposium (IUS) > 1

2017 IEEE International Ultrasonics Symposium (IUS)

Ultrasonic NDE uses high frequency acoustic waves to evaluate materials, and often signal processing is required to detect echoes from defects in the presence of microstructure scattering noise. Scattering noise, also known as clutter, interferes with the flaw signal and cannot be completely eliminated by using classical signal processing methods such as band-pass filtering. In this paper, neural...

chapter

Acceleration of RSA processes based on hybrid ARM-FPGA cluster

Xu Bai, Lei Jiang, Qiong Dai, Jiajia Yang, more

2017 IEEE Symposium on Computers and Communications (ISCC) > 682 - 688

2017 IEEE Symposium on Computers and Communications (ISCC)

Cooperation of software and hardware with hybrid architectures, such as Xilinx Zynq SoC combining ARM CPU and FPGA fabric, is a high-performance and low-power platform for accelerating RSA Algorithm. This paper adopts the none-subtraction Montgomery algorithm and the Chinese Remainder Theorem (CRT) to implement high-speed RSA processors, and deploys a 48-node cluster infrastructure based on Zynq SoC...

chapter

A Highly Scalable and Efficient Parallel Design of N-Body Simulation on FPGA

Emanuele Del Sozzo, Lorenzo Di Tucci, Marco D. Santambrogio

2017 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) > 241 - 246

2017 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)

N-Body simulation simulates the evolution of a system that is composed of N particles, where each element receives a force that is due to the interaction with all the other elements within the system. Usually, the influence of external physical forces, such as gravity, is involved too. This methodology is widely used in different fields that range from astrophysics, where it is used to study the interaction...

chapter

A Hardware Acceleration for Surface EMG Non-Negative Matrix Factorization

Luca Cerina, Pierandrea Cancian, Giuseppe Franco, Marco Domenico Santambrogio

2017 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) > 168 - 174

2017 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)

To the present day, a multitude of studies aims to understand how the Central Nervous System (CNS) translates neural pulses to muscle motor tasks, through the analysis of surface EMG (sEMG) recordings. One of the most considerable methods applies the Non-Negative Matrix Factorization (NMF) to data recorded from sEMG electrodes, to extract coordinated motor patterns, the so-called muscle synergies,...

chapter

Minimalist Design for Accelerating Convolutional Neural Networks for Low-End FPGA Platforms

Raghid Morcel, Haitham Akkary, Hazem Hajj, Mazen Saghir, more

2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) > 196

2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM)

Deep neural networks have gained tremendous attention in both the academic and industrial communities due to their performance in many artificial intelligence applications, particularly in computer vision. However, these algorithms are known to be computationally very demanding for both scoring and model learning applications. State-of-the-art recognition models use tens of millions of parameters...

chapter

ProFAX: A hardware acceleration of a protein folding algorithm

Giulia Guidi, Lorenzo Di Tucci, Marco D. Santambrogio

2016 IEEE 2nd International Forum on Research and Technologies for Society and Industry Leveraging a better tomorrow (RTSI) > 1 - 6

2016 IEEE 2nd International Forum on Research and Technologies for Society and Industry Leveraging a better tomorrow (RTSI)

Protein folding is the physical process by which a sequence of amino acids in a protein folds into its tertiary structure, which determines the functionality of the protein. The knowledge of this structure is crucial for the development of new pharmaceutical therapies. For this reason, many drug industries are interested in applying these kind of algorithms. There are various methods to perform this...

chapter

High-level synthesis for medical image processing on Systems on Chip: A case study

Fraser D Robinson, Louise H Crockett, William H Nailon, Robert W Stewart

2016 26th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 2

2016 26th International Conference on Field Programmable Logic and Applications (FPL)

Adaptive radiotherapy is a technique intended to increase the accuracy of radiotherapy. Currently, it is not clinically feasible due to the time required to process the images of patient anatomy. Hardware acceleration of image processing algorithms may allow them to be carried out in a clinically acceptable timeframe. This paper presents the experiences encountered using high-level synthesis tools...

chapter

Optimal random sampling based path planning on FPGAs

Size Xiao, Adam Postula, Neil Bergmann

2016 26th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 2

2016 26th International Conference on Field Programmable Logic and Applications (FPL)

Random sampling based path planning algorithms have shown their high efficiency in robotics, navigation and related fields. The Rapidly-Exploring Random Trees (RRT) is the typical method and works well in a variety of applications. Due to the sub-optimal issue of original RRT, the recent algorithm, known as RRT*, significantly improves the optimality of solution by adding the “cost review” procedure...

chapter

Scalable and modularized RTL compilation of Convolutional Neural Networks onto FPGA

Yufei Ma, Naveen Suda, Yu Cao, Jae-sun Seo, more

2016 26th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 8

2016 26th International Conference on Field Programmable Logic and Applications (FPL)

Despite its popularity, deploying Convolutional Neural Networks (CNNs) on a portable system is still challenging due to large data volume, intensive computation and frequent memory access. Although previous FPGA acceleration schemes generated by high-level synthesis tools (i.e., HLS, OpenCL) have allowed for fast design optimization, hardware inefficiency still exists when allocating FPGA resources...

chapter

Hardware acceleration of feature detection and description algorithms on low-power embedded platforms

Onur Ulusel, Christopher Picardo, Christopher B. Harris, Sherief Reda, more

2016 26th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 9

2016 26th International Conference on Field Programmable Logic and Applications (FPL)

Image features are broadly used in embedded computer vision applications, from object detection and tracking to motion estimation and 3D reconstruction. Efficient feature extraction and description are crucial due to the real-time requirements of such applications over a constant stream of input data. High-speed computation typically comes at the cost of high power dissipation, yet embedded systems...

chapter

SCADIS: A Scalable Accelerator for Data-Intensive String Set Matching on FPGAs

Shiming Lei, Chao Wang, Haijie Fang, Xi Li, more

2016 IEEE Trustcom/BigDataSE/ISPA > 1190 - 1197

2016 IEEE Trustcom/BigDataSE/ISPA

String matching has become essential and widely applied in modern computer applications, especially with explosive data scale. As a classic fast and exact single pattern matching algorithm, Knuth-Morris-Pratt (KMP) algorithm has been demonstrated in network security and computational biology. However, with the increasing amount of data in the modern society, it becomes increasing important and essential...

chapter

Configurable FPGA architecture for hardware-software merge sorting

Patricia Carla Petrut, Alexandru Amaricai, Oana Boncalo

2016 MIXDES - 23rd International Conference Mixed Design of Integrated Circuits and Systems > 179 - 182

2016 MIXDES - 23rd International Conference "Mixed Design of Integrated Circuits and Systems"

Sorting represents one of the most important operations in data center applications. In this paper, we propose a hardware-software FPGA accelerated based solution for very large data set merge sorting. The accelerator is using a FIFO based approach for sorting. The main contributions of the proposed solution are: (i) configurable FIFO buffers in order to address the variable size of the pre-sorted...

chapter

On the Automation of High Level Synthesis of Convolutional Neural Networks

Emanuele Del Sozzo, Andrea Solazzo, Antonio Miele, Marco D. Santambrogio

2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) > 217 - 224

2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)

Convolutional Neural Networks (CNNs) are a particular type of Artificial Neural Networks (ANNs) inspired by cells in the primary visual cortex of animals, and represent the state of the art in image recognition and classification. Nowadays, such supervised learning technique is very popular in Big Data analytics. In this context, due to the huge amount of data to be processed, it is crucial to find...

chapter

Power-Efficient Accelerated Genomic Short Read Mapping on Heterogeneous Computing Platforms

Ernst Joachim Houtgast, Vlad-Mihai Sima, Giacomo Marchiori, Koen Bertels, more

2016 IEEE 24th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) > 28

2016 IEEE 24th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM)

We propose a novel FPGA-accelerated BWA-MEM implementation, a popular tool for genomic data mapping. The performance and power-efficiency of the FPGA implementation on the single Xilinx Virtex-7 Alpha Data add-in card is compared against a software-only baseline system. By offloading the Seed Extension phase onto the FPGA, a two-fold speedup in overall application-level performance is achieved and...

chapter

Acceleration of the Pair-HMM Algorithm for DNA Variant Calling

Gowthami Jayashri Manikandan, Sitao Huang, Kyle Rupnow, Wen-Mei W. Hwu, more

2016 IEEE 24th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) > 137

2016 IEEE 24th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM)

In this project, we propose an SoC solution to accelerate the Pair-HMM's forward algorithm which is the key performance bottleneck in the GATK's HaplotypeCaller tool for DNA variant calling. We develop two versions of the Pair-HMM accelerator: one using High Level Synthesis (HLS), and another ring-based manual RTL implementation. We investigate the performance of the manual RTL design and HLS design...

chapter

FPGA acceleration of reference-based compression for genomic data

James Arram, Moritz Pflanzer, Thomas Kaplan, Wayne Luk

2015 International Conference on Field Programmable Technology (FPT) > 9 - 16

2015 International Conference on Field Programmable Technology (FPT)

One of the key challenges facing genomics today is efficiently storing the massive amounts of data generated by next-generation sequencing platforms. Reference-based compression is a popular strategy for reducing the size of genomic data, whereby sequence information is encoded as a mapping to a known reference sequence. Determining the mapping is a computationally intensive problem, and is the bottleneck...

chapter

A bi-objective heuristic for heterogeneous MPSoC design space exploration

Braham Lotfi Mediouni, Smail Niar, Rachid Benmansour, Karima Benatchba, more

2015 10th International Design & Test Symposium (IDT) > 90 - 95

2015 10th International Design & Test Symposium (IDT)

Recent technology advances allow new generation reconfigurable-based embedded systems to contain a large number of cores and reconfigurable logic elements. Consequently, to take benefit of such very powerful hybrid reconfigurable MPSoC, designers need tools to explore the large design space of the possible configurations. In this paper, we develop a new hybrid bi-objective genetic and parallel variable...

chapter

FPGA acceleration for feature based processing applications

Gooitzen van der Wal, David Zhang, Indu Kandaswamy, James Marakowitz, more

2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 42 - 47

2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

Feature based vision applications rely on highly efficient extraction and analysis of features from images to reach satisfactory levels of performance and latency. In this paper, we describe the implementation of an algorithm that combines distributed feature detector (D-HCD) with a rotational invariant feature descriptor (R-HOG). Based on an algorithmic comparison with other feature detectors and...

Data set:
ieee
Keywords:
FIELD PROGRAMMABLE GATE ARRAYS
ALGORITHM DESIGN AND ANALYSIS
ACCELERATION

Publication date

Set your own date range

INFONA - science communication portal

Search results

On accelerating pair-HMM computations in programmable hardware

Reconfigurable acceleration of genetic sequence alignment: A survey of two decades of efforts

Ultrasonic flaw detection based on temporal and subband signals applied to neural network

Acceleration of RSA processes based on hybrid ARM-FPGA cluster

A Highly Scalable and Efficient Parallel Design of N-Body Simulation on FPGA

A Hardware Acceleration for Surface EMG Non-Negative Matrix Factorization

Minimalist Design for Accelerating Convolutional Neural Networks for Low-End FPGA Platforms

ProFAX: A hardware acceleration of a protein folding algorithm

High-level synthesis for medical image processing on Systems on Chip: A case study

Optimal random sampling based path planning on FPGAs

Scalable and modularized RTL compilation of Convolutional Neural Networks onto FPGA

Hardware acceleration of feature detection and description algorithms on low-power embedded platforms

SCADIS: A Scalable Accelerator for Data-Intensive String Set Matching on FPGAs

Configurable FPGA architecture for hardware-software merge sorting

On the Automation of High Level Synthesis of Convolutional Neural Networks

Power-Efficient Accelerated Genomic Short Read Mapping on Heterogeneous Computing Platforms

Acceleration of the Pair-HMM Algorithm for DNA Variant Calling

FPGA acceleration of reference-based compression for genomic data

A bi-objective heuristic for heterogeneous MPSoC design space exploration

FPGA acceleration for feature based processing applications

Filter options

Publication date

Content availability

Publication type

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options