Search results for: Eriko Nurvitadhi

Items from 1 to 16 out of 16 results

chapter

High performance binary neural networks on the Xeon+FPGA™ platform

Duncan J. M. Moss, Eriko Nurvitadhi, Jaewoong Sim, Asit Mishra, more

2017 27th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 4

2017 27th International Conference on Field Programmable Logic and Applications (FPL)

Convolutional neural networks (CNNs) are deployed in a wide range of image recognition, scene segmentation and object detection applications. Achieving state of the art accuracy in CNNs often results in large models and complex topologies that require significant compute resources to complete in a timely manner. Binarised neural networks (BNNs) have been proposed as an optimised variant of CNNs, which...

chapter

Accelerating Deep Convolutional Networks using low-precision and sparsity

Ganesh Venkatesh, Eriko Nurvitadhi, Debbie Marr

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 2861 - 2865

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

We explore techniques to significantly improve the compute efficiency and performance of Deep Convolution Networks without impacting their accuracy. To improve the compute efficiency, we focus on achieving high accuracy with extremely low-precision (2-bit) weight networks, and to accelerate the execution time, we aggressively skip operations on zero-values. We achieve the highest reported accuracy...

chapter

Fine-grained accelerators for sparse machine learning workloads

Asit K. Mishra, Eriko Nurvitadhi, Ganesh Venkatesh, Jonathan Pearce, more

2017 22nd Asia and South Pacific Design Automation Conference (ASP-DAC) > 635 - 640

2017 22nd Asia and South Pacific Design Automation Conference (ASP-DAC)

Text analytics applications using machine learning techniques have grown in importance with ever increasing amount of data being generated from web-scale applications, social media and digital repositories. Apart from being large in size, these generated data are often unstructured and are heavily sparse in nature. The performance of these applications on current systems is hampered by hard to predict...

chapter

Accelerating Binarized Neural Networks: Comparison of FPGA, CPU, GPU, and ASIC

Eriko Nurvitadhi, David Sheffield, Jaewoong Sim, Asit Mishra, more

2016 International Conference on Field-Programmable Technology (FPT) > 77 - 84

2016 International Conference on Field-Programmable Technology (FPT)

Deep neural networks (DNNs) are widely used in data analytics, since they deliver state-of-the-art accuracies. Binarized neural networks (BNNs) are recently proposed optimized variant of DNNs. BNNs constraint network weight and/or neuron value to either +1 or −1, which is representable in 1 bit. This leads to dramatic algorithm efficiency improvement, due to reduction in the memory and computational...

chapter

Accelerating recurrent neural networks in analytics servers: Comparison of FPGA, CPU, GPU, and ASIC

Eriko Nurvitadhi, Jaewoong Sim, David Sheffield, Asit Mishra, more

2016 26th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 4

2016 26th International Conference on Field Programmable Logic and Applications (FPL)

Recurrent neural networks (RNNs) provide state-of-the-art accuracy for performing analytics on datasets with sequence (e.g., language model). This paper studied a state-of-the-art RNN variant, Gated Recurrent Unit (GRU). We first proposed memoization optimization to avoid 3 out of the 6 dense matrix vector multiplications (SGEMVs) that are the majority of the computation in GRU. Then, we study the...

chapter

Hardware accelerator for analytics of sparse data

Eriko Nurvitadhi, Asit Mishra, Yu Wang, Ganesh Venkatesh, more

2016 Design, Automation & Test in Europe Conference & Exhibition (DATE) > 1616 - 1621

2016 Design, Automation & Test in Europe Conference & Exhibition (DATE)

Rapid growth of Internet led to web applications that produce large unstructured sparse datasets (e.g., texts, ratings). Machine learning (ML) algorithms are the basis for many important analytics workloads that extract knowledge from these datasets. This paper characterizes such workloads on a high-end server for real-world datasets and shows that a set of sparse matrix operations dominates runtime...

chapter

A sparse matrix vector multiply accelerator for support vector machine

Eriko Nurvitadhi, Asit Mishra, Debbie Marr

2015 International Conference on Compilers, Architecture and Synthesis for Embedded Systems (CASES) > 109 - 116

2015 International Conference on Compilers, Architecture and Synthesis for Embedded Systems (CASES)

Sparse matrix vector multiplication (SpMV) is a linear algebra construct commonly found in machine learning (ML) algorithms, such as support vector machine (SVM). We profiled a popular SVM software (libSVM) on an energy-efficient microserver and a high-performance server for real-world ML datasets, and observed that SpMV dominates runtime. We propose a novel SpMV algorithm tailored for ML and a hardware...

chapter

Fast hierarchical implementation of sequential tree-reweighted belief propagation for probabilistic inference

Skand Hurkat, Jungwook Choi, Eriko Nurvitadhi, Jose F. Martinez, more

2015 25th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 8

2015 25th International Conference on Field Programmable Logic and Applications (FPL)

Maximum a posteriori probability (MAP) inference on Markov random fields (MRF) is the basis of many computer vision applications. Sequential tree-reweighted belief propagation (TRW-S) has been shown to provide very good inference quality and strong convergence properties. However, software TRW-S solvers are slow due to the algorithm's high computational requirements. A state-of-the-art FPGA implementation...

chapter

GraphGen: An FPGA Framework for Vertex-Centric Graph Computation

Eriko Nurvitadhi, Gabriel Weisz, Yu Wang, Skand Hurkat, more

2014 IEEE 22nd Annual International Symposium on Field-Programmable Custom Computing Machines > 25 - 28

2014 IEEE 22nd Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM)

Vertex-centric graph computations are widely used in many machine learning and data mining applications that operate on graph data structures. This paper presents GraphGen, a vertex-centric framework that targets FPGA for hardware acceleration of graph computations. GraphGen accepts a vertex-centric graph specification and automatically compiles it onto an application-specific synthesized graph processor...

chapter

3D Point Cloud Reduction Using Mixed-Integer Quadratic Programming

Hyun Soo Park, Yu Wang, Eriko Nurvitadhi, James C. Hoe, more

2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops > 229 - 236

2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

Large scale 3D image localization requires computationally expensive matching between 2D feature points in the query image and a 3D point cloud. In this paper, we present a method to accelerate the matching process and to reduce the memory footprint by analyzing the view-statistics of points in a training corpus. Given a training image set that is representative of common views of a scene, our approach...

chapter

Hardware-efficient stereo estimation using a residual-based approach

Abhishek A. Sharma, Kaustubh Neelathalli, Diana Marculescu, Eriko Nurvitadhi

2013 IEEE International Conference on Acoustics, Speech and Signal Processing > 2693 - 2696

ICASSP 2013 - 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Many promising embedded computer vision applications, such as stereo estimation, rely on inference computation on Markov Random Fields (MRFs). Sequential Tree-Reweighted Message passing (TRW-S) is a superior MRF solving method, which provides better convergence and energy than others (e.g., belief propagation). Since software TRW-S solvers are slow, custom TRW-S hardware has been proposed to improve...

chapter

MEMOCODE 2013 hardware/software co-design contest: Stereo matching

Eriko Nurvitadhi

10th ACM/IEEE International Conference on Formal Methods and Models for Codesign > 131 - 134

2013 Eleventh IEEE/ACM International Conference on Formal Methods and Models for Codesign (MEMOCODE 2013)

The MEMOCODE 2013 design contest problem is stereo matching. Given a stereo image pair (i.e., a left image and a right image), the challenge is to infer the depth information (i.e., third dimension) for each pixel in the image utilizing belief propagation algorithm. Contestants were given a month to develop a fast and/or cost-effective system for stereo matching. From a total of eight participating...

chapter

Integrating formal verification and high-level processor pipeline synthesis

Eriko Nurvitadhi, James C. Hoe, Timothy Kam, Shih-Lien L. Lu

2011 IEEE 9th Symposium on Application Specific Processors (SASP) > 22 - 29

2011 IEEE 9th Symposium on Application Specific Processors (SASP)

When a processor implementation is synthesized from a specification using an automatic framework, this implementation still should be verified against its specification to ensure the automatic framework introduced no error. This paper presents our effort in integrating fully automated formal verification with a high-level processor pipeline synthesis framework. As an integral part of the pipeline...

chapter

Automatic multithreaded pipeline synthesis from transactional datapath specifications

Eriko Nurvitadhi, James C Hoe, Shih-Lien L Lu, Timothy Kam

Design Automation Conference > 314 - 319

2010 47th ACM/EDAC/IEEE Design Automation Conference (DAC 2010)

We present a technique to automatically synthesize a multithreaded in-order pipeline from a high-level unpipelined datapath specification. This work extends the previously proposed transactional specification (T-spec) and synthesis technology (T-piper). The technique not only works with instruction processors but also flexible enough to accept any sequential datapath. It maintains previously proposed...

chapter

Automatic pipelining from transactional datapath specifications

Eriko Nurvitadhi, James C Hoe, Timothy Kam, Shih-Lien L Lu

2010 Design, Automation&Test in Europe Conference&Exhibition (DATE 2010) > 1001 - 1004

2010 Design, Automation & Test in Europe Conference & Exhibition (DATE 2010)

We present a transactional datapath specification (T-spec) and the tool (T-piper) to synthesize automatically an in-order pipelined implementation from it. T-spec abstractly views a datapath as executing one transaction at a time, computing next system states based on current ones. From a T-spec, T-piper can synthesize a pipelined implementation that preserves original transaction semantics, while...

article

Dynamic voltage scaling techniques for power efficient video decoding

Ben Lee, Eriko Nurvitadhi, Reshma Dixit, Chansu Yu, more

Journal of Systems Architecture > 2005 > 51 > 10-11 > 633-652

This paper presents a comparison of power-aware video decoding techniques that utilize dynamic voltage scaling (DVS). These techniques reduce the power consumption of a processor by exploiting high frame variability within a video stream. This is done through scaling of the voltage and frequency of the processor during the video decoding process. However, DVS causes frame deadline misses due to inaccuracies...

Filter options

Publication date

Set your own date range

INFONA - science communication portal

Search results for: Eriko Nurvitadhi

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Data set

Reporting an error / abuse

Sending the report failed

Accessibility options