Search results for: Yu Wang

Items from 1 to 6 out of 6 results

article

Angel-Eye: A Complete Design Flow for Mapping CNN Onto Embedded FPGA

Kaiyuan Guo, Lingzhi Sui, Jiantao Qiu, Jincheng Yu, more

IEEE Transactions on Computer-Aided Design of Integrated Circuits and... > 2018 > 37 > 1 > 35 - 47

Convolutional neural network (CNN) has become a successful algorithm in the region of artificial intelligence and a strong candidate for many computer vision algorithms. But the computation complexity of CNN is much higher than traditional algorithms. With the help of GPU acceleration, CNN-based applications are widely deployed in servers. However, for embedded platforms, CNN-based solutions are still...

chapter

Exploring the Granularity of Sparsity in Convolutional Neural Networks

Huizi Mao, Song Han, Jeff Pool, Wenshuo Li, more

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 1927 - 1934

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

Sparsity helps reducing the computation complexity of DNNs by skipping the multiplication with zeros. The granularity of sparsity affects the efficiency of hardware architecture and the prediction accuracy. In this paper we quantitatively measure the accuracy-sparsity relationship with different granularity. Coarse-grained sparsity brings more regular sparsity pattern, making it easier for hardware...

chapter

An FPGA Design Framework for CNN Sparsification and Acceleration

Sicheng Li, Wei Wen, Yu Wang, Song Han, more

2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) > 28

2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM)

Convolutional neural networks (CNNs) have recently broken many performance records in image recognition and object detection problems. The success of CNNs, to a great extent, is enabled by the fast scaling-up of the networks that learn from a huge volume of data. The deployment of big CNN models can be both computation-intensive and memory-intensive, leaving severe challenges to hardware implementations...

chapter

Low power Convolutional Neural Networks on a chip

Yu Wang, Lixue Xia, Tianqi Tang, Boxun Li, more

2016 IEEE International Symposium on Circuits and Systems (ISCAS) > 129 - 132

2016 IEEE International Symposium on Circuits and Systems (ISCAS)

Deep learning, and especially Convolutional Neural Network (CNN, is among the most powerful and widely used techniques in computer vision. Applications range from image classification to object detection, segmentation, Optical Character Recognition (OCR), etc. At the same time, CNNs are both computationally intensive and memory intensive, making them difficult to be deployed on low power lightweight...

chapter

Dynamic Stencil: Effective exploitation of run-time resources in reconfigurable clusters

Xinyu Niu, Jose G. F. Coutinho, Yu Wang, Wayne Luk

2013 International Conference on Field-Programmable Technology (FPT) > 214 - 221

2013 International Conference on Field-Programmable Technology (FPT)

Computing nodes in reconfigurable clusters are occupied and released by applications during their execution. At compile time, application developers are not aware of the amount of resources available at run time. Dynamic Stencil is an approach that optimises stencil applications by constructing scalable designs which can adapt to available run-time resources in a reconfigurable cluster. This approach...

chapter

Gemma in April: A matrix-like parallel programming architecture on OpenCL

Tianji Wu, Di Wu, Yu Wang, Xiaorui Zhang, more

2011 Design, Automation&Test in Europe > 1 - 6

2011 Design, Automation & Test in Europe

Nowadays, Graphics Processing Unit (GPU), as a kind of massive parallel processor, has been widely used in general purposed computing tasks. Although there have been mature development tools, it is not a trivial task for programmers to write GPU programs. Based on this consideration, we propose a novel parallel computing architecture. The architecture includes a parallel programming model, named Gemma,...

Filter options

Keywords:
KERNEL
COMPUTATIONAL MODELING

Publication date

Set your own date range

Publication type

book (5)
article (1)

Keywords

FIELD PROGRAMMABLE GATE ARRAYS (4)
HARDWARE (3)
ACCELERATION (2)
BANDWIDTH (2)
COMPUTER ARCHITECTURE (2)
CONVOLUTION (2)
NEURAL NETWORKS (2)
APRIL (1)
COMPLEXITY THEORY (1)
COMPUTER GRAPHIC EQUIPMENT (1)
CONVOLUTIONAL NEURAL NETWORK (CNN) (1)
COPROCESSORS (1)
DATA STORING (1)
DATA TRANSFERRING (1)
DELAYS (1)
DESIGN FLOW (1)
EMBEDDED FIELD-PROGRAMMABLE GATE ARRAY (FPGA) (1)
FEATURE EXTRACTION (1)
GEMMA (1)
GRAIN SIZE (1)
GRAPHICS PROCESSING UNIT (1)
HARDWARE/SOFTWARE CO-DESIGN (1)
MATRIX-LIKE PARALLEL PROGRAMMING ARCHITECTURE (1)
NEURONS (1)
OPEN COMPUTING LANGUAGE (1)
OPENCL KERNELS (1)
OPTIMIZATION (1)
PARALLEL ALGORITHMS (1)
PARALLEL COMPUTING ARCHITECTURE (1)
PARALLEL PROCESSING (1)
PARALLEL PROGRAMMING (1)
PROGRAMMING LANGUAGES (1)
QUANTIZATION (SIGNAL) (1)
SPARSE MATRICES (1)
TENSILE STRESS (1)
more

INFONA - science communication portal

Search results for: Yu Wang

Angel-Eye: A Complete Design Flow for Mapping CNN Onto Embedded FPGA

Exploring the Granularity of Sparsity in Convolutional Neural Networks

An FPGA Design Framework for CNN Sparsification and Acceleration

Low power Convolutional Neural Networks on a chip

Dynamic Stencil: Effective exploitation of run-time resources in reconfigurable clusters

Gemma in April: A matrix-like parallel programming architecture on OpenCL

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options