Search results for: Leibo Liu

Items from 1 to 20 out of 98 results

chapter

Aggressive pipelining of irregular applications on reconfigurable hardware

Zhaoshi Li, Leibo Liu, Yangdong Deng, Shouyi Yin, more

2017 ACM/IEEE 44th Annual International Symposium on Computer Architecture (ISCA) > 575 - 586

2017 ACM/IEEE 44th Annual International Symposium on Computer Architecture (ISCA)

CPU-FPGA heterogeneous platforms offer a promising solution for high-performance and energy-efficient computing systems by providing specialized accelerators with post-silicon reconfigurability. To unleash the power of FPGA, however, the programmability gap has to be filled so that applications specified in high-level programming languages can be efficiently mapped and scheduled on FPGA. The above...

chapter

AEPE: An area and power efficient RRAM crossbar-based accelerator for deep CNNs

Shibin Tang, Shouyi Yin, Shixuan Zheng, Peng Ouyang, more

2017 IEEE 6th Non-Volatile Memory Systems and Applications Symposium (NVMSA) > 1 - 6

2017 IEEE 6th Non-Volatile Memory Systems and Applications Symposium (NVMSA)

Deep convolutional neural networks (CNN) have shown great accuracy on object recognition and classification tasks. Deep CNNs are computation intensive algorithms, hence many customized RRAM crossbar-based accelerators are proposed to meet the computing demands in deep CNNs, but the area costs and the power consumption are still great challenges for RRAM crossbar-based accelerators. In this work, we...

chapter

A 1.06-to-5.09 TOPS/W reconfigurable hybrid-neural-network processor for deep learning applications

Shouyi Yin, Peng Ouyang, Shibin Tang, Fengbin Tu, more

2017 Symposium on VLSI Circuits > C26 - C27

2017 Symposium on VLSI Circuits

An energy-efficient hybrid neural network (NN) processor is implemented in a 65nm technology. It has two 16×16 reconfigurable heterogeneous processing elements (PEs)arrays. To accelerate a hybrid-NN, the PE array is designed to support on demand partitioning and reconfiguration for parallel processing different NNs. To improve energy efficiency, each PE supports bit-width adaptive computing to meet...

chapter

Fast and efficient integration of human upper-body detection and orientation estimation in RGB-D video

Tao Ji, Leibo Liu, Wenping Zhu, Jinghe Wei, more

2017 IEEE 9th International Conference on Communication Software and Networks (ICCSN) > 1178 - 1181

2017 IEEE 9th International Conference on Communication Software and Networks (ICCSN)

Automatic and accurate human upper-body detection and orientation estimation have great practical value in several computer vision applications. Most previous works on human upper-body orientation estimation assume that the human upper-body region is already detected and aligned. However, this is not the case in many real-world scenarios. Additional human detector is essential which is usually much...

chapter

DFGNet: Mapping dataflow graph onto CGRA by a deep learning approach

Shouyi Yin, Dajiang Liu, Lifeng Sun, Leibo Liu, more

2017 IEEE International Symposium on Circuits and Systems (ISCAS) > 1 - 4

2017 IEEE International Symposium on Circuits and Systems (ISCAS)

The coarse-grained reconfigurable architecture (C-GRA) is a promising platform that provides both high performance and high power-efficiency. Dataflow graph (DFG) mapping is critical to tap the potentials of CGRAs. Inspired from the great progress made in tree search game using deep neural network, we proposed a frame work for learning convolutional neural network for mapping DFGs onto spatial programmable...

chapter

Memory fartitioning-based modulo scheduling for high-level synthesis

Tianyi Lu, Shouyi Yin, Xianqing Yao, Zhicong Xie, more

2017 IEEE International Symposium on Circuits and Systems (ISCAS) > 1 - 4

2017 IEEE International Symposium on Circuits and Systems (ISCAS)

High-Level Synthesis (HLS) has been widely recognized as an efficient compilation process targeting FPGAs for algorithm evaluation and product prototyping. However, the massively parallel memory access demands and the extremely expensive cost of single-bank memory with multi-port have impeded loop pipelining performance. Thus, based on an alternative multi-bank memory architecture, a joint approach...

chapter

Area-efficient polynomial modular multiplication over GF(2∧n) and application to AES

Qihuan Huang, Leibo Liu, Hai Huang, Shaojun Wei

2017 IEEE 9th International Conference on Communication Software and Networks (ICCSN) > 1128 - 1132

2017 IEEE 9th International Conference on Communication Software and Networks (ICCSN)

Due to low masking-complexity property of the addition chain, it has been widely researched for evaluating the S-boxes in the recent literatures. This paper summarizes four main addition chains developed for the AES S-box in the existing literatures and chooses the most area-efficient addition chain. To further reduce the masking complexity, this paper proposes an improved algorithm for evaluating...

chapter

Hardware efficient signal detector based on lanczos method for massive MIMO systems

Yang Xue, Leibo Liu, Guiqiang Peng, Huaibo Zhang, more

2017 IEEE 9th International Conference on Communication Software and Networks (ICCSN) > 523 - 527

2017 IEEE 9th International Conference on Communication Software and Networks (ICCSN)

Minimum mean square error hometd has proved its superiority for signal detection in massive multiple-input multiple-output (MIMO) systems for its near-optimal performance. However, the detection efficiency is restrained by a high computation complexity and low parallelism operation of matrix inversion. This paper presented a hardware efficient signal detector based on low complexity Lanczos Method,...

chapter

Bit-Width Based Resource Partitioning for CNN Acceleration on FPGA

Jianxin Guo, Shouyi Yin, Peng Ouyang, Leibo Liu, more

2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) > 31

2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM)

Convolutional neural networks (CNNs) haveachieved great success in many applications. Recently, variousFPGA-based accelerators have been proposed to improve theperformance of CNNs. However, current most FPGA-basedmethods use single bit-width selection for all CNN layers, which lead to very low resource utilization efficiency anddifficulty in further performance improvement. In this paper, we propose...

chapter

A 700fps optimized coarse-to-fine shape searching based hardware accelerator for face alignment

Qiang Wang, Leibo Liu, Wenping Zhu, Huiyu Mo, more

2017 54th ACM/EDAC/IEEE Design Automation Conference (DAC) > 1 - 6

2017 54th ACM/EDAC/IEEE Design Automation Conference (DAC)

In this work, a fast shape searching face alignment (F-SSFA) algorithm based accelerator is proposed to achieve real-time processing. Firstly, a learning based low-dimensional SURF feature is introduced to reduce the computation cost in the cascaded regression. Then the Euclidean distance and shape affine transformation are utilized to accelerate the shape searching procedure. F-SSFA therefore greatly...

chapter

Minimizing pipeline stalls in distributed-controlled coarse-grained reconfigurable arrays with Triggered Instruction issue and execution

Yanan Lu, Leibo Liu, Yangdong Deng, Jian Weng, more

2017 54th ACM/EDAC/IEEE Design Automation Conference (DAC) > 1 - 6

2017 54th ACM/EDAC/IEEE Design Automation Conference (DAC)

The pipeline stall in distributed-controlled coarse-grained reconfigurable arrays is a major source stumbling performance. This work presents a Triggered-Issue and Triggered-Execution (TITE) paradigm motivated from the Triggered Instruction Architecture (TIA) which converts control and data dependencies into predicate dependencies as triggers for spatial parallelism. TITE separately triggers the issuing...

chapter

Special session paper: an efficient hardware design for cerebellar models using approximate circuits

Honglan Jiang, Leibo Liu, Jie Han

2017 International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS) > 1 - 2

2017 International Conference on Hardware/Software Codesign and System Synthesis (CODES+ISSS)

The superior controllability of the cerebellum has motivated extensive interest in the development of computational cerebellar models. Many models have been applied to the motor control and image stabilization in robots. Often computationally complex, cerebellar models have rarely been implemented in dedicated hardware. Here, we propose an efficient hardware design for cerebellar models using approximate...

chapter

Energy-aware loops mapping on multi-vdd CGRAs without performance degradation

Jiangyuan Gu, Shouyi Yin, Leibo Liu, Shaojun Wei

2017 22nd Asia and South Pacific Design Automation Conference (ASP-DAC) > 312 - 317

2017 22nd Asia and South Pacific Design Automation Conference (ASP-DAC)

Coarse Grained Reconfigurable Architectures (C-GRAs) have been paid an increasing attention due to their inherent advantages of high performance and energy efficiency. As we know, multi-V_dd technique is popularly used to reduce energy consumption, and modulo scheduling is one of widely-used pipeline techniques to improve performance. To achieve both high performance and energy-efficiency simultaneously,...

chapter

Energy management on DVS based coarse-grained reconfigurable platform

Peng Ouyang, Shouyi Yin, Chunxiao Xing, Leibo Liu, more

2016 IEEE/ACM International Symposium on Nanoscale Architectures (NANOARCH) > 49 - 54

2016 IEEE/ACM International Symposium on Nanoscale Architectures (NANOARCH)

The coarse-grained reconfigurable architecture (CGRA) is a promising platform for mobile computing. In this work, based on the battery nonlinear effects, we propose a method to achieve co-optimization of task partition and multi-cell battery scheduling with dynamical voltage scaling (DVS) for CGRA computing platform. Experimental results show that average 33.6% improvement in battery runtime over...

chapter

OPTMR: Optimal data flow graph partitioning for triple modular redundancy against hardware Trojan in reconfigurable hardware

Shengyang Mao, Leibo Liu

2016 6th International Conference on Electronics Information and Emergency Communication (ICEIEC) > 68 - 71

2016 6th International Conference on Electronics Information and Emergency Communication (ICEIEC)

Hardware Trojans have become a significant threat to computing reliability and data security in reconfigurable hardware. One of the most effective techniques of run-time detection and recovery is based on Triple Modular Redundancy (TMR) mechanism; however, this mechanism causes a large resource overhead because the protected circuit needs to be totally duplicated twice for detection stage and decision...

chapter

Low complexity signal detector based on Lanczos method for large-scale MIMO systems

Huaibo Zhang, Guiqiang Peng, Leibo Liu

2016 6th International Conference on Electronics Information and Emergency Communication (ICEIEC) > 6 - 9

2016 6th International Conference on Electronics Information and Emergency Communication (ICEIEC)

For large-scale multiple-input multiple-output (MIMO) systems, linear minimum mean square error (MMSE) method is one of the most near-optimal ways for signal detection. However, MMSE involves matrix inversion which is of high complexity for computation. In this paper, a Lanczos-based method is proposed to solve the problem by transferring the matrix inversion computation into an iteration process...

chapter

HCGM-based high-efficiency temperature evaluation scheme for NoCs

Jiqiang Chen, Chen Wu, Leibo Liu

2016 6th International Conference on Electronics Information and Emergency Communication (ICEIEC) > 105 - 108

2016 6th International Conference on Electronics Information and Emergency Communication (ICEIEC)

Temperature evaluation is a key point to the static power calculation and thermal management for application mapping. This paper proposes a design-time (offline) Heat Conduction Grid Model (HCGM) to evaluate the temperature of network-on-chip, which is based on the temperature dependency on heat flux density produced by ambient tiles as well as itself. This model incorporates (1) short running time...

chapter

Large-scale MIMO detection design and FPGA implementations using SOR method

Peng Zhang, Leibo Liu, Guiqiang Peng, Shaojun Wei

2016 8th IEEE International Conference on Communication Software and Networks (ICCSN) > 206 - 210

2016 8th IEEE International Conference on Communication Software and Networks (ICCSN)

In this paper, we propose a very large scale integration design method for a large-scale multiple-input multiple-output detection algorithm. Our design uses a modified version of the Successive Over Relaxation (SOR) method, which substantially reduces the highly computational complexity of data detection and achieves the near-optimal performance. We use a reconfigurable Processing Elements Array (PEA)...

chapter

A novel hardware accelerator guideline for ANN with high performance

Tianbao Chen, Shouyi Yin, Peng Ouyang, Fengbin Tu, more

2016 5th International Symposium on Next-Generation Electronics (ISNE) > 1 - 2

2016 5th International Symposium on Next-Generation Electronics (ISNE)

Artificial Neural Network (ANN) is widely used in machine learning and artificial intelligence areas. But ANN requires a long running time and induces a high power consumption when running on a GPU or CPU which may hinder its application in embedded system. This paper proposes a hardware accelerator design guideline for ANN with arbitrary scales and depths. We take full consideration of the hardware...

chapter

User Behavior Pattern Analysis and Prediction Based on Mobile Phone Sensors

Jiqiang Song, Eugene Y. Tang, Leibo Liu

Lecture Notes in Computer Science > Network and Parallel Computing > Session 3: Network > 177-189

More and more mobile phones are equipped with multiple sensors today. This creates a new opportunity to analyze users’ daily behaviors and evolve mobile phones into truly intelligent personal devices, which provide accurate context-adaptive and individualized services. This paper proposed a MAST (Movement, Action, and Situation over Time) model to explore along this direction and identified key technologies...

Publication type:
book

Publication date

Set your own date range

Keywords

COMPUTER ARCHITECTURE (21)
HARDWARE (19)
KERNEL (17)
ARRAYS (15)
RECONFIGURABLE ARCHITECTURES (15)
CONTEXT (11)
DECODING (11)
ALGORITHM DESIGN AND ANALYSIS (10)
FIELD PROGRAMMABLE GATE ARRAYS (9)
PIPELINES (9)
SYSTEM-ON-CHIP (9)
VIDEO CODING (9)
COMPUTATIONAL MODELING (7)
FEATURE EXTRACTION (7)
PARALLEL PROCESSING (7)
PIPELINE PROCESSING (7)
RECONFIGURABLE COMPUTING (7)
MULTIMEDIA COMMUNICATION (6)
OPTIMIZATION (6)
STREAMING MEDIA (6)
CLOCKS (5)
DISCRETE COSINE TRANSFORMS (5)
MICROPROCESSORS (5)
POWER DEMAND (5)
RECONFIGURABLE MULTIMEDIA SYSTEM (5)
THROUGHPUT (5)
CGRA (4)
ENERGY CONSUMPTION (4)
ENERGY EFFICIENCY (4)
EQUATIONS (4)
HISTOGRAMS (4)
IMAGE CODING (4)
MATHEMATICAL MODEL (4)
MEMORY MANAGEMENT (4)
REAL-TIME SYSTEMS (4)
REMUS (4)
ROBUSTNESS (4)
ROUTING (4)
SCHEDULING (4)
SWITCHES (4)
THREE-DIMENSIONAL DISPLAYS (4)
VLSI (4)
WIRELESS COMMUNICATION (4)
0.18 MICRON (3)
ADDERS (3)
APPLICATION SPECIFIC INTEGRATED CIRCUITS (3)
BANDWIDTH (3)
BATTERIES (3)
BATTERY LIFETIME (3)
CLASSIFICATION ALGORITHMS (3)
CMOS INTEGRATED CIRCUITS (3)
CMOS TECHNOLOGY (3)
COMPLEXITY THEORY (3)
DATA MINING (3)
DIGITAL SIGNAL PROCESSING CHIPS (3)
FILTERING (3)
H.264 (3)
IMAGE COLOR ANALYSIS (3)
LIGHTING (3)
LOW-POWER ELECTRONICS (3)
MOBILE COMPUTING (3)
MULTIMEDIA SYSTEMS (3)
MULTIPROCESSING SYSTEMS (3)
PARALLEL ARCHITECTURES (3)
PARTITIONING ALGORITHMS (3)
POWER CONSUMPTION (3)
PROCESS CONTROL (3)
REGISTERS (3)
RESOURCE MANAGEMENT (3)
SILICON (3)
SYSTEM-ON-A-CHIP (3)
VLSI ARCHITECTURE (3)
WIRELESS SENSOR NETWORKS (3)
1.8 V (2)
100 MHZ (2)
ACCELERATION (2)
ADAPTATION MODEL (2)
ARTIFICIAL NEURAL NETWORKS (2)
AUTOMATIC VOLTAGE CONTROL (2)
AVS (2)
BATTERY MANAGEMENT SYSTEMS (2)
CAMERAS (2)
CIPHERS (2)
COARSE-GRAINED RECONFIGURABLE PROCESSOR (2)
COMPILATION (2)
CONFERENCES (2)
CONSUMER ELECTRONICS (2)
CONVOLUTION (2)
COPROCESSORS (2)
COST FUNCTION (2)
DATA FLOW GRAPH (2)
DATA PARALLELISM (2)
DCT (2)
DEAD LOCK PROBLEM (2)
DEBLOCKING FILTER (2)
DELAY (2)
DELAY LOCK LOOPS (2)
DELAYS (2)
DIGITAL SIGNAL PROCESSING (2)
EDUCATIONAL INSTITUTIONS (2)
more

Data set

ieee (97)
Springer (1)

INFONA - science communication portal

Search results for: Leibo Liu

Aggressive pipelining of irregular applications on reconfigurable hardware

AEPE: An area and power efficient RRAM crossbar-based accelerator for deep CNNs

A 1.06-to-5.09 TOPS/W reconfigurable hybrid-neural-network processor for deep learning applications

Fast and efficient integration of human upper-body detection and orientation estimation in RGB-D video

DFGNet: Mapping dataflow graph onto CGRA by a deep learning approach

Memory fartitioning-based modulo scheduling for high-level synthesis

Area-efficient polynomial modular multiplication over GF(2∧n) and application to AES

Hardware efficient signal detector based on lanczos method for massive MIMO systems

Bit-Width Based Resource Partitioning for CNN Acceleration on FPGA

A 700fps optimized coarse-to-fine shape searching based hardware accelerator for face alignment

Minimizing pipeline stalls in distributed-controlled coarse-grained reconfigurable arrays with Triggered Instruction issue and execution

Special session paper: an efficient hardware design for cerebellar models using approximate circuits

Energy-aware loops mapping on multi-vdd CGRAs without performance degradation

Energy management on DVS based coarse-grained reconfigurable platform

OPTMR: Optimal data flow graph partitioning for triple modular redundancy against hardware Trojan in reconfigurable hardware

Low complexity signal detector based on Lanczos method for large-scale MIMO systems

HCGM-based high-efficiency temperature evaluation scheme for NoCs

Large-scale MIMO detection design and FPGA implementations using SOR method

A novel hardware accelerator guideline for ANN with high performance

User Behavior Pattern Analysis and Prediction Based on Mobile Phone Sensors

Filter options

Publication date

Keywords

Data set

INFONA - science communication portal

Search results for: Leibo Liu

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Data set

Reporting an error / abuse

Sending the report failed

Accessibility options