Search results for: Shinya Takamaeda-Yamazaki

Items from 1 to 15 out of 15 results

chapter

FPGA implementation of edge-guided pattern generation for motion-vector estimation of textureless objects

Aoi Tanibata, Alexandre Schmid, Shinya Takamaeda-Yamazaki, Masayuki Ikebe, more

2017 27th International Conference on Field Programmable Logic and Applications (FPL) > 1

2017 27th International Conference on Field Programmable Logic and Applications (FPL)

The widely accepted block-matching technique, which is required to identify motion vectors, fails in cases in which texture is not existent. In [1], we proposed a hardware-oriented cellular-automaton algorithm that generates spatial patterns on textureless objects and backgrounds, aiming at motion-vector estimation of textureless moving objects. This demonstration presents a field-programmable gate...

chapter

In-memory area-efficient signal streaming processor design for binary neural networks

Haruyoshi Yonekawa, Shimpei Sato, Hiroki Nakahara, Kota Ando, more

2017 IEEE 60th International Midwest Symposium on Circuits and Systems (MWSCAS) > 116 - 119

2017 IEEE 60th International Midwest Symposium on Circuits and Systems (MWSCAS)

The expanding use of deep learning algorithms causes the demands for accelerating neural network (NN) signal processing. For the NN processing, in-memory computation is desired, in which expensive data transfer can be eliminated. In reflection of recently proposed binary neural networks (BNNs), which can reduce the computation resource and area requirements, we designed an in-memory BNN signal processor...

chapter

BRein memory: A 13-layer 4.2 K neuron/0.8 M synapse binary/ternary reconfigurable in-memory deep neural network accelerator in 65 nm CMOS

Kota Ando, Kodai Ueyoshi, Kentaro Orimo, Haruyoshi Yonekawa, more

2017 Symposium on VLSI Circuits > C24 - C25

2017 Symposium on VLSI Circuits

A versatile reconfigurable accelerator for binary/ternary deep neural networks (DNNs) is presented. It features a massively parallel in-memory processing architecture and stores varieties of binary/ternary DNNs with a maximum of 13 layers, 4.2 K neurons, and 0.8 M synapses on chip. The 0.6 W, 1.4 TOPS chip achieves performance and energy efficiency that is 10–10² and 10²–10⁴ times better than a CPU/GPU/FPGA.

chapter

CPRring: A Structure-Aware Ring-Based Checkpointing Architecture for FPGA Computing

Hoang Gia Vu, Shinya Takamaeda-Yamazaki, Takashi Nakada, Yasuhiko Nakashima

2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) > 192

2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM)

In this paper, we present a new architecture for FPGA checkpointing along with an efficient mechanism. We then provide a static analysis of original HDL source code to reduce the cost of hardware for checkpointing functionality. Our evaluations show that with the proposals, checkpointing hardware causes small degradation in maximum clock frequency (less than 10%). The LUT overhead varies from 14.4% (Dijkstra)...

chapter

CPRtree: A Tree-Based Checkpointing Architecture for Heterogeneous FPGA Computing

Hoang Gia Vu, Supasit Kajkamhaeng, Shinya Takamaeda-Yamazaki, Yasuhiko Nakashima

2016 Fourth International Symposium on Computing and Networking (CANDAR) > 57 - 66

2016 Fourth International Symposium on Computing and Networking (CANDAR)

FPGAs provide reconfigurability and high performance for parallel applications. Modern FPGAs can be integrated in computing systems as accelerators so that they can combine with host CPU to execute offload applications. This integration puts more pressure on the fault tolerance of computing systems and the question how to improve the dependability becomes crucial. Similar to CPU-based system, checkpoint/restart...

chapter

Stop the World: A Lightweight Runtime Power-Capping Mechanism for FPGAs

Keisuke Fujimoto, Shinya Takamaeda-Yamazaki, Yasuhiko Nakashima

2016 Fourth International Symposium on Computing and Networking (CANDAR) > 361 - 367

2016 Fourth International Symposium on Computing and Networking (CANDAR)

Power-constrained computing is now becoming essential paradigm in both high performance computing and embedded systems. Power budget is dynamically assigned to each computing resource for improving energy efficiency and system throughput. Modern computer systems have accelerator devices, such as GPUs and FPGAs, for higher energy efficiency and performance. Therefore, power management mechanisms of...

chapter

CPU Meets VR: A Scalable 3D Representation of Manycores for Behavior Analysis

Hiromasa Kato, Satoshi Shimaya, Keisuke Fujimoto, Tomoya Kameda, more

2016 Fourth International Symposium on Computing and Networking (CANDAR) > 375 - 380

2016 Fourth International Symposium on Computing and Networking (CANDAR)

Modern microprocessors have a number of cores and complicated structures, such as multi-level caches. Behavior analysis of modern complicated processors is important for software performance optimizations, processor architecture researches, and education purposes. Currently, a number of tools are available for checking the behavior of processors such as processor simulators, debuggers and profilers...

chapter

A Distributed Memory Based Embedded CGRA for Accelerating Stencil Computations

Shohei Takeuchi, Yuttakon Yuttakonkit, Shinya Takamaeda-Yamazaki, Yasuhiko Nakashima

2015 Third International Symposium on Computing and Networking (CANDAR) > 385 - 391

2015 Third International Symposium on Computing and Networking (CANDAR)

Stencil computation is one of the basic but important operation patterns for various applications, such as image processing. Various GPU-based and application-specific hardware approaches have been recently proposed. However, available absolute energy capacity and hardware size are limited in embedded systems. Therefore, energy efficient, small footprint, and high performance accelerator is necessary...

chapter

Performance evaluation of 802.11ah Viterbi decoder for IoT applications

Thi Hong Tran, Hiromasa Kato, Shinya Takamaeda-Yamazaki, Yasuhiko Nakashima

2015 International Conference on Advanced Technologies for Communications (ATC) > 320 - 325

2015 International Conference on Advanced Technologies for Communications (ATC)

This research is our first step toward developing a low-complexity Viterbi decoder for IoT applications. We evaluate how the values of the Viterbi decoder's parameters, such as trace back length (L), input data bit-width (D), and LLR truncated value (E), affect the BER and PER of a communication system. The IEEE 802.11ah simulator is used with AWGN channel and BPSK modulation. Our simulation results...

chapter

A CGRA-Based Approach for Accelerating Convolutional Neural Networks

Masakazu Tanomoto, Shinya Takamaeda-Yamazaki, Jun Yao, Yasuhiko Nakashima

2015 IEEE 9th International Symposium on Embedded Multicore/Many-core Systems-on-Chip > 73 - 80

2015 IEEE 9th International Symposium on Embedded Multicore/Many-core Systems-on-Chip (MCSoC)

Convolutional neural network (CNN) is an emerging approach for achieving high recognition accuracy in various machine learning applications. To accelerate CNN computations, various GPU-based or application-specific hardware approaches have been recently proposed. However, since they require large computing hardware and absolute energy amount, they are not suitable for embedded applications. In this...

chapter

Performance Evaluation of a 3D-Stencil Library for Distributed Memory Array Accelerators

Yoshikazu Inagaki, Shinya Takamaeda-Yamazaki, Jun Yao, Yasuhiko Nakashima

2014 Second International Symposium on Computing and Networking > 388 - 393

2014 Second International Symposium on Computing and Networking (CANDAR)

EMAX: Energy-aware Multimode Accelerator Extension is equipped with distributed single-port local memories and ring-formed interconnections. The accelerator is designed to achieve extremely high throughput for scientific computations, big data and image processing and also to achieve low power consumption. However, before mapping algorithms on the accelerator, application developers should have sufficient...

chapter

A framework for efficient rapid prototyping by virtually enlarging FPGA resources

Shinya Takamaeda-Yamazaki, Kenji Kise

2014 International Conference on ReConFigurable Computing and FPGAs (ReConFig14) > 1 - 8

2014 International Conference on ReConFigurable Computing and FPGAs (ReConFig)

Rapid prototyping using FPGAs is a widely-applied approach for efficient evaluation of hardware structures. We present a rapid prototyping framework by virtually enlarging available FPGA resources. In order to mitigate the development complexity of FPGA-based hardware prototype, the framework provides two abstractions of resources on FPGA platforms: Memory systems and inter-FPGA interconnections on...

chapter

flipSyrup: Cycle-accurate hardware simulation framework on abstract FPGA platforms

Shinya Takamaeda-Yamazaki, Kenji Kise

2014 24th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 4

2014 24th International Conference on Field Programmable Logic and Applications (FPL)

FPGA-based rapid prototyping is widely applied for fast simulations of hardware structure verifications. In this paper, we propose flipSyrup, a prototyping framework for cycle-accurate hardware simulations on abstract FPGA platforms. In order to mitigate the development complexity of FPGA-based simulators, the framework provides two abstractions of resources on FPGA platforms: Memory systems and inter-FPGA...

chapter

Ultrasmall: The smallest MIPS soft processor

Hiroshi Nakatsuka, Yuichiro Tanaka, Thiem Van Chu, Shinya Takamaeda-Yamazaki, more

2014 24th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 4

2014 24th International Conference on Field Programmable Logic and Applications (FPL)

Soft processors have been commonly used in FPGA-based designs to perform various useful functions. Some of these functions are not performance-critical and required to be implemented using very few FPGA resources. For such cases, it is desired to reduce circuit area of the soft processor as much as possible. This paper proposes Ultrasmall, a small soft processor for FPGAs. Ultrasmall supports a subset...

chapter

Towards a Low-Power Accelerator of Many FPGAs for Stencil Computations

Ryohei Kobayashi, Shinya Takamaeda-Yamazaki, Kenji Kise

2012 Third International Conference on Networking and Computing > 343 - 349

2012 Third International Conference on Networking and Computing (ICNC)

We have proposed the effective stencil computation method and the architecture by employing multiple small FPGAs with 2D-mesh topology. In this paper, we show that our proposed architecture works correctly on the real 2D-mesh connected FPGA array. We developed a software simulator in C++, which emulates our proposed architecture, and implemented two prototype systems in Verilog HDL. One prototype...

