Search results for: Yasuaki Ito

Items from 1 to 20 out of 49 results

chapter

Simple and Fast Parallel Algorithms for the Voronoi Map and the Euclidean Distance Map, with GPU Implementations

Takumi Honda, Shinnosuke Yamamoto, Hiroaki Honda, Koji Nakano, more

2017 46th International Conference on Parallel Processing (ICPP) > 362 - 371

2017 46th International Conference on Parallel Processing (ICPP)

The complete Voronoi map of a binary image with black and white pixels is a matrix of the same size such that each element is the closest black pixel of the corresponding pixel. The complete Voronoi map visualizes the influence region of each black pixel. However, each region may not be connected due to exclave pixels. The connected Voronoi map is a modification of the complete Voronoi map so that...
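The definition in this abstract can be sketched by brute force. The following is a minimal illustration, not the parallel algorithm of the paper: it assumes the image is a list of rows with 1 for black and 0 for white, and the function names and the tie-breaking rule (smallest coordinates win) are choices made for this sketch only.

```python
import math


def complete_voronoi_map(image):
    """For every pixel, return the (row, col) of its closest black pixel.

    `image` is a list of rows; 1 marks a black pixel, 0 a white pixel.
    Brute force: every pixel is compared against every black pixel.
    """
    black = [(r, c)
             for r, row in enumerate(image)
             for c, value in enumerate(row) if value == 1]
    if not black:
        raise ValueError("image contains no black pixel")

    def nearest(r, c):
        # Squared Euclidean distance first; ties go to the smaller coordinate
        # (a tie-breaking rule assumed for this sketch).
        return min(black, key=lambda p: ((p[0] - r) ** 2 + (p[1] - c) ** 2, p))

    return [[nearest(r, c) for c in range(len(row))]
            for r, row in enumerate(image)]


def euclidean_distance_map(image):
    """Distance from every pixel to its closest black pixel."""
    vmap = complete_voronoi_map(image)
    return [[math.hypot(pr - r, pc - c) for c, (pr, pc) in enumerate(row)]
            for r, row in enumerate(vmap)]
```

The Euclidean distance map follows directly from the Voronoi map, which is why the two are usually computed together.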

chapter

Photomosaic Generation by Rearranging Subimages, with GPU Acceleration

Yi Yang, Yasuaki Ito, Koji Nakano

2017 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) > 942 - 951

2017 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)

The main contribution of this paper is to show a new photomosaic generation method by rearranging subimages of an image. In the photomosaic generation, an input image is divided into small subimages and they are rearranged such that the rearranged image reproduces another image given as a target image. Therefore, this problem can be considered as a combinatorial optimization problem to obtain the...

chapter

Accelerating the Smith-Waterman Algorithm Using Bitwise Parallel Bulk Computation Technique on GPU

Takahiro Nishimura, Jacir L. Bordim, Yasuaki Ito, Koji Nakano

2017 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) > 932 - 941

2017 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)

The bulk execution of a sequential algorithm is to execute it for many different inputs in turn or at the same time. It is known that the bulk execution of an oblivious sequential algorithm can be implemented to run efficiently on a GPU. The bulk execution supports fine grained bitwise parallelism, allowing it to achieve high acceleration over a straightforward sequential computation. The main contribution...

chapter

An Evaluation of the Parallella Architecture for the Convex Hull Computation

Keisuke Nakata, Yasuaki Ito

2016 Fourth International Symposium on Computing and Networking (CANDAR) > 704 - 706

2016 Fourth International Symposium on Computing and Networking (CANDAR)

The main contribution of this paper is to show an implementation of the parallel convex hull algorithm on the Parallella architecture. Parallella is a single-board computer with 16 mesh-connected cores. We have considered the memory architecture and mesh-connected network of the Parallella architecture. We evaluated the computing time and the energy-efficiency by comparing with various computing platforms...

chapter

GPU-Accelerated Bulk Computation of the Eigenvalue Problem for Many Small Real Non-symmetric Matrices

Hiroki Tokura, Takumi Honda, Yasuaki Ito, Koji Nakano, more

2016 Fourth International Symposium on Computing and Networking (CANDAR) > 490 - 496

2016 Fourth International Symposium on Computing and Networking (CANDAR)

The main contribution of this paper is to present a very efficient GPU implementation of bulk computation of eigenvalues for a large number of small non-symmetric real matrices. This work is motivated by the necessity of such bulk computation in the design of control systems, which requires computing the eigenvalues of hundreds of thousands of non-symmetric real matrices of size up to 30x30. In our GPU...

chapter

A Memory-Access-Efficient Implementation of the Approximate String Matching Algorithm on GPU

Lucas S. N. Nunes, J. L. Bordim, Koji Nakano, Yasuaki Ito

2016 Fourth International Symposium on Computing and Networking (CANDAR) > 483 - 489

2016 Fourth International Symposium on Computing and Networking (CANDAR)

The task of finding strings having a partial match to a given pattern is of interest to a number of practical applications, including DNA sequencing and text searching. Owing to its importance, alternatives to accelerate the Approximate String Matching (ASM) have been widely investigated in the literature. The main contribution of this work is to present a memory-access-efficient implementation for...

chapter

A Hardware Sorter for Almost Sorted Sequences, with FPGA Implementations

Naoaki Harada, Naoyuki Matsumoto, Koji Nakano, Yasuaki Ito

2016 Fourth International Symposium on Computing and Networking (CANDAR) > 565 - 571

2016 Fourth International Symposium on Computing and Networking (CANDAR)

Suppose that a sequence of sensing data with timestamps is transferred asynchronously. Some of the sensing data may be delayed by some period of time, so the sequence is not in proper increasing order of timestamps. A sequence of timestamps t_0, t_1, ..., t_{n-1} is d-sorted if t_i...

chapter

Accelerating Ant Colony Optimization for the Vertex Coloring Problem on the GPU

Ryouhei Murooka, Yasuaki Ito, Koji Nakano

2016 Fourth International Symposium on Computing and Networking (CANDAR) > 469 - 475

2016 Fourth International Symposium on Computing and Networking (CANDAR)

Vertex coloring is an assignment of colors to the vertices of an undirected graph such that no two vertices sharing an edge have the same color. The vertex coloring problem is to find the minimum number of colors necessary to color a given graph, which is an NP-hard problem in combinatorial optimization. Ant Colony Optimization (ACO) is a well-known meta-heuristic in which a colony of artificial ants...

chapter

Accelerating the CKY Parsing Using FPGAs

Jacir L. Bordim, Yasuaki Ito, Koji Nakano

Lecture Notes in Computer Science > High Performance Computing — HiPC 2002 > Algorithms I > 41-51

The main contribution of this paper is to present an FPGA-based implementation of an instance-specific hardware which accelerates the CKY (Cocke-Kasami-Younger) parsing for context-free grammars. Given a context-free grammar G and a string x, the CKY parsing determines if G derives x. We have developed a hardware generator that creates a Verilog HDL source to perform the CKY parsing for any given context-free...
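The decision problem stated in this abstract (does G derive x?) can be sketched in software as the textbook CKY table-filling procedure. This is a sequential illustration only, not the hardware design of the paper. It assumes the grammar is given in Chomsky normal form, and the names `cky_recognize`, `unary`, and `binary` are hypothetical.

```python
def cky_recognize(unary, binary, start, x):
    """Return True if the grammar derives the string x.

    unary  : iterable of (A, terminal) rules, meaning A -> terminal
    binary : iterable of (A, B, C) rules, meaning A -> B C
    start  : the start symbol
    The grammar is assumed to be in Chomsky normal form.
    """
    n = len(x)
    if n == 0:
        return False  # a CNF grammar without an epsilon rule derives no empty string

    # table[i][k] holds the nonterminals that derive x[i : i + k + 1]
    table = [[set() for _ in range(n)] for _ in range(n)]
    for i, ch in enumerate(x):
        table[i][0] = {a for a, terminal in unary if terminal == ch}

    for length in range(2, n + 1):            # substring length
        for i in range(n - length + 1):       # substring start
            cell = table[i][length - 1]
            for split in range(1, length):    # length of the left part
                left = table[i][split - 1]
                right = table[i + split][length - split - 1]
                for a, b, c in binary:
                    if b in left and c in right:
                        cell.add(a)
    return start in table[0][n - 1]
```

For example, with the rules S -> A B | A T, T -> S B, A -> a, B -> b, the recognizer accepts strings of the form a^n b^n with n >= 1.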

chapter

Bitwise Parallel Bulk Computation on the GPU, with Application to the CKY Parsing for Context-Free Grammars

Toru Fujita, Koji Nakano, Yasuaki Ito

2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) > 589 - 598

2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)

The main contribution of this paper is to present the Bitwise Parallel Bulk Computation (BPBC) technique, to accelerate bulk computation, which executes the same algorithm for many instances in turn or in parallel. The idea of the BPBC technique is to simulate a combinational logic circuit for 32 inputs at the same time using bitwise logic operators for 32-bit integers supported by most processing devices...
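The idea of evaluating a combinational circuit for 32 inputs at once can be illustrated with a one-bit full adder. The sketch below is an assumption-laden toy, not code from the paper: bit k of each 32-bit word carries the input of instance k, and the helper names `pack`, `unpack`, and `full_adder_bulk` are invented for this example.

```python
MASK32 = 0xFFFFFFFF  # keep Python integers within 32 bits


def pack(bits):
    """Pack up to 32 single-bit values into one word (bit k = instance k)."""
    word = 0
    for k, bit in enumerate(bits):
        word |= (bit & 1) << k
    return word


def unpack(word, count=32):
    """Inverse of pack: extract the lowest `count` bits as a list."""
    return [(word >> k) & 1 for k in range(count)]


def full_adder_bulk(a, b, carry_in):
    """Evaluate a full-adder circuit for 32 independent instances at once.

    Each gate of the circuit becomes one bitwise operation on 32-bit words,
    so a single call computes sum and carry-out for all 32 instances.
    """
    total = (a ^ b ^ carry_in) & MASK32
    carry_out = ((a & b) | (carry_in & (a ^ b))) & MASK32
    return total, carry_out
```

Packing the eight possible input combinations into the low eight bits and calling `full_adder_bulk` once reproduces the full truth table of the adder.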

chapter

An Efficient Implementation of LZW Decompression in the FPGA

Xin Zhou, Yasuaki Ito, Koji Nakano

2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) > 599 - 607

2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)

The LZW algorithm is one of the most famous dictionary-based compression and decompression algorithms. The main contribution of this paper is to present a hardware LZW decompression algorithm and to implement it in an FPGA. The experimental results show that one proposed module on a Virtex-7 family FPGA XC7VX485T-2 runs up to 2.16 times faster than sequential LZW decompression on a single CPU, where the...

chapter

Fast LZW Compression Using a GPU

Shunji Funasaka, Koji Nakano, Yasuaki Ito

2015 Third International Symposium on Computing and Networking (CANDAR) > 303 - 308

2015 Third International Symposium on Computing and Networking (CANDAR)

LZW compression is a well-known patented lossless compression method used in the Unix file compression utility "compress" and in the GIF and TIFF image formats. It converts an input string of characters (or 8-bit unsigned integers) into a string of codes using a code table (or dictionary) that maps strings into codes. Since the code table is generated by repeatedly adding newly appeared substrings...
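The code-table mechanism described in this abstract can be sketched as the standard sequential LZW compressor. This is a plain reference version under simplifying assumptions (codes are returned as a list of integers with no bit packing and no table-size limit); it is not the GPU method of the paper, and the function name is hypothetical.

```python
def lzw_compress(data: bytes) -> list:
    """Compress a byte string into a list of integer LZW codes.

    The code table starts with the 256 single-byte strings and grows by
    one entry each time a substring is seen that is not yet in the table.
    """
    table = {bytes([i]): i for i in range(256)}
    codes = []
    current = b""
    for byte in data:
        candidate = current + bytes([byte])
        if candidate in table:
            # Keep extending the longest match found in the table.
            current = candidate
        else:
            codes.append(table[current])    # emit code of the longest match
            table[candidate] = len(table)   # register the new substring
            current = bytes([byte])
    if current:
        codes.append(table[current])
    return codes
```

Because each new table entry depends on all earlier input, the loop is inherently sequential, which is the difficulty the abstract alludes to.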

chapter

A Flexible-Length-Arithmetic Processor Based on FDFM Approach in FPGAs

Tatsuya Kawamoto, Yasuaki Ito, Koji Nakano

2015 Third International Symposium on Computing and Networking (CANDAR) > 364 - 370

2015 Third International Symposium on Computing and Networking (CANDAR)

The main contribution of this paper is to present an intermediate approach of software and hardware using FPGAs. More specifically, we present a processor based on FDFM (Few DSP slices and Few Memory blocks) approach that supports arithmetic operations with flexibly many bits, and implement it in the Xilinx Virtex-6 FPGA. Arithmetic instructions of our processor architecture include addition, subtraction,...

chapter

A Fast Approximate String Matching Algorithm on GPU

Lucas S.N. Nunes, Jacir L. Bordim, Koji Nakano, Yasuaki Ito

2015 Third International Symposium on Computing and Networking (CANDAR) > 188 - 192

2015 Third International Symposium on Computing and Networking (CANDAR)

The approximate string matching (ASM) problem asks to find a substring of string Y of length n that is most similar to string X of length m. The ASM can be solved by dynamic programming technique, which computes a table of size m × n. The main contribution of this work is to present a memory-access-efficient implementation for computing the ASM on a GPU. The key idea of our implementation relies on...
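The dynamic programming table mentioned in this abstract can be sketched as follows. The sketch assumes unit-cost edit distance as the similarity measure (the abstract does not state the cost model) and keeps only one table row at a time; the function name and return convention are hypothetical, and the code is sequential rather than the GPU implementation.

```python
def approximate_match(x, y):
    """Find the substring of y most similar to x under unit-cost edit distance.

    Returns (distance, end), where `end` is the index in y just past the
    best-matching substring. The conceptual table has (m + 1) x (n + 1)
    cells, but only the previous row is stored.
    """
    m, n = len(x), len(y)
    prev = [0] * (n + 1)              # row 0: a match may start anywhere in y
    for i in range(1, m + 1):
        cur = [i] + [0] * n           # column 0: all i characters of x unmatched
        for j in range(1, n + 1):
            cost = 0 if x[i - 1] == y[j - 1] else 1
            cur[j] = min(prev[j - 1] + cost,   # match or substitute
                         prev[j] + 1,          # skip a character of x
                         cur[j - 1] + 1)       # skip a character of y
        prev = cur
    end = min(range(n + 1), key=lambda j: prev[j])
    return prev[end], end
```

Initializing the top row to zero is what distinguishes substring matching from whole-string edit distance.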

chapter

A Warp-Synchronous Implementation for Multiple-Length Multiplication on the GPU

Takumi Honda, Yasuaki Ito, Koji Nakano

2015 Third International Symposium on Computing and Networking (CANDAR) > 96 - 102

2015 Third International Symposium on Computing and Networking (CANDAR)

When large integers are processed on a computer, they are represented as multiple-length integers. Multiple-length multiplication is widely used in areas such as scientific computation and cryptography. However, the computation cost is very high since CPUs do not natively support multiple-length integers. In this paper, we present a GPU implementation of bulk multiple-length multiplications. The idea...
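To make the notion of a multiple-length integer concrete, here is a minimal sketch of schoolbook multiplication on 32-bit words. It is an illustration under assumptions: the 32-bit word size, the little-endian word order, and all function names are choices of this sketch, and the abstract does not say which multiplication algorithm the paper uses.

```python
WORD_BITS = 32
WORD_MASK = (1 << WORD_BITS) - 1


def to_words(value):
    """Split a non-negative integer into 32-bit words, least significant first."""
    words = [value & WORD_MASK]
    value >>= WORD_BITS
    while value:
        words.append(value & WORD_MASK)
        value >>= WORD_BITS
    return words


def from_words(words):
    """Reassemble an integer from little-endian 32-bit words."""
    return sum(word << (WORD_BITS * k) for k, word in enumerate(words))


def multiply_words(a, b):
    """Schoolbook multiplication of two multiple-length integers."""
    result = [0] * (len(a) + len(b))
    for i, a_word in enumerate(a):
        carry = 0
        for j, b_word in enumerate(b):
            # The partial product plus the accumulated word and carry
            # always fits in 64 bits.
            t = result[i + j] + a_word * b_word + carry
            result[i + j] = t & WORD_MASK
            carry = t >> WORD_BITS
        result[i + len(b)] = carry
    return result
```

Python integers are already arbitrary precision, so the word arrays here only mimic what fixed-width hardware has to do.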

chapter

Efficient GPU Implementations for the Conway's Game of Life

Toru Fujita, Daigo Nishikori, Koji Nakano, Yasuaki Ito

2015 Third International Symposium on Computing and Networking (CANDAR) > 11 - 20

2015 Third International Symposium on Computing and Networking (CANDAR)

Conway's Game of Life is the best-known cellular automaton. The universe of the Game of Life is a 2-dimensional array of cells, each of which takes one of two possible states, alive or dead. The state of every cell is repeatedly updated according to the states of its eight neighbors. A cell will be alive if exactly three neighbors are alive, or if it is alive and two or three neighbors are alive. The main...
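The update rule stated in this abstract translates directly into a sequential reference implementation. The sketch below assumes a finite grid whose outside cells are always dead (the abstract does not specify boundary handling), and `life_step` is a hypothetical name.

```python
def life_step(grid):
    """Return the next generation of a 2-D grid of 0 (dead) / 1 (alive) cells."""
    rows, cols = len(grid), len(grid[0])

    def live_neighbors(r, c):
        # Count alive cells among the eight neighbors; cells outside the
        # grid are treated as dead (an assumption of this sketch).
        total = 0
        for dr in (-1, 0, 1):
            for dc in (-1, 0, 1):
                if (dr, dc) == (0, 0):
                    continue
                nr, nc = r + dr, c + dc
                if 0 <= nr < rows and 0 <= nc < cols:
                    total += grid[nr][nc]
        return total

    nxt = [[0] * cols for _ in range(rows)]
    for r in range(rows):
        for c in range(cols):
            n = live_neighbors(r, c)
            # Alive next step if exactly three neighbors are alive, or if
            # the cell is alive and has two (or three) alive neighbors.
            if n == 3 or (grid[r][c] == 1 and n == 2):
                nxt[r][c] = 1
    return nxt
```

Every cell of the next generation depends only on the current generation, which is what makes the update a natural fit for data-parallel hardware.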

chapter

Parallelization Techniques for Error Diffusion with GPU Implementations

Akihiko Kasagi, Koji Nakano, Yasuaki Ito

2015 Third International Symposium on Computing and Networking (CANDAR) > 30 - 39

2015 Third International Symposium on Computing and Networking (CANDAR)

Error diffusion is a classical but still popular method for generating a binary image that reproduces an original gray-scale image. In error diffusion, pixel values are rounded to binary in raster scan order and the rounding error is distributed to neighboring pixels that have not yet been processed. The main contribution of this paper is to show several parallel algorithms and implementation techniques...
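The raster-scan procedure described in this abstract can be sketched with the Floyd-Steinberg weights (7/16, 3/16, 5/16, 1/16), one widely used choice of error distribution. The abstract does not say which weights the paper uses, so the weights, the threshold of 128, and the function name are assumptions of this sketch.

```python
def error_diffusion(gray):
    """Binarize a gray-scale image (values 0..255) by error diffusion.

    Pixels are rounded in raster scan order; each rounding error is pushed
    to neighbors that have not been processed yet, using the
    Floyd-Steinberg weights (assumed here).
    Returns a matrix of 0/1 values (1 = white).
    """
    height, width = len(gray), len(gray[0])
    work = [[float(v) for v in row] for row in gray]
    out = [[0] * width for _ in range(height)]
    # (dx, dy, weight) of the not-yet-processed neighbors
    spread = ((1, 0, 7 / 16), (-1, 1, 3 / 16), (0, 1, 5 / 16), (1, 1, 1 / 16))

    for y in range(height):
        for x in range(width):
            old = work[y][x]
            new = 255.0 if old >= 128 else 0.0
            out[y][x] = 1 if new else 0
            error = old - new
            for dx, dy, weight in spread:
                nx, ny = x + dx, y + dy
                if 0 <= nx < width and 0 <= ny < height:
                    work[ny][nx] += error * weight
    return out
```

Each pixel depends on errors from its left and upper neighbors, so the straightforward loop order is sequential.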

chapter

GPU-Accelerated Digital Halftoning by the Local Exhaustive Search

Hiroaki Kouge, Yasuaki Ito, Koji Nakano

2015 14th International Symposium on Parallel and Distributed Computing > 82 - 89

2015 14th International Symposium on Parallel and Distributed Computing (ISPDC)

The main contribution of this paper is to show a new GPU implementation for the digital halftoning by the local exhaustive search that can generate high quality binary images. We have considered programming issues of the GPU architecture to implement these two methods on the GPU. The experimental result shows that our GPU implementation for the local exhaustive search on NVIDIA GeForce GTX 980 for...

chapter

Optimal Parallel Hardware K-Sorter and Top K-Sorter, with FPGA Implementations

Naoyuki Matsumoto, Koji Nakano, Yasuaki Ito

2015 14th International Symposium on Parallel and Distributed Computing > 138 - 147

2015 14th International Symposium on Parallel and Distributed Computing (ISPDC)

This paper presents a FIFO-based parallel merge sorter optimized for the latest FPGA. More specifically, we show a sorter that sorts K keys in latency K + log2 K - 1 using log2 K comparators. It uses K/M + log2 K + log2 M - 1 memory blocks with capacity M to implement FIFOs. It receives K keys one by one in every clock cycle and outputs the sorted sequence of them from K + log2 K - 1 clock cycles...

chapter

Bulk GCD Computation Using a GPU to Break Weak RSA Keys

Toru Fujita, Koji Nakano, Yasuaki Ito

2015 IEEE International Parallel and Distributed Processing Symposium Workshop > 385 - 394

2015 IEEE International Parallel and Distributed Processing Symposium Workshop (IPDPSW)

RSA is one of the most well-known public-key cryptosystems widely used for secure data transfer. An RSA encryption key includes a modulus n which is the product of two large prime numbers p and q. If an RSA modulus n can be decomposed into p and q, the corresponding decryption key can be computed easily from them and the original message can be obtained using it. The RSA cryptosystem relies on the hardness...
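The connection between GCD computation and weak keys can be sketched as follows: if two moduli happen to share a prime factor, their GCD reveals it and both moduli are factored. This is a naive all-pairs illustration using Python's `math.gcd`, not the bulk GPU method of the paper; the function name and the tiny primes in the usage note are hypothetical.

```python
from itertools import combinations
from math import gcd


def find_shared_primes(moduli):
    """Factor every modulus that shares a prime with another modulus.

    Returns a dict mapping the index of each broken modulus to (p, q).
    All pairs are tested, so the cost grows quadratically with the
    number of moduli.
    """
    broken = {}
    for (i, n1), (j, n2) in combinations(enumerate(moduli), 2):
        g = gcd(n1, n2)
        # A GCD strictly between 1 and both moduli is a shared prime factor
        # (identical moduli give g == n1 and are skipped).
        if 1 < g < min(n1, n2):
            broken[i] = (g, n1 // g)
            broken[j] = (g, n2 // g)
    return broken
```

With the toy moduli 101*103, 103*107, and 109*113, the first two are factored through their shared prime 103 while the third is left untouched.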

Keywords

GPU (24)
CUDA (22)
GRAPHICS PROCESSING UNITS (12)
FPGA (11)
INSTRUCTION SETS (11)
PARALLEL ALGORITHMS (10)
COMPUTER ARCHITECTURE (8)
FIELD PROGRAMMABLE GATE ARRAYS (8)
MEMORY MACHINE MODELS (8)
PARALLEL PROCESSING (7)
RANDOM ACCESS MEMORY (7)
GPGPU (6)
IMAGE PROCESSING (6)
ACCELERATION (5)
BLOCK RAMS (5)
DIGITAL SIGNAL PROCESSING (5)
EMBEDDED BLOCK RAMS (5)
EMBEDDED DSP SLICES (5)
HOUGH TRANSFORM (5)
ALGORITHM DESIGN AND ANALYSIS (4)
PIPELINES (4)
APPROXIMATE STRING MATCHING (3)
APPROXIMATION ALGORITHMS (3)
BULK COMPUTATION (3)
COMPUTATIONAL MODELING (3)
EDIT DISTANCE (3)
HARDWARE (3)
HIDDEN MARKOV MODELS (3)
LINE DETECTION (3)
ANT COLONY OPTIMIZATION (2)
BANK CONFLICT (2)
BIG DATA (2)
BITWISE OPERATIONS (2)
BLOCK RAM (2)
DATA COMPRESSION (2)
DIGITAL HALFTONING (2)
DSP SLICES (2)
DYNAMIC PROGRAMMING (2)
HARDWARE ALGORITHMS (2)
HEURISTIC ALGORITHMS (2)
LOCAL EXHAUSTIVE SEARCH (2)
MEMORY ACCESS CONGESTION (2)
MEMORY BANK CONFLICTS (2)
MEMORY MANAGEMENT (2)
MONTGOMERY MODULAR MULTIPLICATION (2)
MULTIPLE-LENGTH-ARITHMETIC (2)
PARALLEL ALGORITHM (2)
PARALLEL SORTING ALGORITHMS (2)
PHASE CHANGE RANDOM ACCESS MEMORY (2)
PIPELINE (2)
RANDOMIZED TECHNIQUE (2)
REGISTERS (2)
SHARED MEMORY (2)
SIGNAL PROCESSING ALGORITHMS (2)
TEMPLATE MATCHING (2)
TRANSFORMS (2)
TWO DIMENSIONAL DISPLAYS (2)
WRITING (2)
ADDERS (1)
APPROXIMATION METHODS (1)
ART (1)
ASCII ART (1)
ASYNCHRONOUS READ OPERATIONS (1)
BROADCAST ALGORITHMS (1)
CANNY EDGE DETECTION ALGORITHM (1)
CELLULAR AUTOMATON (1)
CIRCLES DETECTION (1)
CIRCUIT REWRITING ALGORITHM (1)
CLASSIFICATION (1)
CLOCK CYCLE MEASUREMENT (1)
CLOCKS (1)
COALESCED MEMORY ACCESS (1)
COALESCING ACCESS (1)
COLLATZ CONJECTURE (1)
COLOR (1)
COMBINATIONAL CIRCUITS (1)
COMBINATORIAL OPTIMIZATION PROBLEM (1)
COMPUTE UNIFIED DEVICE ARCHITECTURE (1)
COMPUTER GRAPHIC EQUIPMENT (1)
COMPUTER SYSTEM (1)
COMPUTERS (1)
CONFERENCES (1)
CONTEXT-FREE GRAMMAR (1)
CONVENTIONAL SOFTWARE IMPLEMENTATION (1)
CONVEX HULL (1)
CONWAY'S GAME OF LIFE (1)
COPROCESSOR (1)
COPROCESSORS (1)
CORRELATION (1)
CPU (1)
DATA MINING (1)
DATA MOVEMENT (1)
DEBUGGING (1)
DICTIONARIES (1)
DISTRIBUTED PROCESSING (1)
DSP BLOCKS (1)
DSP SLICE (1)
DSP48 SLICE (1)
DSP48E BLOCKS (1)
EDGE DETECTION (1)
more

Data set

ieee (48)
Springer (1)

INFONA - science communication portal
