Search results for: Tao Tang

Items from 1 to 10 out of 10 results

chapter

A 0.18µm, 0.6V, 83.5µW integer DCT processor for neural signal applications

Tao Tang, Wang Ling Goh, Xin Liu, Chao Wang

2016 International Symposium on Integrated Circuits (ISIC) > 1 - 4

Neural recording is one of the most noteworthy technologies in today's world, where the large amount of neural signal recorded over prolonged durations consumes hefty time and energy for data transmission. During the past decade, the discrete cosine transform (DCT) has been used for data compression in biomedical applications due to its high energy efficiency. In this paper, a multiplication-free integer...

chapter

Evaluating the Performance Impact of Multiple Streams on the MIC-Based Heterogeneous Platform

Zhaokui Li, Jianbin Fang, Tao Tang, Xuhao Chen, more

2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) > 1341 - 1350

Using multiple streams can improve overall system performance by mitigating the data transfer overhead on heterogeneous systems. Prior work focuses largely on GPUs, but little is known about the performance impact on the (Intel Xeon) Phi. In this work, we apply multiple streams to six real-world applications on Phi. We then systematically evaluate the performance benefits of using multiple streams...

article

A Kernel Clustering Algorithm With Fuzzy Factor: Application to SAR Image Segmentation

Deliang Xiang, Tao Tang, Canbin Hu, Yu Li, more

IEEE Geoscience and Remote Sensing Letters > 2014 > 11 > 7 > 1290 - 1294

The presence of multiplicative noise in synthetic aperture radar (SAR) images makes segmentation and classification difficult to handle. Although a fuzzy C-means (FCM) algorithm and its variants (e.g., the FCM_S, the fast generalized FCM, the fuzzy local information C-means, etc.) can achieve satisfactory segmentation results and are robust to Gaussian noise, uniform noise, and salt and pepper noise,...

chapter

Sim-spm: A SimpleScalar-Based Simulator for Multi-level SPM Memory Hierarchy Architecture

Xiaoguang Ren, Yuhua Tang, Tao Tang, Sen Ye, more

2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC) > 17 - 23

2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC 2010)

As a fast on-chip SRAM managed by software (the application and/or compiler), Scratchpad Memory (SPM) is widely used in many fields. This paper presents Sim-spm, a SimpleScalar-based simulator for multi-level SPM memory hierarchy architectures. We successfully simulate the hardware of the multi-level SPM memory hierarchy by extending Sim-outorder, an out-of-order simulator from SimpleScalar....

chapter

Mapping OpenMP concepts to the stream programming model

Tao Tang, Yisong Lin, Xiaoguang Ren

2010 5th International Conference on Computer Science & Education > 1900 - 1905

2010 5th International Conference on Computer Science & Education (ICCSE 2010)

OpenMP is a widely used parallel programming model on traditional multi-core processors. Generally, OpenMP is used to develop fine-grained parallelism through a multi-thread model. The stream programming model is a new kind of parallel programming model for stream architectures. OpenMP bears a resemblance to the stream programming model at some level. The transformation between the two models has attracted...

chapter

A Data Communication Scheduler for Stream Programs on CPU-GPU Platform

Tao Tang, Xinhai Xu, Yisong Lin

2010 10th IEEE International Conference on Computer and Information Technology > 139 - 146

2010 IEEE 10th International Conference on Computer and Information Technology (CIT)

In recent years, heterogeneous parallel systems have become a research focus in the high-performance computing field. Generally, in a heterogeneous parallel system, the CPU provides the basic computing environment and a special-purpose accelerator (a GPU in this paper) provides high computing performance. However, the overall performance of the system is prone to be limited by the data communication between...

chapter

Spatial statistical modeling of the pollution impact of old industrial sites on colon and lung cancer incidents in New York State, USA

Tao Tang, C Anderson

2010 18th International Conference on Geoinformatics > 1 - 4

This research visualizes the spatial patterns of diagnosed colon and lung cancer mortalities across New York State. Kernel density analysis was applied to visualize the spatial patterns of old industrial sites across the state. Geographically Weighted Regression (GWR) was applied to model the possible pollution impact of old industrial sites on colon and lung cancer incidents. GWR is a local spatial...

chapter

Program Optimization of Array-Intensive SPEC2k Benchmarks on Multithreaded GPU Using CUDA and Brook+

Guibin Wang, Tao Tang, Xudong Fang, Xiaoguang Ren

2009 15th International Conference on Parallel and Distributed Systems > 292 - 299

2009 IEEE 15th International Conference on Parallel and Distributed Systems (ICPADS 2009)

Graphic Processing Unit (GPU), with many light-weight data-parallel cores, can provide substantial parallel computing power to accelerate several general purpose applications. Both AMD and NVIDIA provide their own high-performance GPUs and software platforms. As the floating-point computing capacity increases continually, the problem of the "memory wall" becomes more serious, especially...

chapter

Program Optimization of Stencil Based Application on the GPU-Accelerated System

Guibin Wang, Xuejun Yang, Ying Zhang, Tao Tang, more

2009 IEEE International Symposium on Parallel and Distributed Processing with Applications > 219 - 225

2009 IEEE International Symposium on Parallel and Distributed Processing with Applications (ISPA)

Graphic Processing Unit (GPU), with many light-weight data-parallel cores, can provide substantial parallel computational power to accelerate general purpose applications. However, this computing capacity cannot be fully utilized for memory-intensive applications, which are limited by off-chip memory bandwidth and latency. Stencil computation has abundant parallelism and low computational intensity...

chapter

Model-guided strip size selection for minimal execution time on Imagine stream processor

Jing Du, Yuhua Tang, Fujiang Ao, Tao Tang, more

2008 8th IEEE International Conference on Computer and Information Technology > 267 - 272

Strip-mining is a critical optimization for improving the effectiveness of the memory hierarchy of Imagine. In this paper, we present an efficient compiler algorithm for selecting the optimal strip size to minimize the execution time of stream programs. First, we build a graceful analytical model that characterizes the effect of strip size on key performance factors. Then, we design a novel algorithm...

INFONA - science communication portal