Search results for: Wei Zhang

Items from 1 to 20 out of 22 results

chapter

WCET analysis of the shared data cache in integrated CPU-GPU architectures

Yijie Huangfu, Wei Zhang

2017 IEEE High Performance Extreme Computing Conference (HPEC) > 1 - 7

2017 IEEE High Performance Extreme Computing Conference (HPEC)

By taking the advantages of both CPU and GPU as well as the shared DRAM and cache, the integrated CPU-GPU architecture has the potential to boost the performance for a variety of applications, including real-time applications as well. However, before being applied to the hard real-time and safety-critical applications, the time-predictability of the integrated CPU-GPU architecture needs to be studied...

chapter

Binarized Mode Seeking for Scalable Visual Pattern Discovery

Wei Zhang, Xiaochun Cao, Rui Wang, Yuanfang Guo, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6827 - 6835

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper studies visual pattern discovery in large-scale image collections via binarized mode seeking, where images can only be represented as binary codes for efficient storage and computation. We address this problem from the perspective of binary space mode seeking. First, a binary mean shift (bMS) is proposed to discover frequent patterns via mode seeking directly in binary space. The binomial-based...

chapter

Static WCET Analysis of GPUs with Predictable Warp Scheduling

Yijie Huangfu, Wei Zhang

2017 IEEE 20th International Symposium on Real-Time Distributed Computing (ISORC) > 101 - 108

2017 IEEE 20th International Symposium on Real-Time Distributed Computing (ISORC)

The capability of GPUs to accelerate general-purpose applications that can be parallelized into massive number of threads makes it promising to apply GPUs to real-time applications as well, where high throughput and intensive computation are also needed. However, due to the different architecture and programming model of GPUs, the worst-case execution time (WCET) analysis methods and techniques designed...

chapter

FP-DNN: An Automated Framework for Mapping Deep Neural Networks onto FPGAs with RTL-HLS Hybrid Templates

Yijin Guan, Hao Liang, Ningyi Xu, Wenqiang Wang, more

2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM) > 152 - 159

2017 IEEE 25th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM)

DNNs (Deep Neural Networks) have demonstrated great success in numerous applications such as image classification, speech recognition, video analysis, etc. However, DNNs are much more computation-intensive and memory-intensive than previous shallow models. Thus, it is challenging to deploy DNNs in both large-scale data centers and real-time embedded systems. Considering performance, flexibility, and...

chapter

FlexCL: An analytical performance model for OpenCL workloads on flexible FPGAs

Shuo Wang, Yun Liang, Wei Zhang

2017 54th ACM/EDAC/IEEE Design Automation Conference (DAC) > 1 - 6

2017 54th ACM/EDAC/IEEE Design Automation Conference (DAC)

The recent adoption of OpenCL programming model by FPGA vendors has realized the function portability of OpenCL workloads on FPGA. However, the poor performance portability prevents its wide adoption. To harness the power of FPGAs using OpenCL programming model, it is advantageous to design an analytical performance model to estimate the performance of OpenCL workloads on FPGAs and provide insights...

chapter

Relational query processing on OpenCL-based FPGAs

Zeke Wang, Johns Paul, Hui Yan Cheah, Bingsheng He, more

2016 26th International Conference on Field Programmable Logic and Applications (FPL) > 1 - 10

2016 26th International Conference on Field Programmable Logic and Applications (FPL)

The release of OpenCL support for FPGAs represents a significant improvement in extending database applications to the reconfigurable domain. Taking advantage of the programmability offered by the OpenCL HLS tool, an OpenCL database can be easily ported and re-designed for FPGAs. A single SQL query in these database systems usually consists of multiple operators, and each one of these operators in...

chapter

HeteroSim: A heterogeneous CPU-FPGA simulator

Liang Feng, Hao Liang, Sharad Sinha, Wei Zhang

2016 26th International Conference on Field Programmable Logic and Applications (FPL) > 1

2016 26th International Conference on Field Programmable Logic and Applications (FPL)

Heterogeneous computing is rapidly gaining increased attention due to the promise it holds in overcoming power and performance walls in traditional computing systems. With its focus on customized processing nodes dedicated to the different tasks in an application, it is hoped that these walls will be overcome. Therefore, CPU-FPGA co-architectures are also gaining ground in application areas like recognition,...

chapter

Optimized Inter-domain Communications Among Multiple Virtual Machines Based on Shared Memory

Congfeng Jiang, Jian Wan, Hongyuan Wu, Wei Zhang, more

2015 IEEE 17th International Conference on High Performance Computing and Communications, 2015 IEEE 7th International Symposium on Cyberspace Safety and Security, and 2015 IEEE 12th International Conference on Embedded Software and Systems > 921 - 922

2015 IEEE 17th International Conference on High Performance Computing and Communications (HPCC), 2015 IEEE 7th International Symposium on Cyberspace Safety and Security (CSS) and 2015 IEEE 12th International Conf on Embedded Software and Systems (ICESS)

In elastic cloud computing environment, multiple virtual machines may reside in the same physical machine for services consolidation. For the same residential guest domains or multi-tiered hosting services, the inter-domain communications are complex and frequent. However, traditional inter-domain communications are conducted through the virtual network interfaces of both sending and receiving virtual...

chapter

Hardware-Based and Hybrid L1 Data Cache Bypassing to Improve GPU Performance

Yijie Huangfu, Wei Zhang

Intelligent GPU cache bypassing can improve the efficiency of using GPU memory bandwidth, which can benefit GPU performance. In this paper, we study a pure hardware-based GPU cache bypassing method that can be applied to GPU applications without having to recompile the programs. Moreover, we introduce a hybrid method that can exploit profiling information to further enhance the hardware-based bypassing...

chapter

Boosting GPU Performance by Profiling-Based L1 Data Cache Bypassing

Yijie Huangfu, Wei Zhang

2015 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing > 1119 - 1122

2015 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid)

Cache memories have been introduced in recent generations of Graphics Processing Units (GPUs) to benefit general-purpose computing on GPUs (GPGPUs). In this work, we analyze the memory access patterns of GPGPU applications and propose a cost-effective profiling-based method to identify the data accesses that should bypass the L1 data cache to improve performance. The evaluation indicates that the...

chapter

Real-Time GPU Computing: Cache or No Cache?

Yijie Huangfu, Wei Zhang

2015 IEEE 18th International Symposium on Real-Time Distributed Computing > 182 - 189

2015 IEEE 18th International Symposium on Real-Time Distributed Computing (ISORC)

Recent Graphics Processing Units (GPUs) have employed cache memories to boost performance. However, cache memories are well known to be harmful to time predictability for CPUs. For high-performance real-time systems using GPUs, it remains unknown whether or not cache memories should be employed. In this paper, we quantitatively compare the performance for GPUs with and without caches, and find that...

chapter

Defend GPUs against DoS attacks

Wei Zhang

2013 IEEE 32nd International Performance Computing and Communications Conference (IPCCC) > 1 - 2

2013 IEEE 32nd International Performance Computing and Communications Conference (IPCCC)

Graphics Processing Units (GPUs) have become a popular choice for general-purpose high-performance computing. Encryption and decryption algorithms such as the Advanced Encryption Standard (AES) have been implemented on GPUs to gain significant speedup. However, the security of the GPU architecture is not well studied, making it potentially risky to offload sensitive computation to GPUs. In this paper,...

chapter

A hardware-based computational platform for Generalized Laguerre-Volterra MIMO model for neural activities

Will X. Y. Li, Rosa H. M. Chan, Wei Zhang, Ray C. C. Cheung, more

2011 Annual International Conference of the IEEE Engineering in Medicine and Biology Society > 7282 - 7285

2011 33rd Annual International Conference of the IEEE Engineering in Medicine and Biology Society

A parallelized and pipelined architecture based on FPGA and a higher-level Self Reconfiguration Platform are proposed in this paper to model Generalized Laguerre-Volterra MIMO system essential in identifying the time-varying neural dynamics underlying spike activities. Our proposed design is based on the Xilinx Virtex-6 FPGA platform and the processing core can produce data samples at a speed of 1...

chapter

Spatially adaptive image deblurring based on nonlocal means

Ming Zhao, Wei Zhang, Zhile Wang, Fugang Wang

2010 3rd International Congress on Image and Signal Processing > 2 > 853 - 858

3rd International Congress on Image and Signal Processing (CISP 2010)

The deconvolution of blurred and noisy images is an ill-posed inverse problem, which can be regularized under the Bayesian framework by introducing an appropriate image prior. In this paper, inspired by the state-of-art nonlocal means(NLM) denoising technique which exploits the similarity of the image patches, we construct an inhomogeneous and anisotropic image prior under the Markov random field...

chapter

Least squares support vector machines based on fuzzy rough set

Zhi-Wei Zhang, De-Gang Chen, Qiang He, Hui Wang

2010 IEEE International Conference on Systems, Man and Cybernetics > 3834 - 3838

2010 IEEE International Conference on Systems, Man and Cybernetics (SMC 2010)

In this paper, a new approach to improve least squares support vector machines is presented. We consider the membership of every sample in constraints, that is to say, every sample are not fully assigned to one class. The membership is computed by employing the technique of fuzzy rough sets, and then a new least squares support vector machine algorithm based on fuzzy rough sets is proposed, experiments...

chapter

High quality artifact-free super-resolution

Wei Zhang, Wai-Kuen Cham

2010 IEEE International Conference on Image Processing > 889 - 892

2010 17th IEEE International Conference on Image Processing (ICIP 2010)

Blurring and jaggy artifacts are the primal culprits that plague the current super-resolution techniques. In this paper, we propose a simple but effective approach which is capable of producing a pleasant artifact-free high-resolution image from a single low-resolution input. Specifically, we first magnify the low-resolution image to the desired resolution through structure adaptive interpolation...

chapter

Face recognition with supervised spectral regression and multiple kernel SVM

Yongliang Xiao, Limin Xia, Wei Zhang

2010 2nd International Conference on Advanced Computer Control > 4 > 343 - 346

2010 2nd International Conference on Advanced Computer Control (ICACC 2010)

Face recognition plays an every important role in security surveillance, secure access and identity authentication. In this paper, we propose a novel face recognition method based on supervised learning. Our method consists in first extracting face feature using a supervised spectral regression, then we use multiple kernel SVM to classify face. Experimental results on Yale B face database and AR face...

chapter

SVM optimized scheme based PSO in application of engineering industry process

Ming-Bao Li, Jia-Wei Zhang

2009 International Conference on Machine Learning and Cybernetics > 3 > 1246 - 1251

2009 Eighth International Conference on Machine Learning and Cybernetics (ICMLC)

Aimed to the problem that it is hardship to get real-time and on-line measuring parameters in wood drying process, a novel PSO-SVM model that hybridized the particle swarm optimization (PSO) and support vector machines (SVM) to improve the nonlinearity caused by ambient temperature and other disturbance factors is presented. Support vector machines (SVM) based on statistical learning theory and structural...

chapter

Network Traffic Analysis Using Refined Bayesian Reasoning to Detect Flooding and Port Scan Attacks

Dai-ping Liu, Ming-wei Zhang, Tao Li

2008 International Conference on Advanced Computer Theory and Engineering > 1000 - 1004

2008 International Conference on Advanced Computer Theory and Engineering (ICACTE)

Dynamical analysis of the current network status is critical to detect large scale intrusions and to ensure the networks to continually function. Collecting and analyzing traffic in real time and reporting the current status in time provide a feasible way. In this paper we used a refined naive Bayes method, naive Bayes kernel estimator (NBKE), to identify flooding attacks and port scans from normal...

chapter

A single image based blind super-resolution approach

Wei Zhang, Wai-Kuen Cham

2008 15th IEEE International Conference on Image Processing > 329 - 332

2008 15th IEEE International Conference on Image Processing - ICIP 2008

In this paper, we address the problem of producing super- resolved image from a single low-resolution input. Unlike most previous work, the camera's point spread function (PSF) is not assumed to be known in advance and the single image super-resolution problem is formulated as a blind deconvolution problem under a MAP framework which can be optimized effectively in an iterative manner. Experimental...

Keywords:
KERNEL
Publication type:
book

Publication date

Set your own date range

Keywords

GRAPHICS PROCESSING UNITS (6)
COMPUTER ARCHITECTURE (5)
FIELD PROGRAMMABLE GATE ARRAYS (5)
INSTRUCTION SETS (5)
TRAINING (4)
ANALYTICAL MODELS (3)
BENCHMARK TESTING (3)
HARDWARE (3)
SUPPORT VECTOR MACHINES (3)
ACCURACY (2)
BAYES METHODS (2)
CACHE MEMORY (2)
COMPUTATIONAL MODELING (2)
DATA MODELS (2)
DEBLURRING (2)
DECONVOLUTION (2)
DELAYS (2)
ESTIMATION (2)
GPU (2)
IMAGE EDGE DETECTION (2)
IMAGE RESOLUTION (2)
IMAGE RESTORATION (2)
INTERPOLATION (2)
MATHEMATICAL MODEL (2)
NIOBIUM (2)
OPTIMIZATION (2)
PIXEL (2)
REAL-TIME SYSTEMS (2)
SVM (2)
ACCELERATION (1)
ACTUATORS (1)
ADAPTATION MODEL (1)
AR FACE DATABASE (1)
ARTIFACT-FREE HIGH RESOLUTION IMAGE (1)
AUTOMATION (1)
BAGGING (1)
BANDWIDTH (1)
BAYESIAN FRAMEWORK (1)
BAYESIAN METHODS (1)
BINARY CODES (1)
BLIND DECONVOLUTION PROBLEM (1)
CACHE BYPASSING (1)
CAMERA POINT SPREAD FUNCTION (1)
CENTRAL PROCESSING UNIT (1)
CIRCULAR BUFFER OPTIMIZATION (1)
CLASSIFICATION ALGORITHMS (1)
COMPUTER CRIME (1)
CONSTANT DELAYS (1)
CUDA (1)
DATABASES (1)
DEEP NEURAL NETWORKS (1)
DELAY (1)
DISPERSION (1)
DIVERSIFIED SUPPORT VECTOR MACHINES (1)
DOS ATTACK (1)
DYNAMIC SCHEDULING (1)
DYNAMICAL ANALYSIS (1)
ENCRYPTION (1)
ENGINEERING INDUSTRY PROCESS (1)
ENSEMBLE (1)
ENSEMBLE IMPLEMENTATIONS (1)
EQUATIONS (1)
FACE (1)
FACE RECOGNITION (1)
FACE RECOGNITION METHOD (1)
FEATURE EXTRACTION (1)
FLICKR (1)
FLOODING DETECTION (1)
FLOODS (1)
FPGA (1)
FUZZY CONTROL (1)
FUZZY MEMBERSHIP (1)
FUZZY PID (1)
FUZZY PID CONTROL METHOD (1)
FUZZY ROUGH SETS (1)
FUZZY TRANSITIVE KERNELS (1)
GABOR FILTERS (1)
GENERATORS (1)
HAND-IDENTIFIED TRAFFIC INSTANCE (1)
HIGH QUALITY ARTIFACT-FREE SUPER RESOLUTION (1)
IDENTITY AUTHENTICATION (1)
ILL-POSED INVERSE PROBLEM (1)
IMAGE DECONVOLUTION (1)
IMAGE DENOISING (1)
IMAGE PATCHES SIMILARITY (1)
IMAGE SUPER-RESOLUTION (1)
IMAGE-BASED BLIND SUPER-RESOLUTION (1)
INTER-DOMAIN COMMUNICATION (1)
INTERNET (1)
INVERSE PROBLEMS (1)
IONOSPHERE (1)
ITERATIVE METHOD (1)
ITERATIVE METHODS (1)
JAGGY ARTIFACT (1)
LARGE SCALE INTRUSIONS (1)
LEARNING (ARTIFICIAL INTELLIGENCE) (1)
LEAST SQUARES SUPPORT VECTOR MACHINES (1)
LINUX (1)
LS-SVM (1)
more

INFONA - science communication portal

Search results for: Wei Zhang

WCET analysis of the shared data cache in integrated CPU-GPU architectures

Binarized Mode Seeking for Scalable Visual Pattern Discovery

Static WCET Analysis of GPUs with Predictable Warp Scheduling

FP-DNN: An Automated Framework for Mapping Deep Neural Networks onto FPGAs with RTL-HLS Hybrid Templates

FlexCL: An analytical performance model for OpenCL workloads on flexible FPGAs

Relational query processing on OpenCL-based FPGAs

HeteroSim: A heterogeneous CPU-FPGA simulator

Optimized Inter-domain Communications Among Multiple Virtual Machines Based on Shared Memory

Hardware-Based and Hybrid L1 Data Cache Bypassing to Improve GPU Performance

Boosting GPU Performance by Profiling-Based L1 Data Cache Bypassing

Real-Time GPU Computing: Cache or No Cache?

Defend GPUs against DoS attacks

A hardware-based computational platform for Generalized Laguerre-Volterra MIMO model for neural activities

Spatially adaptive image deblurring based on nonlocal means

Least squares support vector machines based on fuzzy rough set

High quality artifact-free super-resolution

Face recognition with supervised spectral regression and multiple kernel SVM

SVM optimized scheme based PSO in application of engineering industry process

Network Traffic Analysis Using Refined Bayesian Reasoning to Detect Flooding and Port Scan Attacks

A single image based blind super-resolution approach

Filter options

Publication date

Keywords

INFONA - science communication portal

Search results for: Wei Zhang

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options