Search results

Items from 81 to 100 out of 506 results

chapter

An improved nonparametric CFAR method for ship detection in single polarization synthetic aperetuer radar imagery

S.R. Tian, C. Wang, H. Zhang

2016 IEEE International Geoscience and Remote Sensing Symposium (IGARSS) > 6637 - 6640

IGARSS 2016 - 2016 IEEE International Geoscience and Remote Sensing Symposium

In this letter, an improved kernel density estimation (KDE) constant false alarm rate (CFAR) method is proposed for ship detection in single polarization synthetic aperture radar (SAR) images. The proposed method consists of a target enhancement filter, an adaptive KDE bandwidth estimation method and an improved KDE-CFAR. The gravity-based target enhancement filter is utilized to remove the inhomogeneity...

chapter

Extended Kalman filter under maximum correntropy criterion

Xi Liu, Hua Qu, Jihong Zhao, Badong Chen

2016 International Joint Conference on Neural Networks (IJCNN) > 1733 - 1737

2016 International Joint Conference on Neural Networks (IJCNN)

As a nonlinear extension of Kalman filter, the extended Kalman filter (EKF) is also based on the minimum mean square error (MMSE) criterion. In general, the EKF performs well in Gaussian noises. But its performance may deteriorate substantially when the system is disturbed by heavy-tailed impulsive noises. In order to improve the robustness of EKF against impulsive noises, a new filter for nonlinear...

chapter

Real-time detection of performance anomalies for cloud services

Olumuyiwa Ibidunmoye, Thijs Metsch, Erik Elmroth

2016 IEEE/ACM 24th International Symposium on Quality of Service (IWQoS) > 1 - 2

2016 IEEE/ACM 24th International Symposium on Quality of Service (IWQoS)

Service performance degradation and downtimes are a common on the Internet today. Many on-line services (e.g. Amazon.com, Spotify, and Netflix, etc.) report huge loss in revenue and traffic per episode. This is perhaps due to the correlation between performance and end-users's satisfaction.

chapter

Employing Compression Solutions under OpenACC

Ebad Salehi, Ahmad Lashgar, Amirali Baniasadi

2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) > 348 - 356

2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)

For GPUs to achieve their peak performance, effective and efficient usage of memory bandwidth is necessary. To this end, programmers invest extensive development effort to optimize a GPU program, specially its memory bandwidth usage. The OpenACC programming model has been introduced to tackle the accelerators programming complexity. However, this model's coarse-grained control on a program can make...

chapter

Low power Convolutional Neural Networks on a chip

Yu Wang, Lixue Xia, Tianqi Tang, Boxun Li, more

2016 IEEE International Symposium on Circuits and Systems (ISCAS) > 129 - 132

2016 IEEE International Symposium on Circuits and Systems (ISCAS)

Deep learning, and especially Convolutional Neural Network (CNN, is among the most powerful and widely used techniques in computer vision. Applications range from image classification to object detection, segmentation, Optical Character Recognition (OCR), etc. At the same time, CNNs are both computationally intensive and memory intensive, making them difficult to be deployed on low power lightweight...

chapter

HMC-Sim-2.0: A Simulation Platform for Exploring Custom Memory Cube Operations

John D. Leidel, Yong Chen

2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW) > 621 - 630

2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)

The recent advent of stacked memory devices has led to a resurgence of researchassociated with the fundamental memory hierarchy and associated memory pipeline. The bandwidth advantages provided by stacked logic and DRAM devices haveinspired research associated with eliminating the bandwidth bottlenecksassociated with many applications in high performance computing. Further, recent efforts have focused...

chapter

Optimization of time-frequency curve description via kernel smoothing

Jitka Pomenkova, Eva Klejmova

2016 International Conference on Systems, Signals and Image Processing (IWSSIP) > 1 - 4

2016 International Conference on Systems, Signals and Image Processing (IWSSIP)

Presented paper deals with trend modeling of spectral coefficients represents material properties at very rapid load. We focus on identification and optimization of the curve describing dependence between frequencies and time in spectrogram. The spectrogram is firstly processed with the aim to specify significant spectral coefficients. Consequently for such coefficient we apply non-parametric kernel...

chapter

The weighted kernel density estimation methods for analysing reliability of electricity supply

Miroslaw Kornatka

2016 17th International Scientific Conference on Electric Power Engineering (EPE) > 1 - 4

2016 17th International Scientific Conference on Electric Power Engineering (EPE)

The paper presents an assessment of the reliability of medium voltage networks within a power company. Reliability of power supply in medium voltage networks is one of the commonly recognized targets of Smart Grid. Novel approaches are needed for evaluating the reliability of electricity distribution and the reliability of supply in distribution network planning. This paper presents a stochastic supply...

chapter

Poster Abstract: A Framework for Chainsaw Detection Using One-Class and WSNs

Juan G. Colonna, Bernardo B. Gatto, Eduardo F. Nakamura, Eulanda M. dos Santos

2016 15th ACM/IEEE International Conference on Information Processing in Sensor Networks (IPSN) > 1 - 2

2016 15th ACM/IEEE International Conference on Information Processing in Sensor Networks (IPSN)

The Amazon Rainforest degradation is a worldwide concern. The rainforest has been endangered by the illegal wood extraction without control even in the preservation areas. Due to the large geography extension prevent these crimes with an unmanned aerial vehicle (UAV) is not always possible. The Wireless Acoustics Sensor Network (WASNs) technology can alleviate this problem. Here, we present an acoustical...

chapter

A Capital Market Metaphor for Content Delivery Network Resources

Elias Vathias, Dimitris Nikolopoulos, Stathes Hadjiefthymiades

2016 IEEE 30th International Conference on Advanced Information Networking and Applications (AINA) > 101 - 108

2016 IEEE 30th International Conference on Advanced Information Networking and Applications (AINA)

We establish a framework that can be used by Origin Servers (content-generating organizations) for claiming Content Delivery Network (CDN) resources in a fine-grained way. The basis of our work lies in the use of Stocks as well as a Secondary Market for the stock trading, tools and products commonly used in modern capital markets. Network and disk resources are being monitored through well-established...

chapter

A Quantitative Performance Evaluation of Fast on-Chip Memories of GPUs

Elias Konstantinidis, Yiannis Cotronis

2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP) > 448 - 455

2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP)

Modern Graphics Processing Units (GPUs) have evolved to high performance general purpose processors, forming an alternative to CPUs. However, programming them effectively has proven to be a challenge, not only due to the mandatory requirement of extracting massive fine grained parallelism but also due to its susceptible performance on memory traffic. Apart from regular memory caches, GPUs feature...

chapter

Exploiting Very-Wide Vectors on Intel Xeon Phi with Lattice-QCD Kernels

Andreas Diavastos, Giannos Stylianou, Giannis Koutsou

2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP) > 296 - 300

2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP)

Our target in this work is to study ways of exploring the parallelism offered by vectorization on accelerators with very wide vector units. To this end, we implemented two kernels that derive from the Wilson Dslash operator and investigate several data layout techniques for increasing the scalability of lattice QCD scientific kernels suitable for the Intel Xeon Phi. In parts of the application where...

chapter

A Quantitative Performance Evaluation of Fast on-Chip Memories of GPUs

Elias Konstantinidis, Yiannis Cotronis

2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP) > 448 - 455

2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP)

chapter

Exploiting Very-Wide Vectors on Intel Xeon Phi with Lattice-QCD Kernels

Andreas Diavastos, Giannos Stylianou, Giannis Koutsou

2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP) > 296 - 300

2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP)

chapter

Scheduling techniques for GPU architectures with processing-in-memory capabilities

Ashutosh Pattnaik, Xulong Tang, Adwait Jog, Onur Kayiran, more

2016 International Conference on Parallel Architecture and Compilation Techniques (PACT) > 31 - 44

2016 International Conference on Parallel Architecture and Compilation Techniques (PACT)

Processing data in or near memory (PIM), as opposed to in conventional computational units in a processor, can greatly alleviate the performance and energy penalties of data transfers from/to main memory. Graphics Processing Unit (GPU) architectures and applications, where main memory bandwidth is a critical bottleneck, can benefit from the use of PIM. To this end, an application should be properly...

chapter

Automatically exploiting implicit Pipeline Parallelism from multiple dependent kernels for GPUs

Gwangsun Kim, Jiyun Jeong, John Kim, Mark Stephenson

2016 International Conference on Parallel Architecture and Compilation Techniques (PACT) > 339 - 350

2016 International Conference on Parallel Architecture and Compilation Techniques (PACT)

Execution of GPGPU workloads consists of different stages including data I/O on the CPU, memory copy between the CPU and GPU, and kernel execution. While GPU can remain idle during I/O and memory copy, prior work has shown that overlapping data movement (I/O and memory copies) with kernel execution can improve performance. However, when there are multiple dependent kernels, the execution of the kernels...

chapter

A data locality-aware design framework for reconfigurable sparse matrix-vector multiplication kernel

Sicheng Li, Yandan Wang, Wujie Wen, Yu Wang, more

2016 IEEE/ACM International Conference on Computer-Aided Design (ICCAD) > 1 - 6

2016 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)

Sparse matrix-vector multiplication (SpMV) is an important computational kernel in many applications. For performance improvement, software libraries designated for SpMV computation have been introduced, e.g., MKL library for CPUs and cuSPARSE library for GPUs. However, the computational throughput of these libraries is far below the peak floating-point performance offered by hardware platforms, because...

chapter

Caffeine: Towards uniformed representation and acceleration for deep convolutional neural networks

Chen Zhang, Zhenman Fang, Peipei Zhou, Peichen Pan, more

2016 IEEE/ACM International Conference on Computer-Aided Design (ICCAD) > 1 - 8

2016 IEEE/ACM International Conference on Computer-Aided Design (ICCAD)

With the recent advancement of multilayer convolutional neural networks (CNN), deep learning has achieved amazing success in many areas, especially in visual content understanding and classification. To improve the performance and energy-efficiency of the computation-demanding CNN, the FPGA-based acceleration emerges as one of the most attractive alternatives. In this paper we design and implement...

chapter

TEMP: Thread batch enabled memory partitioning for GPU

Mengjie Mao, Wujie Wen, Xiaoxiao Liu, Jingtong Hu, more

2016 53nd ACM/EDAC/IEEE Design Automation Conference (DAC) > 1 - 6

2016 53nd ACM/EDAC/IEEE Design Automation Conference (DAC)

As massive multi-threading in GPU imposes tremendous pressure on memory subsystems, efficient bandwidth utilization becomes a key factor affecting the GPU throughput. In this work, we propose thread batch enabled memory partitioning (TEMP), to improve GPU performance through the improvement of memory bandwidth utilization. In particular, TEMP clusters multiple thread blocks sharing the same set of...

chapter

A study on user-level remote memory extension system

Shinyoung Ahn, Gyuil Cha, Youngho Kim, Eunji Lim, more

2016 18th International Conference on Advanced Communication Technology (ICACT) > 234 - 239

2016 18th International Conference on Advanced Communication Technology (ICACT)

The speed of memory capacity expansion of the computer system has not kept up with the speed of the increase of the memory requirement of large memory applications. Also, big memory system has been too expensive for many researchers and students. Therefore, approaches to utilize remote memory has been considered as a cost effective way to run large memory applications in the cluster environment where...

Keywords:
BANDWIDTH
KERNEL

Publication date

Set your own date range

Content availability

Available (503)
None (3)

Keywords

ESTIMATION (80)
GRAPHICS PROCESSING UNITS (48)
LINUX (43)
COMPUTATIONAL MODELING (39)
OPTIMIZATION (39)
COMPUTER ARCHITECTURE (37)
MEMORY MANAGEMENT (37)
HARDWARE (35)
INSTRUCTION SETS (35)
SERVERS (32)
PERFORMANCE EVALUATION (31)
THROUGHPUT (31)
BENCHMARK TESTING (29)
CLUSTERING ALGORITHMS (28)
PROTOCOLS (28)
FIELD PROGRAMMABLE GATE ARRAYS (26)
RANDOM ACCESS MEMORY (25)
TRAINING (25)
GRAPHICS PROCESSING UNIT (24)
HISTOGRAMS (24)
GPU (23)
IMAGE SEGMENTATION (23)
KERNEL DENSITY ESTIMATION (23)
PARALLEL PROCESSING (23)
ALGORITHM DESIGN AND ANALYSIS (21)
MEAN SHIFT (20)
PIXEL (20)
SUPPORT VECTOR MACHINES (20)
DATA MODELS (19)
COPROCESSORS (18)
PROGRAM PROCESSORS (18)
REGRESSION ANALYSIS (18)
SMOOTHING METHODS (18)
FEATURE EXTRACTION (17)
IMAGE COLOR ANALYSIS (17)
REGISTERS (17)
ROBUSTNESS (17)
VECTORS (17)
DATA MINING (16)
DELAY (16)
LIBRARIES (16)
MATHEMATICAL MODEL (16)
MEASUREMENT (16)
MONITORING (16)
PROGRAMMING (16)
SHAPE (16)
IP NETWORKS (15)
NOISE (15)
PATTERN CLUSTERING (15)
SPARSE MATRICES (15)
TARGET TRACKING (15)
COMPUTER GRAPHIC EQUIPMENT (14)
DELAYS (14)
PROBABILITY DENSITY FUNCTION (14)
RECEIVERS (14)
ARRAYS (13)
CONVERGENCE (13)
MULTIPROCESSING SYSTEMS (13)
OBJECT DETECTION (13)
TRANSPORT PROTOCOLS (13)
ADAPTATION MODEL (12)
APPROXIMATION METHODS (12)
GPGPU (12)
POLYNOMIALS (12)
STANDARDS (12)
TRACKING (12)
MESSAGE PASSING (11)
MULTICORE PROCESSING (11)
OBJECT TRACKING (11)
OPERATING SYSTEM KERNELS (11)
QUALITY OF SERVICE (11)
VIRTUALIZATION (11)
COMPUTER VISION (10)
FPGA (10)
INTERNET (10)
ITERATIVE METHODS (10)
LEARNING (ARTIFICIAL INTELLIGENCE) (10)
MACHINE LEARNING (10)
RESOURCE MANAGEMENT (10)
SCHEDULING (10)
VIRTUAL MACHINING (10)
ACCURACY (9)
CLUSTERING (9)
CORRELATION (9)
CUDA (9)
EQUATIONS (9)
ESTIMATION THEORY (9)
GAUSSIAN PROCESSES (9)
MEDICAL IMAGE PROCESSING (9)
MEMORY BANDWIDTH (9)
OPENCL (9)
PROBABILITY (9)
SOCKETS (9)
STREAMING MEDIA (9)
SYNCHRONIZATION (9)
TCP (9)
TELECOMMUNICATION CONGESTION CONTROL (9)
ACCELERATION (8)
more

INFONA - science communication portal

Search results

An improved nonparametric CFAR method for ship detection in single polarization synthetic aperetuer radar imagery

Extended Kalman filter under maximum correntropy criterion

Real-time detection of performance anomalies for cloud services

Employing Compression Solutions under OpenACC

Low power Convolutional Neural Networks on a chip

HMC-Sim-2.0: A Simulation Platform for Exploring Custom Memory Cube Operations

Optimization of time-frequency curve description via kernel smoothing

The weighted kernel density estimation methods for analysing reliability of electricity supply

Poster Abstract: A Framework for Chainsaw Detection Using One-Class and WSNs

A Capital Market Metaphor for Content Delivery Network Resources

A Quantitative Performance Evaluation of Fast on-Chip Memories of GPUs

Exploiting Very-Wide Vectors on Intel Xeon Phi with Lattice-QCD Kernels

A Quantitative Performance Evaluation of Fast on-Chip Memories of GPUs

Exploiting Very-Wide Vectors on Intel Xeon Phi with Lattice-QCD Kernels

Scheduling techniques for GPU architectures with processing-in-memory capabilities

Automatically exploiting implicit Pipeline Parallelism from multiple dependent kernels for GPUs

A data locality-aware design framework for reconfigurable sparse matrix-vector multiplication kernel

Caffeine: Towards uniformed representation and acceleration for deep convolutional neural networks

TEMP: Thread batch enabled memory partitioning for GPU

A study on user-level remote memory extension system

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options