Search results

chapter

A new transformation algorithm for multi-granularity unbalanced linguistic terms

Xianqin Wang, Bin Zhou, Liangzhong Yi, Xiaohui Li

2017 IEEE 2nd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC) > 1640 - 1644

2017 IEEE 2nd Advanced Information Technology, Electronic and Automation Control Conference (IAEAC)

The aim of this paper is to propose a transformation algorithm for multi-granularity linguistic information assessed in different unbalanced linguistic term sets together with its application in linguistic group decision making (LGDM) problem. Assuming that the linguistic information given to the alternatives by different decision makers distribute in different granularity and/or semantic term sets...

chapter

An embedding mechanism for natural steganography after down-sampling

Patrick Bas

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 2127 - 2131

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Natural Steganography (NS) uses the concept of cover-source switching to provide good undetectability performances [1]. The sensor noise of the source (camera) for a given ISO sensitivity ISO₁ is first modeled as an independent Gaussian distribution for each photo-site, then the embedding mimics a switch to another sensitivity ISO₂(> ISO₁). Because the embedding has to be performed on developed...

chapter

Bag of Fisher Vectors representation of images by saliency-based spatial partitioning

Abin Jose, Iris Heisterklaus

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1762 - 1766

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In content-based image retrieval systems, visual content of the image is the criterion for measuring image similarity. We propose a method to solve the problem of loss of spatial information of objects when local descriptors from an image with multiple objects are aggregated to form a global representation. In our approach, after saliency-based spatial partitioning, local feature descriptors from...

chapter

Interpretable human action recognition in compressed domain

Vignesh Srinivasan, Sebastian Lapuschkin, Cornelius Hellge, Klaus-Robert Muller, more

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 1692 - 1696

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

Compressed domain human action recognition algorithms are extremely efficient, because they only require a partial decoding of the video bit stream. However, the question what exactly makes these algorithms decide for a particular action is still a mystery. In this paper, we present a general method, Layer-wise Relevance Propagation (LRP), to understand and interpret action recognition algorithms...

chapter

Higher-Order Pooling of CNN Features via Kernel Linearization for Action Recognition

Anoop Cherian, Piotr Koniusz, Stephen Gould

2017 IEEE Winter Conference on Applications of Computer Vision (WACV) > 130 - 138

2017 IEEE Winter Conference on Applications of Computer Vision (WACV)

Most successful deep learning algorithms for action recognition extend models designed for image-based tasks such as object recognition to video. Such extensions are typically trained for actions on single video frames or very short clips, and then their predictions from sliding-windows over the video sequence are pooled for recognizing the action at the sequence level. Usually this pooling step uses...

chapter

Evaluating matrix representations for error-tolerant computing

Pareesa Ameneh Golnari, Sharad Malik

Design, Automation & Test in Europe Conference & Exhibition (DATE), 2017 > 1659 - 1662

2017 Design, Automation & Test in Europe Conference & Exhibition (DATE)

We propose a methodology to determine the suitability of different data representations in terms of their error-tolerance for a given application with accelerator-based computing. This methodology helps match the characteristics of a representation to the data access patterns in an application. For this, we first identify a benchmark of key kernels from linear algebra that can be used to construct...

chapter

Image Set Classification Using Sparse Bayesian Regression

Mohammed E. Fathy, Rama Chellappa

2017 IEEE Winter Conference on Applications of Computer Vision (WACV) > 1187 - 1196

2017 IEEE Winter Conference on Applications of Computer Vision (WACV)

This paper presents Bayesian Representation-based Classification (BRC), an approach based on sparse Bayesian regression and subspace clustering for image set classification. Similar to existing representation-based approaches such as Sparse RC (SRC) and Collaborative RC (CRC), BRC assumes that a test image is approximated by a linear combination of the gallery images of the true class. However, we...

chapter

From exaflop to exaflow

Tobias Becker, Pavel Burovskiy, Anna Maria Nestorov, Hristina Palikareva, more

Design, Automation & Test in Europe Conference & Exhibition (DATE), 2017 > 404 - 409

2017 Design, Automation & Test in Europe Conference & Exhibition (DATE)

Exascale computing is facing a gap between the ever increasing demand for application performance and the underlying chip technology that does no longer deliver the expected exponential increases in CPU performance. The industry is now progressively moving towards dedicated accelerators to deliver high performance and better energy efficiency. However, the question of programmability still remains...

chapter

TwinKernels: An execution model to improve GPU hardware scheduling at compile time

Xiang Gong, Zhongliang Chen, Amir Kavyan Ziabari, Rafael Ubal, more

2017 IEEE/ACM International Symposium on Code Generation and Optimization (CGO) > 39 - 49

2017 IEEE/ACM International Symposium on Code Generation and Optimization (CGO)

As throughput-oriented accelerators, GPUs provide tremendous processing power by running a massive number of threads in parallel. However, exploiting high degrees of thread-level parallelism (TLP) does not always translate to the peak performance that GPUs can offer, leaving the GPU's resources often under-utilized. Compared to compute resources, memory resources can tolerate considerably lower levels...

chapter

Phase-aware optimization in approximate computing

Subrata Mitra, Manish K. Gupta, Sasa Misailovic, Saurabh Bagchi

2017 IEEE/ACM International Symposium on Code Generation and Optimization (CGO) > 185 - 196

2017 IEEE/ACM International Symposium on Code Generation and Optimization (CGO)

This paper shows that many applications exhibit execution-phase-specific sensitivity towards approximation of the internal subcomputations. Therefore, approximation in certain phases can be more beneficial than others. Further, this paper presents Opprox, a novel system for application's execution-phase-aware approximation. For a user provided error budget and target input parameters, Opprox identifies...

chapter

FlexCL: An analytical performance model for OpenCL workloads on flexible FPGAs

Shuo Wang, Yun Liang, Wei Zhang

2017 54th ACM/EDAC/IEEE Design Automation Conference (DAC) > 1 - 6

2017 54th ACM/EDAC/IEEE Design Automation Conference (DAC)

The recent adoption of OpenCL programming model by FPGA vendors has realized the function portability of OpenCL workloads on FPGA. However, the poor performance portability prevents its wide adoption. To harness the power of FPGAs using OpenCL programming model, it is advantageous to design an analytical performance model to estimate the performance of OpenCL workloads on FPGAs and provide insights...

chapter

A comprehensive framework for synthesizing stencil algorithms on FPGAs using OpenCL model

Shuo Wang, Yun Liang

2017 54th ACM/EDAC/IEEE Design Automation Conference (DAC) > 1 - 6

2017 54th ACM/EDAC/IEEE Design Automation Conference (DAC)

Iterative stencil algorithms find applications in a wide range of domains. FPGAs have long been adopted for computation acceleration due to its advantages of dedicated hardware design. Hence, FPGAs are a compelling alternative for executing iterative stencil algorithms. However, efficient implementation of iterative stencil algorithms on FPGAs is very challenging due to the data dependencies between...

chapter

A systems approach to computing in beyond CMOS fabrics

Ameya Patil, Naresh Shanbhag, Lav Varshney, Eric Pop, more

2017 54th ACM/EDAC/IEEE Design Automation Conference (DAC) > 1 - 2

2017 54th ACM/EDAC/IEEE Design Automation Conference (DAC)

Emerging applications require computing platforms to extract task-relevant information from increasingly large amounts of data. These requirements place stringent constraints on energy efficiency, throughput, latency, and for certain data types, security and privacy of computing platforms. Traditionally, silicon CMOS scaling has been relied upon to meet these energy and delay constraints. However,...

chapter

Statistical pattern based modeling of GPU memory access streams

Reena Panda, Xinnian Zheng, Jiajun Wang, Andreas Gerstlauer, more

2017 54th ACM/EDAC/IEEE Design Automation Conference (DAC) > 1 - 6

2017 54th ACM/EDAC/IEEE Design Automation Conference (DAC)

Recent research studies have shown that modern GPU performance is often limited by the memory system performance. Optimizing memory hierarchy performance requires GPU designers to draw design insights based on the cache & memory behavior of end-user applications. Unfortunately, it is often difficult to get access to end-user workloads due to the confidential or proprietary nature of the software/data...

chapter

Optimizing memory efficiency for convolution kernels on kepler GPUs

Xiaoming Chen, Jianxu Chen, Danny Z. Chen, Xiaobo Sharon Hu

2017 54th ACM/EDAC/IEEE Design Automation Conference (DAC) > 1 - 6

2017 54th ACM/EDAC/IEEE Design Automation Conference (DAC)

Convolution is a fundamental operation in many applications, such as computer vision, natural language processing, image processing, etc. Recent successes of convolutional neural networks in various deep learning applications put even higher demand on fast convolution. The high computation throughput and memory bandwidth of graphics processing units (GPUs) make GPUs a natural choice for accelerating...

chapter

Work-in-progress: REDEFINE – a case for WCET-friendly hardware accelerators for real time applications

Kavitha Madhu, Tarun Singla, S K Nandy, Ranjani Narayan, more

2017 International Conference on Compilers, Architectures and Synthesis For Embedded Systems (CASES) > 1 - 2

2017 International Conference on Compilers, Architectures and Synthesis For Embedded Systems (CASES)

REDEFINE is a distributed dynamic dataow architecture, designed for exploiting parallelism at various granularities as an embedded system-on-chip (SoC). is paper dwells on the exibility of REDEFINE architecture and its execution model in accelerating real-time applications coupled with a WCET analyzer that computes execution time bounds of real time applications.

chapter

Decoupled affine computation for SIMT GPUs

Kai Wang, Calvin Lin

2017 ACM/IEEE 44th Annual International Symposium on Computer Architecture (ISCA) > 295 - 306

2017 ACM/IEEE 44th Annual International Symposium on Computer Architecture (ISCA)

This paper introduces a method of decoupling affine computations-a class of expressions that produces extremely regular values across SIMT threads-from the main execution stream, so that the affine computations can be performed with greater efficiency and with greater independence from the main execution stream. This decoupling has two benefits: (1) For compute-bound programs, it significantly reduces...

chapter

LogCA: A high-level performance model for hardware accelerators

Muhammad Shoaib Bin Altaf, David A. Wood

2017 ACM/IEEE 44th Annual International Symposium on Computer Architecture (ISCA) > 375 - 388

2017 ACM/IEEE 44th Annual International Symposium on Computer Architecture (ISCA)

With the end of Dennard scaling, architects have increasingly turned to special-purpose hardware accelerators to improve the performance and energy efficiency for some applications. Unfortunately, accelerators don't always live up to their expectations and may under-perform in some situations. Understanding the factors which effect the performance of an accelerator is crucial for both architects and...

chapter

Online sequential extreme learning algorithm with kernels for bigdata classification

N. Pandeeswari, D. Vignesh, R. Pushpalakshmi, Varadharajan

2017 4th International Conference on Advanced Computing and Communication Systems (ICACCS) > 1 - 5

2017 4th International Conference on Advanced Computing and Communication Systems (ICACCS)

Extreme machine learning and its variants have shown good generalization performance and high leaning speed in many applications through its fast convergence. Despite the parallel and distributed ELM on MapReduce framework able to handle very large scale dataset for bigdata applications, the process of coping up with the rapidly updating data is a challenging one. Among the unified algorithms, the...

chapter

Identification of Volterra model parameters in wireless systems

Carlos Crespo-Cadenas, Javier Reina-Tosina, Maria J. Madero-Ayora, Juan A. Becerra

2017 IEEE Topical Conference on RF/Microwave Power Amplifiers for Radio and Wireless Applications (PAWR) > 96 - 99

2017 IEEE Topical Conference on RF/Microwave Power Amplifiers for Radio and Wireless Applications (PAWR)

This paper reports the identification of nonlinear models for wireless communications systems. The procedure relies on a novel complex-valued Volterra series (CVS) representation to provide a sparse representation based on statistical hypothesis testing and the Bayesian information criterion (BIC). The approach has been experimentally evaluated with the front-end of a communications transmitter taking...

INFONA - science communication portal

Search results

A new transformation algorithm for multi-granularity unbalanced linguistic terms

An embedding mechanism for natural steganography after down-sampling

Bag of Fisher Vectors representation of images by saliency-based spatial partitioning

Interpretable human action recognition in compressed domain

Higher-Order Pooling of CNN Features via Kernel Linearization for Action Recognition

Evaluating matrix representations for error-tolerant computing

Image Set Classification Using Sparse Bayesian Regression

From exaflop to exaflow

TwinKernels: An execution model to improve GPU hardware scheduling at compile time

Phase-aware optimization in approximate computing

FlexCL: An analytical performance model for OpenCL workloads on flexible FPGAs

A comprehensive framework for synthesizing stencil algorithms on FPGAs using OpenCL model

A systems approach to computing in beyond CMOS fabrics

Statistical pattern based modeling of GPU memory access streams

Optimizing memory efficiency for convolution kernels on kepler GPUs

Work-in-progress: REDEFINE – a case for WCET-friendly hardware accelerators for real time applications

Decoupled affine computation for SIMT GPUs

LogCA: A high-level performance model for hardware accelerators

Online sequential extreme learning algorithm with kernels for bigdata classification

Identification of Volterra model parameters in wireless systems

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options