Search results

chapter

Offline Handwritten Signature Modeling and Verification Based on Archetypal Analysis

Elias N. Zois, Ilias Theodorakopoulos, George Economou

2017 IEEE International Conference on Computer Vision (ICCV) > 5515 - 5524

2017 IEEE International Conference on Computer Vision (ICCV)

The handwritten signature is perhaps the most accustomed way for the acknowledgement of the consent of an individual or the authentication of the identity of a person in numerous transactions. In addition, the authenticity of a questioned offline or static handwritten signature still poses a case of interest, especially in forensic related applications. A common approach in offline signature verification...

chapter

Deep Representation Learning for Human Motion Prediction and Classification

Judith Butepage, Michael J. Black, Danica Kragic, Hedvig Kjellstrom

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1591 - 1599

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Generative models of 3D human motion are often restricted to a small number of activities and can therefore not generalize well to novel movements or applications. In this work we propose a deep learning framework for human motion capture data that learns a generic representation from a large corpus of motion capture data and generalizes well to new, unseen, motions. Using an encoding-decoding network...

chapter

Recurrent Modeling of Interaction Context for Collective Activity Recognition

Minsi Wang, Bingbing Ni, Xiaokang Yang

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 7408 - 7416

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Modeling of high order interactional context, e.g., group interaction, lies in the central of collective/group activity recognition. However, most of the previous activity recognition methods do not offer a flexible and scalable scheme to handle the high order context modeling problem. To explicitly address this fundamental bottleneck, we propose a recurrent interactional context modeling scheme based...

chapter

A Multi-stage Deep Learning Approach for Business Process Event Prediction

Nijat Mehdiyev, Joerg Evermann, Peter Fettke

2017 IEEE 19th Conference on Business Informatics (CBI) > 1 > 119 - 128

2017 IEEE 19th Conference on Business Informatics (CBI)

The ability to proactively monitor business processes is one of the main differentiators for firms to remain competitive. Process execution logs generated by Process Aware Information Systems (PAIS) help to make various business process specific predictions. This enables a proactive situational awareness related to the execution of business processes. The goal of the approach proposed in the current...

chapter

Efficient pooling of image based CNN features for action recognition in videos

Biplab Banerjee, Vittorio Murino

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 2637 - 2641

2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

In this paper, we propose a new video representation incorporating image based deep features and an efficient pooling strategy for the purpose of action recognition. The Convolutional Neural Network (CNN) based features have very recently emerged as the new state of the art for image classification. Several attempts have been made to extend such CNN models for videos by explicitly focusing on the...

chapter

Effective surface normals based action recognition in depth images

Xuan Son Nguyen, Thanh Phuong Nguyen, Francois Charpillet

2016 23rd International Conference on Pattern Recognition (ICPR) > 817 - 822

2016 23rd International Conference on Pattern Recognition (ICPR)

In this paper, we propose a new local descriptor for action recognition in depth images. The proposed descriptor relies on surface normals in 4D space of depth, time, spatial coordinates and higher-order partial derivatives of depth values along spatial coordinates. In order to classify actions, we follow the traditional Bag-of-words (BoW) approach, and propose two encoding methods termed Multi-Scale...

chapter

One-shot learning of temporal sequences using a distance dependent Chinese Restaurant Process

Carlos Orrite, Mario Rodriguez, Carlos Medrano

2016 23rd International Conference on Pattern Recognition (ICPR) > 2694 - 2699

2016 23rd International Conference on Pattern Recognition (ICPR)

Activity recognition in videos is a challenging task, mainly if a scarce number of samples is available for modelling the problem. The task becomes even harder when using generative models such as mixture models or Hidden Markov Models (HMMs), as they demand a lot of samples to determinate their parameters. Additionally, these models rely on the appropriate selection of some parameters, for instance...

chapter

A hierarchical visual recognition model with precise-spike-driven synaptic plasticity

Xiaoliang Xu, Xin Jin, Rui Yan, Xun Cao

2016 IEEE Symposium Series on Computational Intelligence (SSCI) > 1 - 7

2016 IEEE Symposium Series on Computational Intelligence (SSCI)

Several conventional methods have been implemented in pattern recognition, but few of them have biological plausibility. This paper mimics the hierarchical visual system and uses the precise-spike-driven (PSD) synaptic plasticity rule to learn. The well-known HMAX model imitates the visual cortex and uses Gabor filter and max pooling to extract features. Compared with the traditional HMAX model, our...

chapter

Towards temporal adaptive representation for video action recognition

Junjie Cai, Jie Yu, Francisco Imai, Qi Tian

2016 IEEE International Conference on Image Processing (ICIP) > 4155 - 4159

2016 IEEE International Conference on Image Processing (ICIP)

Action recognition has been one of the challenging problems in the computer vision community. Most of the recent research work in this area exploits the motion features captured by dense trajectory descriptors. On the other hand, static image classification has seen the rise of deep learning architectures, with evidence that the output of intermediate layers could be successfully employed as a low...

chapter

Fisher vector encoded deep convolutional features for unconstrained face verification

Jun-Cheng Chen, Jingxiao Zheng, Vishal M. Patel, Rama Chellappa

2016 IEEE International Conference on Image Processing (ICIP) > 2981 - 2985

2016 IEEE International Conference on Image Processing (ICIP)

We present a method to combine the Fisher vector representation and the Deep Convolutional Neural Network (DCNN) features to generate a rerpesentation, called the Fisher vector encoded DCNN (FV-DCNN) features, for unconstrained face verification. One of the key features of our method is that spatial and appearance information are simultaneously processed when learning the Gaussian mixture model to...

chapter

Decoding cognitive states using the bag of words model on fMRI time series

Gunes Sucu, Emre Akbas, Ilke Oztekin, Eda Mizrak, more

2016 24th Signal Processing and Communication Application Conference (SIU) > 2245 - 2248

2016 24th Signal Processing and Communication Application Conference (SIU)

Bag-of-words (BoW) modeling has yielded successful results in document and image classification tasks. In this paper, we explore the use of BoW for cognitive state classification. We estimate a set of common patterns embedded in the fMRI time series recorded in three dimensional voxel coordinates by clustering the BOLD responses. We use these common patterns, called the code-words, to encode activities...

chapter

Encoding scale into fisher vector for human action recognition

Bowen Zhang, Hanli Wang

2015 Visual Communications and Image Processing (VCIP) > 1 - 4

2015 Visual Communications and Image Processing (VCIP)

In this paper, a new kind of Fisher Vector (FV) model, named Scale FV (ScaleFV), is proposed to ameliorate visual feature encoding for human action recognition. Although several researches have been proposed for feature encoding, the temporal scale information is almost ignored. Similar to the spatial scale information which has shown to be important in extracting and encoding visual features, the...

chapter

Visual Tracking via Saliency Weighted Sparse Coding Appearance Model

Wanyi Li, Peng Wang, Hong Qiao

2014 22nd International Conference on Pattern Recognition > 4092 - 4097

2014 22nd International Conference on Pattern Recognition (ICPR)

Sparse coding has been used for target appearance modeling and applied successfully in visual tracking. However, noise may be inevitably introduced into the representation due to background clutter. To cope with this problem, we propose a saliency weighted sparse coding appearance model for visual tracking. Firstly, a spectral filtering based visual attention computational model, which combines both...

chapter

Amplitude and texture feature based SAR image classification with a two-stage approach

Jilan Feng, Zongjie Cao, Yiming Pi

2014 IEEE Radar Conference > 360 - 364

2014 IEEE Radar Conference (RadarCon)

This paper presents an SAR image classification approach that takes advantage of both amplitude and texture features. The proposed approach is based on superpixels obtained with some over-segmentation methods, and consists of two stages. In the first stage, the SAR image is classified with amplitude and texture feature used separately. Specifically, we use statistical model based maximum-likelihood...

chapter

Classification of unions of subspaces with sparse representations

Alhussein Fawzi, Pascal Frossard

2013 Asilomar Conference on Signals, Systems and Computers > 1368 - 1372

2013 Asilomar Conference on Signals, Systems and Computers

We propose a preliminary investigation on the benefits and limitations of classifiers based on sparse representations. We specifically focus on the union of subspaces data model and examine binary classifiers built on a sparse non linear mapping (in a redundant dictionary) followed by a linear classifier. We study two common sparse non linear mappings (namely l₀ and l₁) and show that, in both cases,...

chapter

Abnormal event detection in crowded scenes based on Structural Multi-scale Motion Interrelated Patterns

Dawei Du, Honggang Qi, Qingming Huang, Wei Zeng, more

2013 IEEE International Conference on Multimedia and Expo (ICME) > 1 - 6

2013 IEEE International Conference on Multimedia and Expo (ICME)

Detecting abnormal events in crowded scenes remains challenging due to the diversity of events defined by various applications. Among the many application situations, motion analysis for event representation is suited for crowded scenes. In this paper, we propose a novel abnormal event detection method via likelihood estimation of dynamic-texture motion representation, called Structural Multi-scale...

chapter

Improving the Discriminability of Dictionary by Gist Information Detection

Tao Xu, Hongya Tuo, Zheng Fang, Li Liu, more

2013 Seventh International Conference on Image and Graphics > 400 - 403

2013 Seventh International Conference on Image and Graphics (ICIG)

Image representations using code words from a visual dictionary are widely applied in object detection and categorization. Traditionally, there are two types of methods to construct a dictionary: k-means and optimization-based method. The former cannot achieve a good discriminability because it extracts too many background features. The latter needs to cooperate with coding methods and brings about...

chapter

Exploiting parallel corpus for automatic extraction of multilingual names: Transliteration perspective

Bibekananda Kundu, Sanjay Kumar Choudhury

2012 Annual IEEE India Conference (INDICON) > 608 - 612

2012 Annual IEEE India Conference (INDICON)

This paper describes a novel approach for extraction of multilingual transliteration pairs from aligned parallel corpus. The proposed approach utilizes an encoding technique based on “Place and Manner of Articulation”. Jaccard Coefficient has been used to measure the distance between encoded source and target transliteration pairs. The proposed methodology has been employed for extraction of English-Bangla...

chapter

Natural Scene Retrieval Based on Non-negative Sparse Coding

Min Wang, Xiao-hui Yang, Lixin Han, Rong Chu

2012 Fourth International Conference on Computational Intelligence, Communication Systems and Networks > 284 - 288

2012 4th International Conference on Computational Intelligence, Communication Systems and Networks (CICSyN 2012)

Semantic understanding of images remains an important research challenge for the image and video retrieval community. A novel natural scene retrieval method based on non-negative sparse coding is proposed in this paper. It firstly combines non-negative sparse coding with spatial pyramid matching for feature extraction and representation. Then, based on sparse coding, it ranks the Euclidean distances...

chapter

Group encoding of local features in image classification

Zifeng Wu, Yongzhen Huang, Liang Wang, Tieniu Tan

Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012) > 1505 - 1508

2012 21st International Conference on Pattern Recognition (ICPR)

Saliency is an important factor in feature coding, based on which saliency coding (SaC) has been proposed for image classification recently. SaC is both effective and efficient in case of a moderate-scale codebook. However, empirical studies show that SaC will lose its superiority as the codebook size increases. To address this problem, we propose a group coding strategy, wherein the latent structure...

INFONA - science communication portal

Search results

Offline Handwritten Signature Modeling and Verification Based on Archetypal Analysis

Deep Representation Learning for Human Motion Prediction and Classification

Recurrent Modeling of Interaction Context for Collective Activity Recognition

A Multi-stage Deep Learning Approach for Business Process Event Prediction

Efficient pooling of image based CNN features for action recognition in videos

Effective surface normals based action recognition in depth images

One-shot learning of temporal sequences using a distance dependent Chinese Restaurant Process

A hierarchical visual recognition model with precise-spike-driven synaptic plasticity

Towards temporal adaptive representation for video action recognition

Fisher vector encoded deep convolutional features for unconstrained face verification

Decoding cognitive states using the bag of words model on fMRI time series

Encoding scale into fisher vector for human action recognition

Visual Tracking via Saliency Weighted Sparse Coding Appearance Model

Amplitude and texture feature based SAR image classification with a two-stage approach

Classification of unions of subspaces with sparse representations

Abnormal event detection in crowded scenes based on Structural Multi-scale Motion Interrelated Patterns

Improving the Discriminability of Dictionary by Gist Information Detection

Exploiting parallel corpus for automatic extraction of multilingual names: Transliteration perspective

Natural Scene Retrieval Based on Non-negative Sparse Coding

Group encoding of local features in image classification

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options