This paper presents a novel method to predict future human activities from partially observed RGB-D videos. Human activity prediction is generally difficult due to its non-Markovian property and the rich context between humans and their environments. We use a stochastic grammar model to capture the compositional structure of events, integrating human actions, objects, and their affordances. We represent...
Detecting actions in untrimmed videos is an important yet challenging task. In this paper, we present the structured segment network (SSN), a novel framework which models the temporal structure of each action instance via a structured temporal pyramid. On top of the pyramid, we further introduce a decomposed discriminative model comprising two classifiers, respectively for classifying actions and...
Interest in global security has encouraged researchers to propose novel algorithms for robust biometric systems. One interesting biometric trait is identifying humans on the basis of their walking patterns, called gait recognition. In this paper, our contribution is two-fold. First, we discuss the modules of model-free gait recognition techniques. Second, we perform the comparative...
This paper addresses issues in human fall detection from videos. Unlike the handcrafted features used in conventional machine learning, we extract features from Convolutional Neural Networks (CNNs) for human fall detection. Similar to many existing works using two-stream inputs, we use a spatial CNN stream with raw image differences and a temporal CNN stream with optical flow as the inputs of the CNN....
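The "raw image difference" input mentioned in this abstract is simply the per-pixel change between consecutive frames. A minimal sketch (a hypothetical illustration, not the authors' code; frames are assumed to be 2-D lists of grayscale intensities):

```python
def frame_difference(frame_a, frame_b):
    """Absolute per-pixel difference between two equally sized
    grayscale frames, given as 2-D lists of intensities."""
    return [[abs(a - b) for a, b in zip(row_a, row_b)]
            for row_a, row_b in zip(frame_a, frame_b)]

# A falling subject produces large differences between consecutive frames.
prev = [[10, 10], [10, 10]]
curr = [[10, 60], [200, 10]]
print(frame_difference(prev, curr))  # [[0, 50], [190, 0]]
```

In the two-stream setup described, an image like this would be fed to the spatial CNN stream, while dense optical flow would feed the temporal stream.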
We present a system for temporal detection of social interactions. Many works to date have succeeded in recognising activities from clipped videos in datasets, but for robotic applications it is important to be able to move to more realistic data. For this reason, the proposed approach temporally detects intervals where individual or social activity is occurring. Recognition of human activities...
In this paper we address the problem of online video abnormal event detection. A vast number of methods to automatically detect abnormal events in videos have been proposed recently. However, the majority of these methods cannot attain online performance; in other words, they cannot detect events as soon as they occur. Thus there is a lack of methods specifically aimed at detecting...
We address the problem of temporal action localization in videos. We pose action localization as a structured prediction over arbitrary-length temporal windows, where each window is scored as the sum of frame-wise classification scores. Additionally, our model classifies the start, middle, and end of each action as separate components, allowing our system to explicitly model each action's temporal...
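When each window is scored as the sum of frame-wise classification scores, the best arbitrary-length window reduces to a maximum-subarray search. A small sketch of that scoring step (a hypothetical illustration under that assumption, not the authors' full model):

```python
def best_window(frame_scores):
    """Return (start, end, score) of the contiguous window maximizing
    the sum of per-frame classification scores, via Kadane's algorithm.
    `end` is exclusive."""
    best = (0, 1, frame_scores[0])
    cur_start, cur_sum = 0, 0.0
    for i, s in enumerate(frame_scores):
        if cur_sum <= 0:
            cur_start, cur_sum = i, s  # restart the window here
        else:
            cur_sum += s               # extend the current window
        if cur_sum > best[2]:
            best = (cur_start, i + 1, cur_sum)
    return best

# Positive scores suggest the action is present in that frame.
scores = [-1.0, 2.0, 3.0, -0.5, 1.5, -4.0]
print(best_window(scores))  # (1, 5, 6.0)
```

The abstract's structured prediction additionally distinguishes start/middle/end components, which this one-score sketch omits.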
In this paper, we introduce Key-Value Memory Networks to a multimodal setting and a novel key-addressing mechanism to deal with sequence-to-sequence models. The proposed model naturally decomposes the problem of video captioning into vision and language segments, dealing with them as key-value pairs. More specifically, we learn a semantic embedding (v) corresponding to each frame (k) in the video,...
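Key-addressing in a key-value memory generally means weighting each value by the softmax similarity between a query and the corresponding key. A bare-bones sketch of that mechanism (a generic illustration with made-up vectors, not the paper's semantic embedding):

```python
import math

def attend(query, keys, values):
    """Key-addressing: softmax over dot-product similarities between
    the query and each key, then a weighted sum of the values."""
    sims = [sum(q * k for q, k in zip(query, key)) for key in keys]
    m = max(sims)                                  # for numerical stability
    exps = [math.exp(s - m) for s in sims]
    z = sum(exps)
    weights = [e / z for e in exps]
    dim = len(values[0])
    return [sum(w * v[d] for w, v in zip(weights, values)) for d in range(dim)]

# The value whose key best matches the query dominates the output.
keys = [[1.0, 0.0], [0.0, 1.0]]
values = [[10.0, 0.0], [0.0, 10.0]]
print(attend([5.0, 0.0], keys, values))  # close to [10.0, 0.0]
```

In the captioning setting described above, the keys would be frame representations and the values their learned semantic embeddings.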
This paper presents a framework for saliency estimation and fixation prediction in videos. The proposed framework is based on a hierarchical feature representation obtained by stacking convolutional layers of independent subspace analysis (ISA) filters. The feature learning is thus unsupervised and independent of the task. To compute the saliency, we then employ a multiresolution saliency architecture...
Global motion estimation (GME) algorithms are typically employed on aerial videos captured by on-board UAV cameras to compensate for the artificial motion induced in these video frames due to camera motion. However, existing methods for GME have high computational complexity and are therefore not suitable for on-board processing in UAVs with limited computing capabilities. In this paper, we propose...
Deep visual attention in computer vision has attracted much interest over the past years and has made great contributions, especially in image classification, image captioning, and action recognition. However, because they rely wholly or partially on backpropagation (BP) training, existing models cannot show the true power of attention in computational efficiency and focusing accuracy. Our intuition is that the attention mechanism should...
In this paper, we propose a new video representation incorporating image based deep features and an efficient pooling strategy for the purpose of action recognition. The Convolutional Neural Network (CNN) based features have very recently emerged as the new state of the art for image classification. Several attempts have been made to extend such CNN models for videos by explicitly focusing on the...
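A common baseline for the pooling step this abstract alludes to is element-wise averaging of per-frame CNN feature vectors into a single video descriptor. A minimal sketch (an illustrative baseline, not necessarily the paper's proposed strategy):

```python
def average_pool(frame_features):
    """Pool per-frame feature vectors into one video-level descriptor
    by element-wise averaging."""
    n = len(frame_features)
    dim = len(frame_features[0])
    return [sum(f[d] for f in frame_features) / n for d in range(dim)]

# Three frames, each with a 2-dimensional feature vector.
frames = [[1.0, 4.0], [3.0, 0.0], [2.0, 2.0]]
print(average_pool(frames))  # [2.0, 2.0]
```

Pooling makes the representation invariant to video length, at the cost of discarding temporal order.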
Group activity recognition from videos is a very challenging problem that has barely been addressed. We propose an activity recognition method using group context. In order to encode both single-person descriptions and two-person interactions, we learn mappings from high-dimensional feature spaces to low-dimensional dictionaries. In particular, the proposed two-person descriptor takes into account geometric...
Most affect-based systems analyse facial expressions for emotion detection, and utilize face detection and recognition methods in order to perform effective affect analysis. Recent work has demonstrated the efficacy of deep architectures for face recognition by training them as classifiers on voluminous datasets. Some architectures are trained as classifiers, and some directly learn an embedding via a triplet...
In this paper, we address the problem of recognizing unfinished human activity from partially observed videos. Specifically, we propose a novel human activity descriptor, which can represent pairwise relationships among human activities in a compact manner using pre-trained Convolutional Neural Networks (CNNs) by capturing the discriminative sub-volume. The potentially important relationship among...
We propose a novel geometric framework for analyzing spontaneous facial expressions, with the specific goal of comparing, matching, and averaging the shapes of landmark trajectories. Here we represent facial expressions by the motion of the landmarks across time. The trajectories are represented by curves. We use elastic shape analysis of these curves to develop a Riemannian framework for analyzing...
Activity recognition in videos is a challenging task, especially when only a scarce number of samples is available for modelling the problem. The task becomes even harder when using generative models such as mixture models or Hidden Markov Models (HMMs), as they demand many samples to determine their parameters. Additionally, these models rely on the appropriate selection of some parameters, for instance...
The MapReduce framework is being increasingly used in the scientific computing and image/video processing fields. Relevant research has tailored it to these fields' specificities, but there are still overwhelming limitations when it comes to temporal locality-sensitive computations. The performance of this class of computations is closely tied to an efficient use of the memory hierarchy, a concern that...
Player believability is often defined as the ability of a game playing character to convince an observer that it is being controlled by a human. The agent's behavior is often assumed to be the main contributor to the character's believability. In this paper we reframe this core assumption and instead focus on the impact of the game environment and aspects of game design (such as level design) on the...
In this study, we make use of brain activation data to investigate the perceptual plausibility of a visual and an auditory model for visual and auditory saliency in video processing. These models have already been successfully employed in a number of applications. In addition, we experiment with parameters, modifications and suitable fusion schemes. As part of this work, fMRI data from complex video...