Search results

chapter

Action recognition based on depth image sequence

Liangcan Liao, Guitao Cao, Wenming Cao

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) > 1583 - 1587

2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

Human action recognition is the process of labeling image sequences with action labels. Robust solutions to this problem have applications in domains such as medical care, human-computer interaction and virtual training. The task is challenging for feature extraction due to variations in motion performance, recording settings and inter-personal differences. To meet these challenges, we propose two...

chapter

Attention-Based Multimodal Fusion for Video Description

Chiori Hori, Takaaki Hori, Teng-Yok Lee, Ziming Zhang, more

2017 IEEE International Conference on Computer Vision (ICCV) > 4203 - 4212

2017 IEEE International Conference on Computer Vision (ICCV)

Current methods for video description are based on encoder-decoder sentence generation using recurrent neural networks (RNNs). Recent work has demonstrated the advantages of integrating temporal attention mechanisms into these models, in which the decoder network predicts each word in the description by selectively giving more weight to encoded features from specific time frames. Such methods typically...

chapter

Decision tree based fast CU partition for HEVC lossless compression of medical image sequences

Dongdong Zhang, Xiaojing Duan, Di Zang

2017 9th International Conference on Wireless Communications and Signal Processing (WCSP) > 1 - 6

2017 9th International Conference on Wireless Communications and Signal Processing (WCSP)

In this paper, we proposed a fast coding unit (CU) size decision algorithm for High Efficiency Video Coding (HEVC) medical image lossless coding. In detailed, we used the coding information obtained after checking the first two prediction unit (PU) modes inter 2N×2N and Skip to determine whether or not to continue partitioning the current CU. Eight features are extracted from the coding information...

chapter

Attitude estimation of space targets by extracting line features from ISAR image sequences

Yejian Zhou, Lei Zhang, Hongxian Wang, Zhijun Qiao, more

2017 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC) > 1 - 4

2017 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC)

In this letter, an attitude estimation method is presented for space targets by using an inverse synthetic aperture radar (ISAR) image sequence. The line structures, like the boundaries of planar payloads, are extracted from the ISAR image sequence and associated from frame to frame. With the accommodation of the radar looking angle information from the trajectory, the threedimensional attitude of...

chapter

Image-driven, model-free control of repetitive processes based on machine learning

Ewaryst Rafajłowicz

2017 10th International Workshop on Multidimensional (nD) Systems (nDS) > 1 - 6

2017 10th International Workshop on Multidimensional (nD) Systems (nDS)

An image-driven, model-free approach to design control systems for a large class of industrial process is proposed. A mathematical model of the process is replaced by sequences of subsequent images which play the role of the process (plant) states. The length of this sequences depends on the speed of the process dynamics and on the frame rate. Firstly, a learning sequence of the system states is collected...

chapter

Social Scene Understanding: End-to-End Multi-person Action Localization and Collective Activity Recognition

Timur Bagautdinov, Alexandre Alahi, Francois Fleuret, Pascal Fua, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3425 - 3434

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present a unified framework for understanding human social behaviors in raw image sequences. Our model jointly detects multiple individuals, infers their social actions, and estimates the collective actions with a single feed-forward pass through a neural network. We propose a single architecture that does not rely on external detection algorithms but rather is trained end-to-end to generate dense...

chapter

Recurrent 3D Pose Sequence Machines

Mude Lin, Liang Lin, Xiaodan Liang, Keze Wang, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5543 - 5552

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

3D Human articulated pose recovery from monocular image sequences is very challenging due to the diverse appearances, viewpoints, occlusions, and also the human 3D pose is inherently ambiguous from the monocular imagery. It is thus critical to exploit rich spatial and temporal long-range dependencies among body joints for accurate 3D pose sequence prediction. Existing approaches usually manually design...

chapter

See the Forest for the Trees: Joint Spatial and Temporal Recurrent Neural Networks for Video-Based Person Re-identification

Zhen Zhou, Yan Huang, Wei Wang, Liang Wang, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6776 - 6785

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Surveillance cameras have been widely used in different scenes. Accordingly, a demanding need is to recognize a person under different cameras, which is called person re-identification. This topic has gained increasing interests in computer vision recently. However, less attention has been paid to video-based approaches, compared with image-based ones. Two steps are usually involved in previous approaches,...

chapter

Memory-based pedestrian detection through sequence learning

Xudong Li, Mao Ye, Yiguang Liu, Ce Zhu

2017 IEEE International Conference on Multimedia and Expo (ICME) > 1129 - 1134

2017 IEEE International Conference on Multimedia and Expo (ICME)

Human recognize an object through eyes scanning in a certain order. We think that the proper order is helpful for capturing useful characteristics, which makes our recognition process rapidly and accurately. Therefore, we propose a memory-based sequence learning model to simulate the human recognition process. Firstly, we divide the image without overlapping to generate the sequence. Then, a convolutional...

chapter

Deep Spatial-Temporal Fusion Network for Video-Based Person Re-identification

Lin Chen, Hua Yang, Ji Zhu, Qin Zhou, more

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 1478 - 1485

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

In this paper, we propose a novel deep end-to-end network to automatically learn the spatial-temporal fusion features for video-based person re-identification. Specifically, the proposed network consists of CNN and RNN to jointly learn both the spatial and the temporal features of input image sequences. The network is optimized by utilizing the siamese and softmax losses simultaneously to pull the...

chapter

Real-time hand posture and gesture-based touchless automotive user interface using deep learning

V. John, M. Umetsu, A. Boyali, S. Mita, more

2017 IEEE Intelligent Vehicles Symposium (IV) > 869 - 874

2017 IEEE Intelligent Vehicles Symposium (IV)

In this study, a vision based in-car entertainment user interface is presented. The user interface is designed using a hand posture and gesture recognition algorithm in deep learning framework. The hand posture recognition algorithm is formulated using the convolutional neural network to perform the fundamental tasks in the user interface. The hand gesture recognition algorithm is formulated using...

chapter

Deep spatio-temporal network for accurate person re-identification

Quan Nguyen Hong, Nghia Nguyen Tuan, Trung Tran Quang, Dung Nguyen Tien, more

2017 International Conference on Information and Communications (ICIC) > 208 - 213

2017 International Conference on Information and Communications (ICIC)

Feature extraction is one of two core tasks of a person re-identification besides metric learning. Building an effective feature extractor is the common goal of any research in the field. In this work, we propose a deep spatio-temporal network model which consists of a VGG-16 as a spatial feature extractor and a GRU network as an image sequence descriptor. Two temporal pooling techniques are investigated...

chapter

DeepVO: Towards end-to-end visual odometry with deep Recurrent Convolutional Neural Networks

Sen Wang, Ronald Clark, Hongkai Wen, Niki Trigoni

2017 IEEE International Conference on Robotics and Automation (ICRA) > 2043 - 2050

2017 IEEE International Conference on Robotics and Automation (ICRA)

This paper studies monocular visual odometry (VO) problem. Most of existing VO algorithms are developed under a standard pipeline including feature extraction, feature matching, motion estimation, local optimisation, etc. Although some of them have demonstrated superior performance, they usually need to be carefully designed and specifically fine-tuned to work well in different environments. Some...

chapter

Camera motion compensation from T-junctions in distance map skeleton

Charles Beumier, Xavier Neyt

2017 International Conference on Systems, Signals and Image Processing (IWSSIP) > 1 - 5

2017 International Conference on Systems, Signals and Image Processing (IWSSIP)

In the field of aerial surveillance, tracking targets in images is complicated by the possible motion of the camera, especially if frame differencing is used to detect moving objects. We propose in this paper to exploit the high similarity in sequences acquired from a nearly static camera. In this case distance maps grown from image edge points share many similarities and T-junctions of distance map...

chapter

Change detection in marine observatory image streams using Bi-Domain Feature Clustering

Torben Moller, Ingunn Nilssen, Tim W. Nattkemper

2016 23rd International Conference on Pattern Recognition (ICPR) > 793 - 798

2016 23rd International Conference on Pattern Recognition (ICPR)

Vision based environmental monitoring using fixed cameras generates large image collections, creating a bottleneck in data analysis. In areas with limited background knowledge of the monitored habitat, this bottleneck can often not be overcome by traditional pattern recognition methods. A new change detection method to identify interesting events such as presence and behavior of different species...

chapter

Automatic detection of laser-induced structures in live cell fluorescent microscopy images using snakes with geometric constraints

Alexandr Yu. Kondrat'ev, Dmitry V. Sorokin

2016 23rd International Conference on Pattern Recognition (ICPR) > 331 - 336

2016 23rd International Conference on Pattern Recognition (ICPR)

The existence of reliable evaluation datasets for cell image registration algorithms is crucial for quantitative comparison of registration approaches. A new technique for creating real live cell image sequences for this purpose was introduced recently. These datasets contain stable structures bleached by argon laser in the cell nucleus. In this work, we propose an approach for automatic detection...

chapter

Appearance changes detection during tracking

Wei Chen, Xifeng Guo, Xinwang Liu, En Zhu, more

2016 23rd International Conference on Pattern Recognition (ICPR) > 1821 - 1826

2016 23rd International Conference on Pattern Recognition (ICPR)

Correlation tracker has made a huge success in visual object tracking. However, it is mainly because that the tracker cannot catch the occurrence of appearance changes, tracking based on correlation filters often drifts due to the unexpected appearance changes caused by occlusion, deformation and background clutter. In this paper, we propose a new method to detect the case when the tracker encountered...

chapter

Classification of cognitive state using statistics of split time series

J. Siva Ramakrishna, Hariharan Ramasangu

2016 IEEE Annual India Conference (INDICON) > 1 - 5

2016 IEEE Annual India Conference (INDICON)

Functional MRI (fMRI) data comprises of a set of trials, each trial is described in terms of a group of 20 to 25 anatomical Region Of Interests (ROI). Each ROI consists of neuroimage sequence information in terms of a set of voxels. Extracting features from ROIs and classifying cognitive states is a challenging task. In this work, average of voxel time horizon for each ROI is considered as an input...

chapter

Data-Driven Long Term Change Analysis in Marine Observatory Image Streams

Torben Moller, Ingunn Nilssen, Tim W. Nattkemper

2016 ICPR 2nd Workshop on Computer Vision for Analysis of Underwater Imagery (CVAUI) > 13 - 18

2016 ICPR 2nd Workshop on Computer Vision for Analysis of Underwater Imagery (CVAUI)

In recent years, a number of fixed long-term underwater observatories (FUO) have been deployed to monitor marine habitats over time. HD cameras deployed on FUOs enable vision based studies of long-term processes in the monitored habitats. However, in many marine environments there is often only little a-priori knowledge about potential changes that can be expected or where such changes are likely...

chapter

Automatic recognition of micro-expressions using local binary patterns on three orthogonal planes and extreme learning machine

Iyanu Pelumi Adegun, Hima B. Vadapalli

2016 Pattern Recognition Association of South Africa and Robotics and Mechatronics International Conference (PRASA-RobMech) > 1 - 5

2016 PRASA-RobMech International Conference

The use of micro expressions as a means to understand ones state of mind has received major interest owing to the rapid increase in security threats. The subtle changes that occur on ones face reveals one's hidden intentions. Recognition of these subtle intentions by humans can be challenging as this needs well trained people and is always a time consuming task. Automatic recognition of micro expressions...

INFONA - science communication portal

Search results

Action recognition based on depth image sequence

Attention-Based Multimodal Fusion for Video Description

Decision tree based fast CU partition for HEVC lossless compression of medical image sequences

Attitude estimation of space targets by extracting line features from ISAR image sequences

Image-driven, model-free control of repetitive processes based on machine learning

Social Scene Understanding: End-to-End Multi-person Action Localization and Collective Activity Recognition

Recurrent 3D Pose Sequence Machines

See the Forest for the Trees: Joint Spatial and Temporal Recurrent Neural Networks for Video-Based Person Re-identification

Memory-based pedestrian detection through sequence learning

Deep Spatial-Temporal Fusion Network for Video-Based Person Re-identification

Real-time hand posture and gesture-based touchless automotive user interface using deep learning

Deep spatio-temporal network for accurate person re-identification

DeepVO: Towards end-to-end visual odometry with deep Recurrent Convolutional Neural Networks

Camera motion compensation from T-junctions in distance map skeleton

Change detection in marine observatory image streams using Bi-Domain Feature Clustering

Automatic detection of laser-induced structures in live cell fluorescent microscopy images using snakes with geometric constraints

Appearance changes detection during tracking

Classification of cognitive state using statistics of split time series

Data-Driven Long Term Change Analysis in Marine Observatory Image Streams

Automatic recognition of micro-expressions using local binary patterns on three orthogonal planes and extreme learning machine

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options