The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Human action recognition is the process of labeling image sequences with action labels. Robust solutions to this problem have applications in domains such as medical care, human-computer interaction and virtual training. The task is challenging for feature extraction due to variations in motion performance, recording settings and inter-personal differences. To meet these challenges, we propose two...
Current methods for video description are based on encoder-decoder sentence generation using recurrent neural networks (RNNs). Recent work has demonstrated the advantages of integrating temporal attention mechanisms into these models, in which the decoder network predicts each word in the description by selectively giving more weight to encoded features from specific time frames. Such methods typically...
In this paper, we proposed a fast coding unit (CU) size decision algorithm for High Efficiency Video Coding (HEVC) medical image lossless coding. In detailed, we used the coding information obtained after checking the first two prediction unit (PU) modes inter 2N×2N and Skip to determine whether or not to continue partitioning the current CU. Eight features are extracted from the coding information...
In this letter, an attitude estimation method is presented for space targets by using an inverse synthetic aperture radar (ISAR) image sequence. The line structures, like the boundaries of planar payloads, are extracted from the ISAR image sequence and associated from frame to frame. With the accommodation of the radar looking angle information from the trajectory, the threedimensional attitude of...
An image-driven, model-free approach to design control systems for a large class of industrial process is proposed. A mathematical model of the process is replaced by sequences of subsequent images which play the role of the process (plant) states. The length of this sequences depends on the speed of the process dynamics and on the frame rate. Firstly, a learning sequence of the system states is collected...
We present a unified framework for understanding human social behaviors in raw image sequences. Our model jointly detects multiple individuals, infers their social actions, and estimates the collective actions with a single feed-forward pass through a neural network. We propose a single architecture that does not rely on external detection algorithms but rather is trained end-to-end to generate dense...
3D Human articulated pose recovery from monocular image sequences is very challenging due to the diverse appearances, viewpoints, occlusions, and also the human 3D pose is inherently ambiguous from the monocular imagery. It is thus critical to exploit rich spatial and temporal long-range dependencies among body joints for accurate 3D pose sequence prediction. Existing approaches usually manually design...
Surveillance cameras have been widely used in different scenes. Accordingly, a demanding need is to recognize a person under different cameras, which is called person re-identification. This topic has gained increasing interests in computer vision recently. However, less attention has been paid to video-based approaches, compared with image-based ones. Two steps are usually involved in previous approaches,...
Human recognize an object through eyes scanning in a certain order. We think that the proper order is helpful for capturing useful characteristics, which makes our recognition process rapidly and accurately. Therefore, we propose a memory-based sequence learning model to simulate the human recognition process. Firstly, we divide the image without overlapping to generate the sequence. Then, a convolutional...
In this paper, we propose a novel deep end-to-end network to automatically learn the spatial-temporal fusion features for video-based person re-identification. Specifically, the proposed network consists of CNN and RNN to jointly learn both the spatial and the temporal features of input image sequences. The network is optimized by utilizing the siamese and softmax losses simultaneously to pull the...
In this study, a vision based in-car entertainment user interface is presented. The user interface is designed using a hand posture and gesture recognition algorithm in deep learning framework. The hand posture recognition algorithm is formulated using the convolutional neural network to perform the fundamental tasks in the user interface. The hand gesture recognition algorithm is formulated using...
Feature extraction is one of two core tasks of a person re-identification besides metric learning. Building an effective feature extractor is the common goal of any research in the field. In this work, we propose a deep spatio-temporal network model which consists of a VGG-16 as a spatial feature extractor and a GRU network as an image sequence descriptor. Two temporal pooling techniques are investigated...
This paper studies monocular visual odometry (VO) problem. Most of existing VO algorithms are developed under a standard pipeline including feature extraction, feature matching, motion estimation, local optimisation, etc. Although some of them have demonstrated superior performance, they usually need to be carefully designed and specifically fine-tuned to work well in different environments. Some...
In the field of aerial surveillance, tracking targets in images is complicated by the possible motion of the camera, especially if frame differencing is used to detect moving objects. We propose in this paper to exploit the high similarity in sequences acquired from a nearly static camera. In this case distance maps grown from image edge points share many similarities and T-junctions of distance map...
Vision based environmental monitoring using fixed cameras generates large image collections, creating a bottleneck in data analysis. In areas with limited background knowledge of the monitored habitat, this bottleneck can often not be overcome by traditional pattern recognition methods. A new change detection method to identify interesting events such as presence and behavior of different species...
The existence of reliable evaluation datasets for cell image registration algorithms is crucial for quantitative comparison of registration approaches. A new technique for creating real live cell image sequences for this purpose was introduced recently. These datasets contain stable structures bleached by argon laser in the cell nucleus. In this work, we propose an approach for automatic detection...
Correlation tracker has made a huge success in visual object tracking. However, it is mainly because that the tracker cannot catch the occurrence of appearance changes, tracking based on correlation filters often drifts due to the unexpected appearance changes caused by occlusion, deformation and background clutter. In this paper, we propose a new method to detect the case when the tracker encountered...
Functional MRI (fMRI) data comprises of a set of trials, each trial is described in terms of a group of 20 to 25 anatomical Region Of Interests (ROI). Each ROI consists of neuroimage sequence information in terms of a set of voxels. Extracting features from ROIs and classifying cognitive states is a challenging task. In this work, average of voxel time horizon for each ROI is considered as an input...
In recent years, a number of fixed long-term underwater observatories (FUO) have been deployed to monitor marine habitats over time. HD cameras deployed on FUOs enable vision based studies of long-term processes in the monitored habitats. However, in many marine environments there is often only little a-priori knowledge about potential changes that can be expected or where such changes are likely...
The use of micro expressions as a means to understand ones state of mind has received major interest owing to the rapid increase in security threats. The subtle changes that occur on ones face reveals one's hidden intentions. Recognition of these subtle intentions by humans can be challenging as this needs well trained people and is always a time consuming task. Automatic recognition of micro expressions...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.