The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Recently, gesture recognition by the motion sensor embedded in wearable device has gradually been a popular research topic in pervasive computing. Traditionally, most of gesture recognition researches are based on sensor waveform of time and frequency domains. Another method based on trajectory estimation is emerging in gesture recognition. This method seems like more effect since it can extract more...
Action recognition based on human skeleton structure represents nowadays a prosper research field. This is mainly due to the recent advances in terms of capture technologies and skeleton extraction algorithms. In this context, we observed that 3D skeleton-based actions share several properties with handwritten symbols since they both result from a human performance. We accordingly hypothesize that...
This paper proposes an end-to-end framework, namely fully convolutional recurrent network (FCRN) for handwritten Chinese text recognition (HCTR). Unlike traditional methods that rely heavily on segmentation, our FCRN is trained with online text data directly and learns to associate the pen-tip trajectory with a sequence of characters. FCRN consists of four parts: a path-signature layer to extract...
We model dyadic (two-person) interactions by discriminatively training a spatio-temporal deformable part model of fine-grained human interactions. All interactions involve at most two persons. Our models are capable of localizing human interactions in unsegmented videos, marking the interactions of interest in space and time. Our contributions are as follows: First, we create a model that localizes...
In this paper, we propose a new characteristic measure relative people density and motion dynamics for the purpose of long-term crowd monitoring. While many related works focus on direct people counting and absolute density estimation, we will show that relative densities provide reliable information on crowd behaviour. Furthermore, we will discuss the derivation of a so-called Congestion Level of...
In recent years the most popular video-based human action recognition methods rely on extracting feature representations using Convolutional Neural Networks (CNN) and then using these representations to classify actions. In this work, we propose a fast and accurate video representation that is derived from the motion-salient region (MSR), which represents features most useful for action labeling....
Overlapped handwriting recognition is widely used to input text in smart devices since it allows to write continuous characters on an size-restricted screens. How to segment the stroke sequences into characters is a crucial step before recognition. It is currently formulated as a two-class classification problem merely evaluating on the relationships between a pair of adjacent strokes. To facilitate...
We propose a novel geometric framework for analyzing spontaneous facial expressions, with the specific goal of comparing, matching, and averaging the shapes of landmarks trajectories. Here we represent facial expressions by the motion of the landmarks across the time. The trajectories are represented by curves. We use elastic shape analysis of these curves to develop a Riemannian framework for analyzing...
This paper investigates the effects of sampling on action recognition performance. Currently, dense (regular grid) sampling and uniform random sampling are popular strategies that achieve state-of-the-art performance. However, they are data-blind and pay equal attention to locations of different informativeness. In this paper, a Shannon information based adaptive sampling approach is proposed for...
In this paper we propose a hierarchical activity clustering methodology which incorporates the use of topological persistence analysis. Our clustering methodology captures the hierarchies present in the data and is therefore able to show the dependencies that exist between these activities. We make use of an aggregate persistence diagram to select robust graphical structures present within the dataset...
The path signature feature (PSF) which was initially introduced in rough paths theory as a branch of stochastic analysis, has recently been successfully applied to the field of pattern recognition for extracting sufficient quantity of information contained in a finite trajectory, but with potentially high dimension. In this paper, we propose a variation of path signature representation, namely the...
Map retrieval, the problem of similarity search over a large collection of 3D pointset maps previously built by mobile robots, is crucial for autonomous navigation in indoor and outdoor environments. Bag-of-words (BoW) methods constitute a popular approach to map retrieval; however, these methods have extremely limited descriptive ability because they ignore the spatial layout information of the local...
We seek to extract and explore statistics that characterize New York City traffic flows based on 700 million taxi trips in the 2010–2013 New York City taxi data. This paper presents a two-part solution for intensive computation: space and time design considerations for estimating taxi trajectories with Dijkstra's algorithm, and job parallelization and scheduling with HTCondor. Our contribution is...
Scholarships and financial aids in modern universities are the basic administrative plans to ensure and promote the completion of academic training and studies for students. Traditional grants allocation procedures are based on manual determination, which costs lots of human resources. In this paper, we investigate an assistance model for helping improve the scheme of granting. We first collect students...
Exploiting simple actions to recognize complex actions instead of using complex actions as training samples can save labor expenses and time consumption. Each complex action is composed of a sequence of simple actions and different manners of combinations of simple actions can form different complex actions. Thus, in this paper, we focus on temporal order information (TOI), which can be used to improve...
Spoofing speech detection aims to differentiate spoofing speech from natural speech. Frame-based features are usually used in most of previous works. Although multiple frames or dynamic features are used to form a super-vector to represent the temporal information, the time span covered by these features are not sufficient. Most of the systems failed to detect the non-vocoder or unit selection based...
Action recognition is one of the top challenges in computer vision. In this paper, we present two binary-based video descriptors with outstanding characteristics in terms of recognition rate, computational times and memory requirements. The descriptors are called Binary Wavelet Differences (BWD) and Binary Dense Trajectories (BDT). Our proposed descriptors are based on the local binary patterns and...
In most of the traffic safety studies, both the identification of high-risk locations and the assessment of safety improvement solutions are done through the use of historical crash data. This study proposes an alternative approach that makes use of traffic conflicts extracted from traffic video recordings for safety assessment. State-of-the-art computer vision techniques are used to extract vehicle...
Localization technique is a key technology for vehicular navigation and control. The use of satellite navigation is deemed essential. Yet, satellite-based navigation system is subject to multipath effect and signal blockages in urban area. To enhance the overall navigation capability, odometry techniques have been used as dead-reckoning devices to augment satellite navigation. To account the issues...
Action recognition is actually considered as one of the most challenging areas in computer vision domain. In this paper, we propose a new approach based on utilization of motion boundaries to generate Motion Stable Shape (MSS) features to describe human actions in videos. In fact, we have considered actions as a set of human poses. Temporal evolution of each human pose is modeled by a set of new MSS...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.