Spatial information describes the relative spatial position of an object in a video. Such information may aid several video analysis tasks, such as object, scene, event and activity recognition. This paper studies the effect of spatial information on video activity recognition. The paper first performs activity recognition on KTH and Weizmann videos using Hidden Markov Model and k-Nearest Neighbour...
This paper examines a generalized version of Preemptive RANSAC for visual motion estimation. The approach described employs the BRUMA function for dealing with varying block sizes and the percentages of hypotheses to be removed during the hypotheses rejection phase. The generation of a flexible number of hypotheses is also performed in order to balance the preemption scheme. Experiments were performed...
It is estimated that 80% of crashes and 65% of near collisions involved drivers who were inattentive to traffic for three seconds before the event. This paper develops an algorithm for extracting features that allow the identification of cell phone use while driving a vehicle. Experiments were performed on sets of images with 100 positive images (with phone) and another 100 negative images (no phone),...
Facial emotions provide an essential source of information commonly used in human communication. For humans, their recognition is automatic and is done by exploiting the real-time variations of facial features. However, the replication of this natural process using computer vision systems is still a challenge, since automation and real-time system requirements are compromised in order to achieve an accurate...
This manuscript addresses the cross-spectral stereo correspondence problem. It proposes the use of a dense flow-field-based representation instead of the original cross-spectral images, which have a low correlation. In this way, working in the flow field space, classical cost functions can be used as similarity measures. Preliminary experimental results on urban environments have been obtained showing...
Many computer vision applications adopting consumer depth cameras have recently received much attention due to the cameras' availability at low prices and their potential to provide more useful information, which can result in higher accuracy (e.g., for object recognition). In this work, to address the problem of drinking activity recognition in vision-based Ambient Assisted Living by using depth...
In this paper, Binary Robust Invariant Scalable Keypoints (BRISK) based detection is used to facilitate the localization of a flying unmanned aerial vehicle (UAV) during its autonomous landing on the runway. Specifically, two target detection algorithms are proposed and developed as the BRISK-supported approach. A dataset of images and differential GPS data is recorded by a ground stereo vision guidance...
By using online measurement and inspection, the efficiency of production in product manufacturing can be greatly improved. However, most of the available methods for the measurement and inspection of parts are offline in practice, which makes them either time-consuming or expensive. In this paper, a vision-based inspection approach is proposed, which is designed to measure and inspect...
This paper proposes a method to recognize human actions from a video sequence. The actions include walking, running, jogging, hand waving, clapping and boxing. The actions are categorized after recognition using a decision tree. Unlike other algorithms, our proposed method recognizes single human actions by considering the speed, direction and the percentage of endpoints, which is a novel approach. In addition...
Camera calibration is a crucial task in computer vision. It aims to obtain the internal and external parameters of a camera from images. Currently, classical calibration techniques are widely used in measurement and monitoring, especially the method proposed by Zhang Z.Y. In this paper, it is demonstrated that the classical method is rather inaccurate under certain conditions, namely at different distances. Experiment results...
Recovering 3D depth from a single outdoor image is a basic problem in Computer Vision and Close-Range Photogrammetry. In this paper, an efficient approach to depth estimation from a single outdoor image is presented. According to scene classification, the depth of regions labeled as sky, ground and vertical is predicted separately. Firstly, a more accurate depth calculation model for ground regions...
A new region-based local stereo matching algorithm with accurate disparity estimation is proposed. For the local stereo matching, finding an appropriate support window is crucial to the performance of disparity estimation. In order to generate an accurate support region, a modified cross-based local approach combined with mean-shift segmentation is performed. We then further improve the reliability...
Feature learning plays a crucial role in successful human action recognition. There have been a number of approaches that extract action features from depth information and 3D skeletal data. However, neither the skeleton information nor the depth map is accurate enough for feature learning unless complex descriptors are carefully designed and embedded. In this paper, we first propose a data sparsification...
In this paper, a fall alarm and abnormal inactivity detection system is implemented on a Raspberry Pi for real-time security surveillance of empty-nesters. We propose a novel method for fall alarm that requires only a small amount of computation. We also present an inactivity detection method, which we call the "inactivity history" method, to improve the accuracy of detection, and it is a kind...
The performance of different action recognition techniques has recently been studied by several computer vision researchers. However, the potential improvement in classification through classifier fusion by ensemble-based methods has remained unexplored. In this work, we evaluate the performance of an ensemble of action learning techniques, each performing the recognition task from a different perspective...
This paper presents a novel method for camera pose estimation that combines Kinect and Perspective-n-Point algorithms. Most existing camera pose estimation methods suffer from errors caused by inevitable outliers among the 2D–3D correspondences. To address this, we propose to use a random downsampling process to deal with outliers. The proposed method is divided into two main...
An important task in computer vision is object localization and recognition within images and video. Achieving real-time object localization and recognition on low-power devices is especially relevant in the context of wearable technologies. Indeed, wearable devices have a reduced size and cost and limited computational power leading to a challenging scenario for classical computer vision algorithms...
The distance to real objects can be estimated with the help of various sensors. In some cases a stereovision system is technically simpler and cheaper. Dense stereovision algorithms make it possible to build a highly detailed virtual terrain model, which makes it possible to control the device in complicated terrain or in dangerous surroundings. But some dense stereovision algorithms have problems to determine...
Object tracking is one of the most important components in numerous applications of computer vision. In this paper, the target is represented by a series of binary patterns, where each binary pattern consists of several rectangle pairs of variable size and location. As a complement to traditional binary descriptors, these patterns are extracted in both the intensity domain and the gradient domain...
Augmented reality is becoming more and more popular due to its many practical applications. A key element is understanding the scene and the human activities involved, in order to offer rich interaction with the world via virtual actions and elements. For this purpose, a new vision-based human-action recognition module has been developed to be integrated with the new generation...