This paper proposes a tracking approach for regions of interest (ROI) in thermal image videos, where vital signs can be measured for emotion recognition. The proposed tracking framework overcomes a number of problems associated with this goal: mainly the size of the ROI, appearance variations in the ROI with physiological changes, and the duration of tracking in a practical setting. The proposed framework...
In this paper we argue that a key-region detector designed to take into account the special characteristics of document images can result in the detection of fewer, more meaningful key-regions. We propose a fast key-region detector able to capture aspects of the structural information of the document, and demonstrate its efficiency by comparing against standard detectors in an administrative document...
Detecting text in scene images is very challenging due to complex backgrounds, various fonts and different illumination conditions. Without prior knowledge, a detector previously trained using lots of samples still performs badly on a test image because of the disparities in distributions between the training samples and the testing ones. In this paper, we propose to adapt a pre-trained generic scene...
A common task in various machine learning (ML) application areas involves monitoring regularly gathered data for ‘interesting’ events. This task is predominant in reconnaissance, but also arises in activities ranging from the analysis of scientific data to the monitoring of naturally occurring events, and from controlling engineering processes to observing human behavior. We will refer...
Museums, libraries, national archives and art galleries deal with visual objects that must be made accessible to a wide variety of experts or non-experts like researchers, art lovers or interested people. The ability to identify objects sharing some aspect of visual similarity can be useful when trying to trace historical influences or when looking for further examples of paintings, sculptures or...
This paper presents an automatic procedure for rapid building extraction from optical very high resolution (VHR) satellite imagery. Classical extraction models are always complex and time-consuming. The optimized process of building extraction consists of three main rapid stages: edge-preserving and smoothing bilateral filter, line segment detection, perceptual grouping polygonal building boundary...
Interest point detectors are important components in a variety of computer vision systems. This paper demonstrates an automated virtual 3D environment for controlling and measuring detected interest points on 2D images in an accurate and rapid manner. Real-time affine transform tools enable easy implementation and full automation of complex scene evaluations without the time-cost of a manual setup...
In this paper, we propose an improved algorithm for 3D scene reconstruction from images obtained from two or more cameras. The principal objective of the proposed algorithm is to reduce the number of feature points used for 3D scene reconstruction by using a triangle mesh and recursive sub-mesh definition. This kind of optimization can reduce computational complexity and the algorithm thereafter might...
This paper describes a computer vision system to detect and count moving vehicles on roads. The system uses a real-time traffic video surveillance camera mounted over roads and computes the total number of vehicles which passed the road. The moving vehicle image is extracted using the ‘double difference image’ algorithm and counting is accomplished by tracking vehicle movements within a tracking zone, called...
We propose an approach to improving the detection results of a generic offline trained detector on frames from a specific video. For two consecutive frames of a video with the object, deformable part model (DPM) detection is performed to get the original detections. Then the image patches corresponding to the detected root box and part boxes are obtained. Thirdly, extract scale invariant feature...
Object recognition in real scenes is a central problem in computer vision. In this paper we propose a new approach for shape-based recognition of objects in real scenes. This approach uses moment invariants for identification of shape features. Moment invariants are functions of central moments. They are invariant against linear transformations such as rotation, translation and scaling. Therefore,...
Real-world scenes involve many objects that interact with each other in complex semantic patterns. For example, a bar scene can be naturally described as having a variable number of chairs of similar size, close to each other and aligned horizontally. This high-level interpretation of a scene relies on semantically meaningful entities and is most generally described using relational representations...
In this paper, we present two methods to improve the performance of landmark detection algorithms that are designed to detect individual landmarks. We focus on the landmark configuration module that takes the output of the individual landmark detectors and searches for a configuration of optimal landmark locations based on appropriate shape constraints. We design two configuration search approaches:...
Barcode detection is required in a wide range of real-life applications. Imaging conditions and techniques vary considerably and each application has its own requirements for detection speed and accuracy. In our earlier works we built barcode detectors using morphological operations and uniform partitioning with several approaches and showed their behaviour on a set of test images. In this work, we...
A robust image processing technique capable of detecting and localizing objects accurately plays an important role in many computer vision applications. In this paper, a feature based detector for birds is proposed. By combining Histogram of Oriented Gradients (HOG) and Center-Symmetric Local Binary Pattern (CS-LBP) as the feature set, detection of crows under various lighting conditions could be...
This paper presents a unified action recognition framework combining the Harris3D descriptor with the 3D SIFT detector. We perform action recognition experiments on the KTH dataset using Support Vector Machines. The experiments apply the leave-one-out protocol and compare our proposed approach with state-of-the-art methods. The results show that our proposed approach is effective. Compared with other approaches our approach...
In order to be deployed in real-world driving environments, autonomous vehicles must be able to recognize and respond to exceptional road conditions, such as highway workzones, because such unusual events can alter previously known traffic rules and road geometry. In this paper, we present a set of computer vision methods which recognize the bounds of a highway workzone and temporary changes in highway...
In this work, a novel feature detection algorithm, a new local binary pattern for local binary description and a tree-based descriptor indexing for descriptor matching are proposed. Similar to the well-known FAST detector, the proposed feature detector performs detection via pixel intensity comparisons in nested circles. Interest point description is achieved by a novel comparison pattern, whereas matching...
Methods for extracting points of interest for image characterization are many and diverse. In general, the characterization process follows a detector/descriptor scheme. The detection step selects some points of interest as primitives that capture important structural information about the image. These detected points are then described by characteristic vectors to form a representative...
This paper proposes a novel system for automatically detecting children from a color monocular back-up camera, as part of a back-up warning device in passenger vehicles. We present the use of an attentional mechanism that focuses compute-intensive bounding-box classifiers on a subset of all possible bounding-box solutions to enable real-time performance of 248ms per frame with negligible reduction...