The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
In this paper, a structural similarity index is first proposed for two images with possibly different dynamic ranges and intensities as well as possibly small rotation and translation. The proposed index is then extended by dividing two images into local windows, and the similarity is detected by checking all pairs of local windows. It is shown by experimental results that the proposed indices are...
In this paper, a cooperative system for target tracking for Omnidome is presented. Omnidome is a dual-camera sensor composed of an omnidirectional camera and a Pan-Tilt-Zoom (PTZ) unit. The tracking system is based on color histogram analysis, and is triggered by a motion detector; it has been designed in order to get the best performance when performing people tracking. The system is capable of extracting...
Multiview video coding (MVC) improves the coding efficiency by motion estimation (ME) and disparity estimation (DE). ME and DE at encoder side involve in heavy computation, which needs to be further reduced for practical applications. This paper presents fast disparity estimation by utilizing depth information to reduce DE's computational complexity. First, the coordinate offset of the encoding block...
In this paper, a perceptual multiview video coding scheme is proposed, based on the synthesized Just Noticeable Distortion (JND) maps. In JND-based perceptual video coding, the residues after intra or inter prediction are tuned according to the corresponding JND thresholds to save the bits without affecting the perceptual quality. In our scheme, to reduce the computational cost of generating the multiview...
Unmanned Arial Vehicles (UAVs) require the development of some on-board safety equipments before inheriting the sky. An on-board collision avoidance system is being built by our team. Due to the strict size, weight, power, and costs constraints, visual intruder airplane detection is the only option. This paper introduces our visual airplane detector algorithm, which is designed to be operational in...
An autonomous, efficient and effective object tracking algorithm was required to autonomously identify and track incoming targets. Then controlling a pan-tilt mounted with the sensing camera to accommodate the target within the camera's field of view and controlling a weapon mounted on the second mechanical pan tilt to lock the target and follow it efficiently and accurately. A hybrid algorithm is...
augmented reality applications overlap virtual objects over a real scene considering the context. Today, more advanced applications also make use of diminished reality, which removes real objects from a scene. This paper describes a novel approach that combines augmented reality and diminished reality techniques to modify real objects in augmented reality applications. The proposed approach removes...
From a rectified stereo image pair, the task of view synthesis is to generate images from any viewpoint along the baseline. The main difficulty of the problem is how to fill occluded regions. In this paper, we present a new method for view synthesis that is both fast and accurate. Occlusions are filled using color and disparity information to produce consistent pixel estimates. Results are comparable...
View synthesis offers a great flexibility in generating free viewpoint television (FTV) and 3D video (3DV). However, the depth-image-based view synthesis approach is very sensitive to errors in the camera parameters or poorly estimated depth maps (also called depth images). Because of these errors, three kinds of artifacts (blurring, contour, hole) are possibly introduced during the general synthesis...
We propose a straightforward intensity-based dissolve detection method which is able to cope with the particular constraints of the artistic animated movie domain. It uses the hypothesis that during a dissolve, the amount of fading-out and fading-in pixels should be high. Instead of just applying a global threshold, as most of the existing approaches do, we use a twin-threshold approach coped with...
Image based rendering is an attractive alternative for generating novel views compared to model based rendering due to its lower complexity and potential for photo-realistic results. We present a fast unsupervised method for synthesising arbitrary viewpoints of a scene from a set of existing views. Our novel improvements include optimising the placement of depth layers to take advantage of the composition...
Polarization is a fundamental property of light for which humans have no innate detector. Numerous sensors have been devised to capture this property of light. Of these types of sensors, division of focal plane polarimeters allow for both high resolution and real-time capture of the angle and degree of polarization. This paper details a formal system of measurements designed to categorize the optical...
The display of complex 3D scenes in real-time on mobile devices is difficult due to the insufficient data throughput and a relatively weak graphics performance. Hence, we propose a client-server system, where the processing of the complex scene is performed on a server and the resulting data is streamed to the mobile device. In order to cope with low transmission bit rates, the server sends new data...
In this paper, we introduce a novel probabilistic approach to handle occlusions and perspective effects. The proposed method is an object based method embedded in a marked point process framework. We apply it for the size estimation of a penguin colony, where we model a penguin colony as an unknown number of 3D objects. The main idea of the proposed approach is to sample some candidate configurations...
Detection and removal of rain in image is a difficult and crucial problem due to the complexity of rain and its negative effects on image. The spatio-temporal property and the chromatic property of rain are comprehensively analyzed. Using the two properties, a simple but effective algorithm is proposed to detect and remove the rain of sequential images. Firstly time complexity of k-means is reduced...
In this paper, a low-complexity motion-based saliency map estimation method for perceptual video coding is proposed. The method employs a camera motion compensated vector map computed by means of a hierarchical motion estimation (HME) procedure and a Restricted Affine Transformation (RAT)-based modeling of the camera motion. To allow for a computationally efficient solution, the number of layers of...
Addressing the image correspondence problem by feature matching is a central part of computer vision and 3D inference from images. Consequently, there is a substantial amount of work on evaluating feature detection and feature description methodology. However, the performance of the feature matching is an interplay of both detector and descriptor methodology. Our main contribution is to evaluate the...
We propose a data-driven, multi-view body pose estimation algorithm for video. It can operate in uncontrolled environments with loosely calibrated and low resolution cameras and without restricting assumptions on the family of possible poses or motions. Our algorithm first estimates a rough pose estimation using a spatial and temporal silhouette based search in a database of known poses. The estimated...
The frequency spectrum of angle-of-arrival(AOA) fluctuations of laser propagating through atmospheric turbulence is important for the tracking subsystem using in free space optical(FSO) communication system. We present the high frequency spectrum feather of the AOA measuring with high frame rate digital camera. The high frame rate complementary metal oxide semiconductor(CMOS) camera can achieve 3000...
The fusion of stereo and laser range finders (LIDARs) has been proposed as a method to compensate for each individual sensor's deficiencies - stereo output is dense, but noisy for large distances, while LIDAR is more accurate, but sparse. However, stereo usually performs poorly on textureless areas and on scenes containing repetitive structures, and the subsequent fusion with LIDAR leads to a degraded...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.