The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Biometrics is the detailed measurements of the human body. Biometrics deals with automated methods of identifying a person or verifying the identity of a person based on physiological or behavioural characteristics. Protection of biometric data is gaining interest and digital watermarking techniques are used to protect the biometric data from either accidental or intentional attacks. Among the various...
In mixed-resolution (MR) stereoscopic video, one view is presented with a lower resolution compared with the other one; therefore, a lower bitrate, a reduced computational complexity, and a decrease in memory access bandwidth can be expected in coding. The human visual system is known to fuse left and right views in such a way that the perceptual visual quality is closer to that of the higher-resolution...
Most image retargeting algorithms rely heavily on valid saliency map detection to proceed. But the inefficiency of high quality saliency map detection severely restricts applications of these image retargeting methods. In this paper, we describe a stochastic algorithm for efficient context-aware saliency map detection. Our method is a multiple level saliency map detection algorithm which integrates...
This work proposes an interactive tool for creating stereo image from a mono image. The user interaction is defined as scribbling on the object of interest followed by relative depth assignment to the selected object. Initial step in the algorithm is to create structured image oversegments with intensity homogeneity and geometrical convexity constraint. The final image segmentation is realized by...
We propose a novel 2D to 3D scheme by considering perceived 3D experience caused by occlusion and visual attention. In this scheme, initial depth model, the saliency map of visual attention and occlusion analysis are integrated in depth calculation. Mean-while, characteristics of human visual system are also considered as weight factors. Then, depth normalization and refining are implemented. The...
We propose a novel method to automatically detect and extract the video modality of the sound sources that are present in a scene. For this purpose, we first assess the synchrony between the moving objects captured with a video camera and the sounds recorded by a microphone. Next, video regions presenting a high coherence with the soundtrack are automatically labelled as being part of the source....
Consecutive corrupted MBs or slice errors are commonly seen in modern video transmission systems. Temporal error concealment is an effective approach to reduce the impact of errors. Conventional temporal error concealment techniques recover slice errors on a MB basis. We propose a new novel temporal scheme for slice error concealment based on a size-adaptive region basis. Size-adaptive region boundary...
Cashiers in retail stores usually exhibit certain repetitive and periodic activities when processing items. Detecting such activities plays a key role in most retail fraud detection systems. In this paper, we propose a highly efficient, effective and robust vision technique to detect checkout-related primitive activities, based on a hierarchical finite state machine (FSM). Our deterministic approach...
Through our work we managed to operate a display which has 480×272 resolution and 16 bit color depth with an FPGA equipped with an sbRIO 9632 developer card. The program was written in LabView and rotates a cube. With this work we managed to rotate a cube in 3D, create a 2D image from it with central projection, and then visualize it on the display, all of this on FPGA. The connection between the...
1 The proposed interpolation filter comprises two concatenating filters, adaptive pre-interpolation filter (APIF) and the normative interpolation filter in H.264/AVC. The former is applied only to the integer pixels in the reference frames; the latter generates all the sub-position samples, supported by the output of APIF. The convolution of APIF and the standard filter minimizes the motion prediction...
Automatic Language Identification (LID) in music has received significantly less attention than LID in speech. Here, we study the problem of LID in music videos uploaded on YouTube. We use a “bag-of-words” approach based on state-of-the-art content based audio-visual features and linear SVM classifiers for automatic LID. Our system obtains 48% accuracy for a corpus of 25000 music videos and 25 different...
View synthesis offers a great flexibility in generating free viewpoint television (FTV) and 3D video (3DV). However, the depth-image-based view synthesis approach is very sensitive to errors in the camera parameters or poorly estimated depth maps (also called depth images). Because of these errors, three kinds of artifacts (blurring, contour, hole) are possibly introduced during the general synthesis...
We propose a straightforward intensity-based dissolve detection method which is able to cope with the particular constraints of the artistic animated movie domain. It uses the hypothesis that during a dissolve, the amount of fading-out and fading-in pixels should be high. Instead of just applying a global threshold, as most of the existing approaches do, we use a twin-threshold approach coped with...
This paper presents a novel method of visual saliency detection. The use of saliency promises benefits to multimedia applications. However, up to now just few reasonable applications of saliency exist. It is clear that limited accuracy is one of the possible reasons for this. Another reason could be that in general saliency allows us to detect salient regions of the image rather than objects. To fill...
In this paper, we describe a novel error diffusion scheme for higher halftone quality with less visual artifacts. The proposed algorithm improves mid-tone quality of error diffusion significantly by diffusing the error along a jumping scanning path. The algorithm calculates the accumulative error to determine the point where breaks up the scanning path. A cost function is developed to search the optimal...
This paper aims to provide an application that uses gestures to interact with virtual objects in an Augmented Reality application. The application has no major dependencies of the work environment, lighting or users' skin color. To achieve this goal, libraries of particular use for natural interaction and Kinect device, which serves to provide RGB images of the environment and the depth map of the...
We present a robotic vision system that is capable of tracking fast moving objects. We briefly describe the design and construction of the system and concentrate on the biological strategies we have implemented to improve the system's capabilities in terms of speed and accuracy. We have also explored new horizons in the area of artificial intelligence to propose some useful algorithms based on visual...
The paper presents a global motion compensation algorithm, designed to work in real time on low memory and low processing power hardware platforms, such as mobile phones. The algorithm is designed as one of the parts of a fully automated exposure fusion algorithm also intended for mobile platform. It implements translational shifts to one or both of the input images (the overexposed and the underexposed)...
In this paper, we present a new tonemapping operator to display high dynamic range image onto conventional displayable devices and printers. In our work, a new tone map algorithm, derived from the Contrast Limited Adaptive histogram Equalization (CLAHE) technique is presented. Due to different luminance intervals could result in overlapped reaction on the limited response in limited response range...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.