The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Current telepresence systems, while being a great step forward in videoconferencing, still have important points to improve in what eye-contact, gaze and gesture awareness concerns. Many-to-many communications are going to greatly benefit from mature auto-stereoscopic 3D technology; allowing people to engage more natural remote meetings, with proper eye-contact and better spatiality feeling. For this...
This paper proposes the method of video image mosaics in real-time based on Scale Invariant Feature Transform (SIFT) algorithm. The real-time processing is great significant for the video image mosaics. SIFT is the efficient method for extracting distinctive invariant features from images. But it costs so much computation time that it can't meet the real-time processing demand. Therefore, an improved...
This work presents a novel approach to accurately segment video objects in complex environments, based on region-level analysis. A robust-to-illumination region segmentation, a flexible and robust framework for region characterization and matching, and a multi-layer region-based background model, are key aspects of the proposed approach. Presented results show that, not requiring any post-processing...
Video text provides high-level semantic information. However, due to the complex background in video, it is of great difficulty to extract text efficiently. Although many methods hold assumptions on single feature, such as texture, connected areas etc., there are still some problems in dealing with multilingual text extraction because of its quite different appearance. In this paper, the color and...
Identifying handled objects, i.e. objects being manipulated by a user, is essential for recognizing the person's activities. An egocentric camera as worn on the body enjoys many advantages such as having a natural first-person view and not needing to instrument the environment. It is also a challenging setting, where background clutter is known to be a major source of problems and is difficult to...
In complex background, conventional automatic video-text location methods can not robustly locate text. A robust video-text location method is proposed in this paper. It can be divided into two stages. In the first stage, an unsupervised paradigm based on wavelet is applied to obtain candidate text region. In the second stage, traversing line with its aptitude spectrum is introduced and applied to...
Motion-based video segmentation remains an important problem in video processing. A promising approach that has received significant attention formulates the problem as an energy minimization within a MAP-MRF framework. While a great deal of progress has been made toward finding robust and computationally reasonable motion segmentation methods, automatically generating such a segmentation that performs...
This paper proposes a video hashing method for the purpose of digital rights management, media monitoring and tracking the distribution of illegally copied video. The proposed algorithm is applied to temporally representative images of a video. The resulting hashes are found experimentally to be highly robust to a range of attacks including noise, rotation, time shift, and frame dropping.
Occlusion in the monitoring video is a problem often encountered in the moving vehicles detection, tracking and identification. In practice, the moving vehicles that are needed to be tracked are often overlapped in the image. As a result, the mistakes in the targets segment and traffic parameters calculation are the problems much more difficult to solve. Generally, the moving target occlusion in video...
Text in video is a compact but effective clue for video indexing and summarization. In this paper, we propose an edge-based video text extraction approach with low computation, which can automatically detect and extract text from complex video frames. We first detect the edge maps of both an intensity image and its binarized image, and merge the two into one edge map, which contains less edge pixels...
In this paper we present a segmentation system for monocular video sequences with static camera that aims at foreground/background separation and tracking. We propose to combine a simple pixel-wise model for the background with a general purpose region based model for the foreground. The background is modeled using one Gaussian per pixel, thus achieving a precise and easy to update model. The foreground...
This paper presents the study of vocal videostroboscopic videos to detect morphological pathologies using a combination of motion information and segmentation. The motion permits us to obtain the keyframes of total (or minimum) closure and maximum opening and to have the initialization for the segmentation process. The segmentation is made analyzing the image textures applying Gabor filtering. After...
We investigate effective means of building robust dictionaries for detecting the sparse foreground in videos with static background. This work is an extension to our existing solution to foreground/background segmentation problem using the linear programming method proposed to detect sparse errors in signals, which are created by a known dictionary. The dictionary building methods we study are established...
In this paper, we propose a novel foreground segmentation approach for applications using static cameras. The foreground segmentation is modeled as an energy function optimum process, where energy function is based on Markov Random Field (MRF) and efficiently optimized by Gibbs sampling. The essence of our method is that we fuse four foreground/background models based on color and texture. This allows...
In this paper, we explore new edge features such as straightness for the elimination of non significant edges from the segmented text portion of a video frame to detect accurate boundary of the text lines in video images. To segment the complete text portions, the method introduces candidate text block selection from a given image. Heuristic rules are formed based on combination of filters and edge...
Automatic video object segmentation based on spatial-temporal information has been a research topic for many years. Existing approaches can achieve good results in some cases, such as where there is a simple background. However, in the case of cluttered backgrounds or low quality video input, automatic video object segmentation is still a problem without a general solution. A novel approach is introduced...
In this paper, a novel road modeling strategy is proposed, defining an accurate and robust system that operates in real-time. The strategy aims to find a trade-off between computational requirements of real systems and accuracy and robustness of the results. The basis of the strategy is an adaptive road segmentation technique which ensures robust detections of lane markings and vehicles. A multiple...
In sports videos, text provides valuable information about the game such as scores and information about the players. This paper provides a golf navigation system based on this player information. In the case of baseball and soccer, the location of key captions such as scoreboard captions is generally fixed during the game. However, in golf, the location of the key captions containing player information...
This paper describes a technique, called V-mirroring, for integrating videos taken from different cameras with different viewpoints of the same scene. The term V-Mirroring stems from the use of virtual mirrors in order to composite videos together. These mirrors are placed in the scene, near to the locations of the cameras. Thereafter, for any given camera, its corresponding video is overlaid with...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.