Illumination changes present challenging problems to video surveillance algorithms tasked with identifying and tracking objects. Illumination changes can drastically alter the appearance of a scene, causing truly salient features to be lost amid otherwise stable background. We describe an illumination change compensation method that identifies large, stable, chromatically distinct background features, called...
This paper presents an automatic people counting system based on face detection, in which a video camera is set up to count the number of people passing through a gate or door. The basic idea is to first use the frame difference to detect the rough edges of moving people and then use the chromatic feature to locate people's faces. Based on NCC (Normalized Color Coordinates) color space, the initial...
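The two building blocks named in this abstract, frame differencing and normalized color coordinates, can be illustrated with a minimal sketch. This is not the paper's implementation: the function names, the threshold of 25 grey levels, and the handling of black pixels are all assumptions chosen for illustration, and frames are represented as plain nested lists.

```python
def frame_difference(prev, curr, threshold=25):
    """Binary motion mask from two greyscale frames (lists of rows).

    A pixel is marked 1 when its grey level changed by more than
    `threshold` between frames. The threshold value is an assumption.
    """
    return [
        [1 if abs(c - p) > threshold else 0 for p, c in zip(prev_row, curr_row)]
        for prev_row, curr_row in zip(prev, curr)
    ]


def normalized_color(r, g, b):
    """Normalized color coordinates: (r, g) = (R, G) / (R + G + B).

    Dividing by the channel sum removes overall intensity, leaving only
    chromaticity. For a black pixel the sum is zero; returning the neutral
    point (1/3, 1/3) is a convention assumed here.
    """
    total = r + g + b
    if total == 0:
        return (1.0 / 3.0, 1.0 / 3.0)
    return (r / total, g / total)


if __name__ == "__main__":
    prev = [[10, 10], [10, 10]]
    curr = [[10, 100], [12, 10]]
    print(frame_difference(prev, curr))
    print(normalized_color(100, 50, 50))
```

Only pixels flagged by the motion mask would then need a chromaticity test, which is one plausible reason to run the frame difference first.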
Shopping carts have traditionally been used as a tool provided to the customers in retail stores to carry items from the shelf to checkout stations. These days shopping carts can also be used as a security checkpoint to prevent store losses. All the items collected in a shopping cart are supposed to be unloaded at the checkout station to be scanned and included in the bill. Any items left in the cart...
A critical issue in measuring video similarity is that most video data are huge files that vary in length and amount of data, which makes data processing time-consuming. Therefore, reducing the dimensionality of the data becomes a necessity. This paper proposes a video similarity measurement approach for sports video classification by dimensionality reduction with distance space and random...
Video text provides high-level semantic information. However, due to the complex background in video, it is very difficult to extract text efficiently. Although many methods rely on assumptions about a single feature, such as texture or connected areas, there are still problems in multilingual text extraction because the appearance of the text differs considerably. In this paper, the color and...
In video-surveillance systems, the moving object segmentation stage (commonly based on background subtraction) has to deal with several issues like noise, shadows and multimodal backgrounds. Hence, its failure is inevitable and its automatic evaluation is a desirable requirement for online analysis. In this paper, we propose a hierarchy of existing performance measures not based on ground truth for...
This paper proposes a novel weighted feature fusion in color face recognition (FR) to automatically annotate faces in personal videos. In the proposed FR method, multiple face images (belonging to the same subject) are clustered from a sequence of video frames. To facilitate a complementary effect on improving annotation performance, the grouped faces are combined using the proposed weighted feature...
Robust foreground detection is a fundamental precursor of many video processing applications. Although various approaches have been advanced, many factors still make detection very challenging: 1) Dynamic background with gradual brightness changes, camera movement and large amounts of noise. 2) Sharp illumination changes caused by shadows, lights switching on and off, and so on. 3) Real-time requirement...
Methods for video copy detection are typically based on the use of low-level visual features. However, low-level features may vary significantly for near-duplicates, which are video sequences that have been the subject of spatial or temporal modifications. As such, the use of low-level visual features may be inadequate for detecting near-duplicates. In this paper, we present a new video copy detection...
In this paper, we propose a technique for classifying shots of playfield-based sports video into their respective view classes. Based on common broadcasting style, a shot can be classified as a far-view or a closeup-view. The technique considers the frame-wise color values of each pixel in the HSV color space, while at the same time calculating the assumed object size within the segmented playfield...
An efficient colorization scheme for images and videos based on prioritized source propagation is proposed in this work. A user first scribbles colors on a set of source pixels in an image or the first frame of a movie. The proposed algorithm then propagates those colors to the other non-source pixels and the subsequent frames. Specifically, the proposed algorithm identifies the non-source pixel with...
We introduce the first visual dataset of fast foods with a total of 4,545 still images, 606 stereo pairs, 303 360° videos for structure from motion, and 27 privacy-preserving videos of eating events of volunteers. This work was motivated by research on fast food recognition for dietary assessment. The data was collected by obtaining three instances of 101 foods from 11 popular fast food chains, and...
A novel method for classifying video shot genre based on data mining is proposed. Shot boundary detection and key frame extraction are performed first. Then, visual features such as color and motion are extracted from the key frames and shots. Furthermore, a decision tree is applied to discover the rules relating these features to shot genres from a large amount of training data. These rules are...
A small color space conversion (CSC) block in a video input/output (VIO) architecture can efficiently support both the ITU-R BT.601 and ITU-R BT.709 standards for the RGB and YCbCr color formats. The design of an arbitrary (programmable) coefficient helps to reduce the size of the CSC block. A new dithering technique is implemented to achieve the advantages, high-quality image and no buffer memory, of error-diffusion...
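The idea of serving both standards from one conversion path with programmable coefficients can be sketched in software as follows. This is only an illustration, not the hardware block the abstract describes: it uses full-range floating-point values in [0, 1], ignores quantization and dithering, and the names `BT601`, `BT709` and `rgb_to_ycbcr` are hypothetical. The luma coefficients themselves (Kr = 0.299, Kb = 0.114 for BT.601; Kr = 0.2126, Kb = 0.0722 for BT.709) come from the two recommendations.

```python
# (Kr, Kb) luma coefficients; Kg follows as 1 - Kr - Kb.
BT601 = (0.299, 0.114)
BT709 = (0.2126, 0.0722)


def rgb_to_ycbcr(r, g, b, coeffs=BT601):
    """Convert normalized RGB in [0, 1] to (Y, Cb, Cr).

    Y lies in [0, 1]; Cb and Cr lie in [-0.5, 0.5]. The same arithmetic
    serves both standards: only the coefficient pair changes.
    """
    kr, kb = coeffs
    kg = 1.0 - kr - kb
    y = kr * r + kg * g + kb * b
    cb = (b - y) / (2.0 * (1.0 - kb))
    cr = (r - y) / (2.0 * (1.0 - kr))
    return y, cb, cr


if __name__ == "__main__":
    # The same pure-red pixel has a different luma under each standard.
    print(rgb_to_ycbcr(1.0, 0.0, 0.0, BT601))
    print(rgb_to_ycbcr(1.0, 0.0, 0.0, BT709))
```

Passing the coefficients as a parameter mirrors the abstract's point that a single datapath with loadable coefficients can replace separate fixed converters.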
A sign language number recognition system lays the foundation for handshape recognition, which addresses real and current problems in signing in the deaf community and leads to practical applications. The input to the sign language number recognition system is 5000 Filipino sign language number video files with a frame size of 640 x 480 pixels at 15 frames/second. The color-coded gloves use less color compared...
To address missed detections in video shot segmentation, which occur because the global color feature cannot reflect changes in image content, a new shot detection method is proposed that combines the global color feature with local edge characteristics. To determine the shot boundaries, the paper eliminates flash interference with the gray...
This paper is a new attempt to introduce a "multi-resolution and multi-cue" particle filter tracking method. It combines multiple feature distributions and multiple resolutions to facilitate 2D video tracking from a monocular view. The method's benefits lie in its speed and its robustness. Speed is improved by multiple resolutions, which reduce the number of particles dramatically while keeping...
Recognizing objects being manipulated in hands can provide essential information about a person's activities and have far-reaching impacts on the application of vision in everyday life. The egocentric viewpoint from a wearable camera has unique advantages in recognizing handled objects, such as having a close view and seeing objects in their natural positions. We collect a comprehensive dataset and...
As the Internet and multimedia technology develop, the content security of multimedia has become more and more important. To distinguish various contents in multimedia, we present an approach for automatic video classification based on a combination of MPEG-7 descriptors and a second-prediction strategy. In this paper, color, texture, shape and motion descriptors are extracted from five different...
We propose a way of quickly and automatically identifying the positions of players in soccer broadcast video. The approach is based on fast specification of camera view angles corresponding to frame pictures by using a "block-based search". Our method can decide on a camera view angle that corresponds to a frame picture by making block-based connections for how to allocate "line-blocks",...