The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Video search today uses the metadata surrounding the video, ignoring its semantic content. Over the years, a lot of research has gone into indexing and browsing of sports video content. In this work, we present a novel approach for classification of events in cricket videos and thus, summarize its visual content. The proposed method segments a cricket video into shots and identifies the visual content...
In this paper, robust descriptors are extracted to detect video copies generated by complicated transformations. The main contribution of the proposed method lies in three aspects. Firstly, the complicated transformations on video copies are identified and tackled to guarantee the extraction of robust descriptors. Secondly, a motion classification approach is proposed to divide the video into video...
For the question of how to classification of cultural relic videos, this paper put forward the analysis method based on semantic and it's two-steps in classification of video: Firstly, based on algorithm that has been used, separating the cultural relic shots and extracting the key frames; Then extracting the features of the key frames. Next training classification of the key frames' features using...
This article shows the improvement of automatic cartoon classification. Two new visual features - color component and color kind based on region segmentation - are proposed. Compared to traditional HSV color histogram and texture, experiment using the two new features can achieve better result, with less dimensions and higher mining efficiency.
Human detection and recognition at a distance is recently a matter of great concern among computer vision researchers. This paper introduces a new set of human body features for the recognition of detected human as an object. The feature extraction is performed by an established human model consisting of five parts. These features consist of geometric calculations of detected object and their different...
We consider the task of automatic detection and recognition of traffic signs in video. We show that successful off-the-shelf detection (Viola-Jones) and classification (SVM) systems yield unsatisfactory results. Our main concern are high false positive detection rates which occur due to sparseness of the traffic signs in videos. We address the problem by enforcing spatio-temporal consistency of the...
Scene classification is used to categorize images into different classes, such as urban, mountain, beach, or indoor. This paper presents work on scene classification of television shows and feature films. These types of media bring unique challenges that are not present in photographs, as many shots are close-ups in which few characteristics of the scene are visible. In our work, the video is first...
In this paper, a new action feature descriptor PEM (PCRM-EOH-MOH) is proposed for fast human action recognition. This descriptor is constructed based on three information channels: Pixel Change Ratio Map (PCRM), Edge Orientation Histogram (EOH) and Motion Orientation Histogram (MOH) features. A video sequence is first represented as a collection of PEM features. Then, video representations are constructed...
Efficient pedestrian detection is essential for intelligent vehicles and driver assistance system. An increasing number of experts have attached more importance to this subject in recent years. The input of this task is a video captured by a monocular optical camera which is installed on a vehicle. And the aim is to locate every pedestrian in each frame of the video as soon as possible. This task...
Human detection has always been an important part of computer vision but many implementations lack the real-time performance that real world applications require. This paper presents a real-time implementation of human detection in video using the state-of-the-art histograms of oriented gradients method. Each image in the video sequence is tested at multiple scales using a sliding window. Histograms...
Different facial expressions are related to a small set of muscles and limited ranges of motions. In this paper we propose an automatic facial expression recognition system, different from other automatic methods in both face detection and feature extraction. In system the facial expressions identify itself in video sequences. First, the differences between neutral and emotional states are detected...
Night surveillance is a challenging task because of low brightness, low contrast, low signal to noise ratio (SNR) and low appearance information. Most existing models for night surveillance share the following problems: a lack of adaptability for different scenes and separation between detection and tracking. To solve these problems we propose a model based on salient contrast change (SCC) feature,...
We present a semi-automatic system that converts conventional video shots to stereoscopic video pairs. The system requires just a few user-scribbles in a sparse set of frames. The system combines a diffusion scheme, which takes into account the local saliency and the local motion at each video location, coupled with a classification scheme that assigns depth to image patches. The system tolerates...
In this paper, we propose a method for fast pedestrian detection in images/videos. Multi-scale orientated (MSO) features are proposed to represent coarse pedestrian contour, on which Adaboost classifiers are trained for pedestrian coarse location. In the fine detection, histogram of oriented gradient (HOG) features and SVM classifiers are employed to precisely classify pedestrians and non-pedestrians...
This paper presents a semantic-based video analysis method and two steps of their classification in the wushu video: Firstly, used of the image frame difference to construct on the basis of background image, and used of threshold settings to determine the body information and background information, This can achieve the elimination of background and exercise the purpose of extracting the human body...
Text in video frames provides brief and important content information which is helpful to video scene understanding, annotation and searching. A new text detection method in video frames is proposed in this paper. First, a small overlapped sliding window is scanned over the frame from which hybrid features are extracted. And then SVM classifier is employed to distinguish the text from background....
Wireless Capsule Endoscopy (WCE) is a non invasive procedure which is used to view the lower gastrointestinal tract. Physicians can detect diseases such as bleeding, Crohn's disease, peptic ulcers, and colon cancer. In this paper a methodology is presented to identify peptic ulcers in the small intestine automatically. It first performs color transformation into the HSV color space; it utilizes log...
Existing pedestrian and vehicle detection algorithms use 2D cues of objects, such as pixel values, color, texture, shape information or motion. The use of 3D cues in object detection, on the other hand, is not well studied in the literature. In this paper, we propose an efficient algorithm that detects pedestrian and vehicle using their 3D cues. The proposed algorithm first detects moving objects...
Shot boundary detection (SBD) is the basis of interpreting video content, event, and relevant knowledge. As existing SBD algorithms are sensitive to video object motion and no reliable solution exists to provide accurate shot boundary detection, it still remains an unsolved problem. We propose a new algorithm of shot boundary detection in this paper, which employs support vector machine (SVM) as a...
News caption text contains useful information for video annotation, indexing and searching. This paper presents a new caption text location method. First, a small overlapped sliding window is scanned over the keyframe. Then texture and edge features are extracted as the input to SVM classifier to distinguish caption text from background. At last, vote mechanism and morphological filter are performed...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.