The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Categorizing web-based videos is an important yet challenging task. The difficulties arise from large data diversity within a category, lack of labeled data, and degradation of video quality. This paper presents a large scale video taxonomic classification scheme (with more than 1000 categories) tackling these issues. Taxonomic structure of categories is deployed in classifier training. To compensate...
Automatic categorization of videos in a Web-scale unconstrained collection such as YouTube is a challenging task. A key issue is how to build an effective training set in the presence of missing, sparse or noisy labels. We propose to achieve this by first manually creating a small labeled set and then extending it using additional sources such as related videos, searched videos, and text-based webpages...
The problem we address is: Given line correspondences over three views, what is the condition of the line correspondences for the spatial relation of the three associated camera positions to be uniquely recoverable? The observed set of lines in space is called critical if there are multiple projectively nonequivalent configurations of the camera positions that can picture the same image triplet of...
Text detection in images is important for the retrieval of text information from digital graph, video databases and web sites. In this paper, a text detection method based on sparse representation classification with discrimination dictionaries is presented, which can detect text with different sizes, fonts and colors. The propose method detects edge information using Sobel operator and a sliding...
We address the question of, what structure of a set of lines in space constitutes a critical configuration to three generally positioned cameras. By critical configuration, it is meant a set of lines in space whose image projections to the cameras do not allow unique determination of the cameraspsila relative positions in space. We approach the question by looking into the trifocal tensor of the cameraspsila...
We present a higher-level visual representation, visual synset, for object categorization. The visual synset improves the traditional bag of words representation with better discrimination and invariance power. First, the approach strengthens the inter-class discrimination power by constructing an intermediate visual descriptor, delta visual phrase, from frequently co-occurring visual word-set with...
Digital animation is a widely used digital media on Internet to convey information. However, many animations nowadays are usually advertisements and contain only junk information. In order to detect and filter such information, a feature extraction, analysis and classification method for animation content understanding is proposed. A feature set composed of the traditional image/video features and...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.