The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
This paper introduces a novel pre-locating algorithm for rapid logo detection in unconstrained color images. This work is distinguished by two major contributions. The first is a new method of representation for logo called “spatial connected component descriptor” (SCCD) containing connected component (CC) prediction model and effective-CC pixel distribution histogram. The former represents combinations...
This paper presents a novel Global and Local Features based Latent Dirichlet Allocation model for scene recognition. The proposed model follows the bag-of-word framework like the Latent Dirichlet Allocation model. The traditional Latent Dirichlet Allocation model for scene recognition only uses the orderless bag of features called global features without considering spatial constraints on these features...
The story-related subject caption (SSC) in broadcast news video expresses the subject of news story, and plays an important role in news story segmentation and news video indexing. We find that a SSC always has a strip background and all the SSCs in one news video have the same style. By taking advantage of these characters, this paper presents an unsupervised approach to detect SSCs in broadcast...
In this paper, we present a revised method to compute the similarity of traditional string edit distance. Given two strings X and Y over a finite alphabet, an edit distance between X and Y can be defined as the minimum weight of transforming X into Y through a sequence of weighted edit operations. Because this method lacks some type of normalization, it would bring some computation errors when the...
Text extraction from images with complex backgrounds remains a challenging problem. Existing thresholding methods succeed in extracting text from images with simple or slowly varying backgrounds. However, when the backgrounds include sharply varying contours, some background pixels, which have similar intensities to the text, are classified to the text pixels in the binary image. In the literature,...
In a digital media asset management system, TV program segmentation is a key step for the long videos recorded from television channels to be represented in the hierarchical structure. In this paper, a novel approach based on acoustic cues for automatic segmenting television stream into individual programs is proposed. This presented method is composed of the following steps: (1) Several sets of repetitions...
To precisely segment Chinese characters in images and videos, we propose a novel recognition-based method in this paper. Our method consists of four steps: 1) Original text image is binarized to get a binary image; 2) Use a horizontal projection profile of the binary image to estimate the width of the bounding box of Chinese character; 3) Combined with character recognizer, use a vertical projection...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.