The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Since news videos are valuable sources of multimedia information on real-world events, there is a demand for viewing them efficiently. However, there is a problem that summarization methods based on auditory contents do not take into account the visual contents. In the case of news videos, due to its presentation style where audio contents and visual contents do not necessarily come from the same...
In this work, we present a method of human action recognition based on detection of interest points by spatial and temporal constraints. Firstly, the improved Harris-Laplace algorithm is proposed to solve the problem of multi-scale. Then, the bag-of-visual features (BoV) model is used for feature extraction, and is built the visual dictionary with K-means clustering. We train the Support Vector Machine...
The paper proposes CONSIFT descriptors which are the rotation-variant modification of SIFT (primarily for affine-invariant keypoints). CONSIFT of a keypoint K is its SIFT computed relatively to the orientation defined by the location of another keypoint L (and concatenated with similarly computed SIFT for keypoint L relatively to the location of K). It is additionally recommended that K and L are...
A method for partial near-duplicate retrieval in random images is proposed and evaluated. Unlike the majority of existing methods, it is based on matching individual keypoints only (i.e. no analysis/verification of configuration constraints). The proposed description of keypoints incorporates affine-invariant representation of keypoint bundles (photometric and geometric properties of neighboring keypoints)...
In this paper, we propose a novel texture descriptor, Structured Texton, to extract and characterize meaningful texture patterns in images. Structured Textons are constructed by grouping local extremum regions connected by the nesting relationship. To further improve the discriminative ability, high order texton words are generated from the Structured Textons, preserving both the appearance information...
We propose an approach to improving the detection results of a generic offline trained detector on a specific video. Our method does not leverage visual tracking as most detection by tracking methods do. Instead, the proposed detection by detections approach can serve as a more confident initialization for detection by tracking methods. Different from other supervised detector adaptation methods,...
Searching interesting regions in aerial video is a new and challenging problem. This paper presents an approach to detect visual interesting regions in aerial video using pLSA topic model. Traditional interesting region detection approaches just use bottom-up information, such as color, orientation and movement etc. Our proposed method can discover the semantic content of the whole image, the co-occurrence...
The bag-of-keypoints representation started to be used as a black box providing reliable and repeatable measurements from images for a wide range of applications such as visual object recognition and texture classification. This order less bag-of-keypoints approach has the advantage of simplicity, lack of global geometry, and state-of-the-art performance in recent texture classification tasks. In...
Tracking-by-detection is an attractive paradigm for intelligent visual surveillance applications where clutter, lighting variations, target overlap and occlusions hamper conventional background modeling. However, state-of-the-art vehicle and pedestrian detectors based on discriminative classification are too computationally expensive for real-time implementation on embedded smart cameras. This paper...
In this paper a system for illuminated manuscripts images analysis is presented. In particular the bag-of-keypoints strategy, commonly adopted for object recognition, image classification and scene recognition, is applied to the classification of automatically extracted miniatures. Pictures are characterized by SURF descriptors, and a classification procedure is performed, comparing the results of...
In this paper we propose a new method for human action categorization by using an effective combination of novel gradient and optic flow descriptors, and creating a more effective codebook modeling the ambiguity of feature assignment in the traditional bag-of-words model. Recent approaches have represented video sequences using a bag of spatio-temporal visual words, following the successful results...
In this paper, we propose a method for document image segmentation based on pLSA (probabilistic latent semantic analysis) model. The pLSA model is originally developed for topic discovery in text analysis using "bag-of-words" document representation. The model is useful for image analysis by "bag-of-visual words" image representation. The performance of the method depends on the...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.