The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Compared to image representation based on low-level local descriptors, deep neural activations of Convolutional Neural Networks (CNNs) are richer in mid-level representation, but poorer in geometric invariance properties. In this paper, we present a straightforward framework for better image representation by combining the two approaches. To take advantages of both representations, we extract a fair...
In image classification, the most powerful statistical learning approaches are based on the Bag-of-Words paradigm. In this article, we propose an extension of this formalism. Considering the Bag-of-Features, dictionary coding and pooling steps, we propose to focus on the pooling step. Instead of using the classical sum or max pooling strategies, we introduced a density function-based pooling strategy...
An important consideration in designing of visual object classification systems is the representation of images. In the past decades, significant improvement has been made, which can owe to two strategies: the proposal of new discriminative local image features together with the combination of existing local image features. In this paper, several strategies which provide the combination weights of...
In this work we tackle the problem of search personalization for on-line soft goods shopping. By learning what the user likes and what the user does not like, better search rankings and therefore a better overall shopping experience can be obtained. The first contribution of the work is in terms of feature selection: given the specific nature of the domain, we combine the traditional visual and text...
Recently, spatial pyramid matching (SPM) has achieved leading performance in scene classification. In SPM, spatial information of local features is integrated into representation by computing histograms of local features over increasingly fine regular grids on an image. But this spatial information encoding scheme is very restrictive, because it assumes that images from same scene category have similar...
This paper presents an actor video retrieval system based on face video-tubes extraction and representation with sets of temporally coherent features. Visual features, SIFT points, are tracked along a video shot, resulting in sets of feature point chains (spatio-temporal tubes). These tubes are then classified and retrieved using a kernel-based SVM learning framework for actor retrieval in a movie...
Representing image using the distribution of local features on a group of visual words is an effective method for visual categorization. Visual words can be related conceptually and the information can be incorporated to enhance the performance. However, conventional methods usually use visual words independently without considering this. This paper proposes a novel approach to measure the conceptual...
In this paper, we consider the problem of classifying a real world image to the corresponding object class based on its visual content via sparse representation, which is originally used as a powerful tool for acquiring, representing and compressing high-dimensional signals. Assuming the intuitive hypothesis that an image could be represented by a linear combination of the training images from the...
In this paper, we propose a group-sensitive multiple kernel learning (GS-MKL) method to accommodate the intra-class diversity and the inter-class correlation for object categorization. By introducing an intermediate representation "group" between images and object categories, GS-MKL attempts to find appropriate kernel combination for each group to get a finer depiction of object categories...
Support vector machines (SVM) has been widely applied in the area of content-based image retrieval in order to learn high-level concepts from low-level image features. Most existing SVM based image retrieval algorithms only rely on global-based features to represent the image content, which obviously can not well reflect the image semantic content. Region-based representations are far more close to...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.