The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Image classification is a method that distinguishes the different categories of targets based on the different features of image. The current problem usually is that the feature modeling of target has a great influence on recognition robustness. In order to solve this problem, a correlation-based method is presented to optimize the bag-of-visual-word (BOVW) model by reducing the dictionary size. The...
Deep learning has brought a series of breakthroughs in image processing. Specifically, there are significant improvements in the application of food image classification using deep learning techniques. However, very little work has been studied for the classification of food ingredients. Therefore, this paper proposes a new framework, called DeepFood which not only extracts rich and effective features...
With the exponential growth of web meta-data, exploiting multimodal online sources via standard search engine has become a trend in visual recognition as it effectively alleviates the shortage of training data. However, the web meta-data such as text data is usually not as cooperative as expected due to its unstructured nature. To address this problem, this paper investigates the numerical representation...
Human action recognition is an imperative research area in the field of computer vision due to its numerous applications. Recently, with the emergence and successful deployment of deep learning techniques for image classification, object recognition, and speech recognition, more research is directed from traditional handcrafted to deep learning techniques. This paper presents a novel method for human...
To achieve more effective solution for large-scale image classification (i.e., classifying millions of images into thousands or even tens of thousands of object classes or categories), a deep multi-task learning algorithm is developed by seamlessly integrating deep CNNs with multi-task learning over the concept ontology, where the concept ontology is used to organize large numbers of object classes...
A novel Hierarchical Structured Dictionary Learning (HSDL) algorithm is proposed in this paper. It aims to learn classs-pecific dictionaries for all classes simultaneously in a hierarchical structure. A discriminative term based on Fisher discrimination criterion is jointly considered for both the classs-pecific dictionaries in the lower level and the shared dictionaries in the upper level to enhance...
In this paper we propose to face the problem of event detection from single images, by exploiting both background information often containing revealing contextual clues and details, which are salient for recognizing the event. Such details are visual objects critical to understand the underlying event depicted in the image and were recently defined in the literature as “event-saliency”. Adopting...
In this work, an interactive visual system MICS is presented for large-scale brain CT image classification. Automatic feature extraction algorithms are added in MICS to improve system efficiency and classification accuracy. In visualization part, we designed an interactive feature extraction interface, enable users to extract and fine-tune image features according to specific requirements. In addition,...
Sparse Coding is a widely used method to represent an image. However, sparse coding and its improved algorithms have the problem of complex computation and long running time and so on. For these problems, we propose an image classification method based on hash codes and space pyramid, which encodes local feature points with hash codes instead of sparse coding. Firstly, extract the local feature points...
Image emotion analysis is a new and challenging research direction that gains more and more attention in the research community. Most previous works in this field only use common or generic features, and have hard restrictions on training images, such as scale, resolution, etc. Inspired by scale-space theories and psychology theories of color, we propose a procedure to extract interpretive features...
We consider the use of transfer learning, via the use of deep Convolutional Neural Networks (CNN) for the image classification problem posed within the context of X-ray baggage security screening. The use of a deep multi-layer CNN approach, traditionally requires large amounts of training data, in order to facilitate construction of a complex complete end-to-end feature extraction, representation...
To train a scene classifier with good generalization capability, a large number of human labeled training images are often needed. However, a large number of well-labeled training images may not always be available. To alleviate this problem, the web resources-aided scene classification framework was proposed. The present paper is a new development based on our previously proposed framework [1], with...
Deep convolutional neural networks is a recently developed method that yields very successful results in image classification. Deep neural networks, which have a high number of parameters, require a large amount of data to avoid overfitting during training. For applications in which the available data is not adequate to train a deep neural network from the scratch, deep neural networks trained for...
In image classification and retrieval, the semantic gap is the major challenge. It characterizes the difference between human perception of a concept and how it can be represented using machine level language. Bag of visual words is a well-known efficient method for image representation, however it showed some limitations. The loss of information during the vector quantization process is one of these...
We present an image classification method which consists of salient region (SR) detection, local feature extraction, and pairwise local observations based Naive Bayes classifier (NBPLO). Different from previous image classification algorithms, we propose a scale, translation, and rotation invariant image classification algorithm. Based on the discriminative pairwise local observations, we develop...
Motivated by the widespread adoption of social networks and the abundant availability of user-generated multimedia content, our purpose in this work is to investigate how the known principles of active learning for image classification fit in this newly developed context. The process of active learning can be fully automated in this social context by replacing the human oracle with the user tagged...
This article puts forward a kind of huge amounts of multi-object image recognition method -- BVCNN. Firstly, BING method is used to recognize images, which greatly reduces the time of estimating image targets, and makes it possible that quickly identify multiple target images, compared to traditional convolution neural networks only achieving single target image recognition, Secondly, vectorization...
Kernel-based Support Vector Machine (SVM) is widely used in many fields (e.g. image classification) for its good generalization, in which the key factor is to design effective kernel functions based on efficient features. In this paper, we propose a new approach that uses a combination of global and local image features to represent images and learns Support Vector Machine classifier with a new and...
Constructing effective representations is a critical but challenging problem in multimedia understanding. The traditional handcraft features often rely on domain knowledge, limiting the performances of exiting methods. This paper discusses a novel computational architecture for general image feature mining, which assembles the primitive filters (i.e. Gabor wavelets) into compositional features in...
Given the proliferation of geo-tagged images, geo-aware image classification is an emerging topic. To derive a better image representation, tag features which represents an image as a histogram of tags are recently introduced. However, it is unclear whether geo tags can improve the tag features. To resolve the uncertainty, this paper studies geo-aware tag features. Our work is based on previous work...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.