The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Drift is the most difficult issue in object visual tracking based on framework of “tracking-by-detection”. Due to the self-taught learning, the mis-aligned samples are potentially to be incorporated in learning and degrade the discrimination of the tracker. This paper proposes a new tracking approach that resolves this problem by three multi-level collaborative components: a high-level global appearance...
With the development of modern transportation system, the technology of Intelligent Transportation System has attracted more and more interests from research and industry communities. Road traffic sign recognition is one of the most important topics in this field. The traditional methods excessively rely on image morphology, segmentation and various image feature extractions, however most of the methods...
There is growing interest in social image classification because of its importance in web-based image application. Though there are many approaches on image classification, it is a great problem to integrate multi-modal content of social images simultaneously for social image classification, since the textual content and visual content are represented in two heterogeneous feature spaces. In this study,...
With the development of stone processing and sales, effective stone surface texture image recognition methods are needed. We proposed a new stone surface texture image recognition method based on texture and colour. We combine the following visual features: Gabor features which can well simulate the single cell sensing profile of mammalian visual neurons, The Grey-level Co-occurrence Matrices(GLCM)...
In our study, we sought to generate rules for cognitive distractions of car drivers using data from a driving simulation environment. We collected drivers' eye-movement and driving data from 18 research participants using a simulator. Each driver drove the same 15-minute course two times. The first drive was normal driving (no-load driving), and the second drive was driving with a mental arithmetic...
Label-deficient semi-supervised learning is a challenging setting in which there is an abundance of unlabeled data but a dearth of labeled data. We propose a method for applying Gaussian process latent variable models (GPLVM) in a label-deficient setting, a method in which the discriminative GPLVM objective function trains a back-constraining neural network followed by a transformation into a semi-supervised...
Blog is becoming an increasingly popular media for information publishing. Besides the main content, most of blog pages nowadays also contain noisy information such as advertisements etc. Removing these unrelated elements can improves user experience, but also can better adapt the content to various devices such as mobile phones. Though template-based extractors are highly accurate, they may incur...
A video contains rich perceptual information, such as visual appearance, motion and audio, which can be used for understanding the activities in videos. Recent works have shown the combination of appearance (spatial) and motion (temporal) clues can significantly improve human action recognition performance in videos. To further explore the multimodal representation of video in action recognition,...
From past to present, individuals' appreciation of taste has always been wondered. Moreover, there is an increasing research interest in measuring taste appreciation. Most of the previous work in this area are psychological studies that rely on manual coding of facial actions and/or emotional expressions. Consequently, these studies depend on human observations. We propose a preliminary study for...
In this paper, we address the problem of semi-supervised visual domain adaptation for transferring scene category models from ground view images to overhead view very high-resolution (VHR) remote sensing images. We introduce a multiple kernel learning domain adaptation algorithm to fuse the information from multiple features and cope with the considerable variation in feature distributions between...
An image retrieval system is a technique for browsing, searching and retrieving images from a big database of digital images. In this paper, we propose a new content-based image retrieval system that can solve the object and scene recognition problems and categorize similar images. The proposed model consists of a deep structure support vector machine with Gaussian mixture model, which is combined...
In this paper, we propose a l2,1-norm based discriminative robust transfer learning (DKTL) method for domain adaptation tasks. The key idea is to simultaneously learn discriminative subspaces by using the proposed domain-class-consistency (DCC) metric, and the representation based robust transfer model between source domain and target domain via l21-norm minimization. The DCC metric includes two parts:...
Achieving precise and robust human detection and tracking over camera networks is a very challenging task in the research of intelligent video surveillance. Its difficulties mainly result from abrupt human object motion, object occlusion and object scale change, and changing object appearance due to changes in illumination and viewpoint, non-rigid deformations, intra-class variability in shape and...
P300, which is usually evoked by visual stimulus, is widely used in electroencephalography (EEG) based brain computer interface (BCI) studies. As an application-oriented BCI study, the P300 speller would inevitably be used in outdoor environments. However, the visual stimulation effect might be interfered by the reflections in outdoor environment. This paper attempted to improve the outdoor P300 speller...
An object often has many distinct manifestations in computer vision, which brings a great challenge to utilizing more comprehensive information. Inspired by some biological researches about edge sensitivity and global structure priority, our key insight is to establish unified transfer classification network with shared contour information. Combining two convolutional networks with three cascaded...
Correct recognition of emotion veracity exhibited in facial gestures is troublesome for people. Yet, there is a belief that computer systems are able to perceive some tiny changes correlated to veracity expression, invisible for people, and therefore are able to improve proper perception of emotions. This work addresses the problem of spontaneous and posed smile recognition and suggests two approaches...
Along with the development of social network, more and more people know the world by reading news. The problem about what kind of emotion is inspired when people read news is very worthy of discussion. This paper will mix Deep Belief Networks (DBN) model and Support Vector Machine (SVM) to a hybrid neural network model by using the Contrast Divergence (CD) algorithm to estimate the weights when training...
Decoding human emotion induced by visual stimuli from brain signal retrieved by functional magnetic resonance imaging are proposed. Brain decoding technique with support vector machine is used to predict human emotion. Human subjective experiment are conducted by five subjects and the result shows that the accuracy of prediction is around 70% for two subjects and around 50% for other two subjects...
Image modality classification categorizes images according to their type. It is an important module in the Open-iSM multimodal (text+image) search engine that retrieves figures from biomedical articles. It is a hierarchical classification where on the top level the input figures are classified into two general categories: regular images (X-ray, CT, MRI, photographs, etc.) vs. illustration images (cartoon...
Machine learning from brain images is a central tool for image-based diagnosis and diseases characterization. Predicting behavior from functional imaging, brain decoding, analyzes brain activity in terms of the behavior that it implies. While these multivariate techniques are becoming standard brain mapping tools, like mass-univariate analysis, they entail much larger computational costs. In an time...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.