The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
In this paper, we propose a new texture descriptor, completed local derivative pattern (CLDP). In contrast to completed local binary pattern (CLBP), which involves only local differences at each scale, CLDP encodes the directional variation of the local differences of two scales as a complementary component to local patterns in CLBP. The new component in CLDP, with regarded as the directional derivative...
We present a fully trainable solution for binarization of degraded document images using extremely randomized trees. Unlike previous attempts that often use simple features, our method encodes all heuristics about whether or not a pixel is foreground text into a high-dimensional feature vector and learns a more complicated decision function. We introduce two novel features, the Logarithm Intensity...
How to avoid the invading of the attack in the biometric system, such as 2D printed photos, gradually becomes an important research hotspot. In this paper, we present a novel descriptor in light field to tackle the issue. Based on the angular and spatial information in light field, the proposed light field histogram of gradient (LFHoG) descriptor is derived from three directions, including vertical,...
In this paper, a new multi-class classification method is proposed and evaluated in the problem of human action recognition in unconstrained environments. The proposed method exploits both the maximum margin property of multi-class Support Vector Machines and Linear Discriminant Analysis-based discrimination. Experiments indicate that by exploiting such discriminant information in a multi-class maximum...
We present a histogram-based real-time solution to detecting directly irradiated regions in digital fluoroscopic images. Our method leverages the power of model matching, machine learning and domain knowledge to characterize and segment images using histograms. The input image is automatically identified as containing partial, all, or null direct radiation. The regions with direct radiation are segmented...
In this paper, we propose to use contexts of superpixels as a prior to improve semantic segmentation by the CRF framework. A graphical model is constructed on over-segmented images. Our main contribution is to take the concept of “superpixel embedding” into consideration, which is formalized as a potential item for optimizing the energy of the whole graph. We also introduce two ways of calculating...
This paper presents an unsupervised approach to vocal detection in music recordings based on dictionary learning. At a first stage, the recording to be segmented is treated as training data and the K-SVD algorithm is used to learn a dictionary which sparsely represents a short-term feature sequence that has been extracted from the recording. Subsequently, the vectors of the feature sequence are reconstructed...
Dance traditions constitute a significant aspect of cultural heritage around the world. The organization, semantic analysis, and retrieval of dance-related multimedia content (i.e., music, video) in databases is, therefore, crucial to their preservation. In this paper we explore the problem of folk dances recognition from video recordings, focusing on Greek folk dances, using different representations...
Over the past years, automatic traffic accident detection (ATAD) based on video has become one of the most promising applications in intelligent transportation and is playing a more and more important role in ensuring travel safety. This paper proposes a classifier-based supervised method by viewing the last seconds before motor vehicle collisions as the detection target. In our method, we devise...
Histopathological analysis is crucial for the diagnosis of a large number of cancer types. A lot of progress has been made in the development of molecular based assays, but many of the cases still require the careful analysis of the stained tissue under a bright-field microscope and its analysis. This procedure is costly and time-consuming. We present a novel method for classification of cancer cells...
We consider the detection of the control or idle state in an asynchronous Steady-state visually evoked potential (SSVEP)-based brain computer interface system. We propose a likelihood ratio test using Canonical Correlation Analysis (CCA) scores calculated from the EEG measurements. The test exploits the state-specific distributions of CCA scores. The algorithm was tested on offline measurements from...
Accurate and confident diagnosis of ADHD is highly dependent on subjective observations. Several quantitative methods have been proposed, posing it as a two-class classification problem (ADHD and non-ADHD). However, the results have not made it past the research stage yet, as misclassification rates must be close to 0%. This study aims to discriminate ADHD and non-ADHD subjects using autoregressive...
Facial expressions are considered a reliable indicator in neonatal pain assessment. This paper proposes a new neonatal pain expression recognition method, which utilizes the feature descriptors based on weighted Local Binary Pattern (LBP) and the classifier based on sparse representation. Firstly, the normalized facial image is described using a feature vector, which is histogram sequence obtained...
In this paper, we present a detailed comparison study of skin segmentation methods for psoriasis images. Different techniques are modified and then applied to a set of psoriasis images acquired from the Royal Melbourne Hospital, Melbourne, Australia, with aim of finding the best technique suited for application to psoriasis images. We investigate the effect of different colour transformations on skin...
Automated texture analysis of lung computed tomography (CT) images is a critical tool in subtyping pulmonary emphysema and diagnosing chronic obstructive pulmonary disease (COPD). Texton-based methods encode lung textures with nearest-texton frequency histograms, and have achieved high performance for supervised classification of emphysema subtypes from annotated lung CT images. In this work, we first...
ShapeNets is an image representation, which is based on shape, compact structure, hierarchical image structure and appearance characteristic of object contour. In a ShapeNets, the shape of image is a window of containing objects which can be extracted with the method of objectness. The outline of objects can also be extracted in a line boundary detection algorithm based on histogram of gradients direction,...
Smart car shows great potential in our future life and has attracted lots of interests from many research and industry communities. In this field, the technique of machine vision and recognition plays an important role, for instance, the automatic front vehicle recognition can provide driving safety information for the smart car. Lots of previous work on front vehicle recognition has been done, most...
We present a baseline convolutional neural network (CNN) structure and image preprocessing methodology to improve facial expression recognition algorithm using CNN. To analyze the most efficient network structure, we investigated four network structures that are known to show good performance in facial expression recognition. Moreover, we also investigated the effect of input image preprocessing methods...
Many supervised approaches report state-of-the-art results for recognizing short-term actions in manually clipped videos by utilizing fine body motion information. The main downside of these approaches is that they are not applicable in real world settings. The challenge is different when it comes to unstructured scenes and long-term videos. Unsupervised approaches have been used to model the long-term...
In this paper, a bark recognition algorithm for plant classification is presented. A Least-Square Support Vector Machine (LSSVM) with image and data processing techniques is used to implement a general purpose automated classifier. Using a data base of 40 sections of photographs taken of each of the 23 species, we applied an algorithm to homogenize the illumination of the images. After applying it,...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.