During the last half decade, convolutional neural networks (CNNs) have come to dominate semantic segmentation, which is a core task of various emerging industrial applications such as autonomous driving and medical imaging. However, training CNNs requires a huge amount of data, which is difficult to collect and laborious to annotate. Recent advances in computer graphics make it possible to train CNN...
Weakly supervised semantic segmentation and localization tend to focus only on the most important parts of an image, since they use only image-level annotations. In this paper, we solve this problem fundamentally via two-phase learning. Our networks are trained in two steps. In the first step, a conventional fully convolutional network (FCN) is trained to find the most discriminative parts...
We introduce a novel method for 3D object detection and pose estimation from color images only. We first use segmentation to detect the objects of interest in 2D, even in the presence of partial occlusions and cluttered backgrounds. In contrast to recent patch-based methods, we rely on a “holistic” approach: we apply to the detected objects a Convolutional Neural Network (CNN) trained to predict their...
Pedestrian detection is a critical problem in computer vision with significant impact on safety in urban autonomous driving. In this work, we explore how semantic segmentation can be used to boost pedestrian detection accuracy while having little to no impact on network efficiency. We propose a segmentation infusion network to enable joint supervision on semantic segmentation and pedestrian detection...
While deep learning is remarkably successful on perceptual tasks, it was also shown to be vulnerable to adversarial perturbations of the input. These perturbations denote noise added to the input that was generated specifically to fool the system while being quasi-imperceptible for humans. More severely, there even exist universal perturbations that are input-agnostic but fool the network on the majority...
Learned boundary maps are known to outperform handcrafted ones as a basis for the watershed algorithm. We show, for the first time, how to train watershed computation jointly with boundary map prediction. The estimator for the merging priorities is cast as a neural network that is convolutional (over space) and recurrent (over iterations). The latter allows learning of complex shape priors. The method...
In the field of connectomics, neuroscientists seek to identify cortical connectivity comprehensively. Neuronal boundary detection from Electron Microscopy (EM) images is often performed to assist the automatic reconstruction of neuronal circuits. However, the segmentation of EM images is a challenging problem, as it requires the detector to be able to detect both filament-like thin and blob-like thick membrane,...
This paper proposes a novel approach for segmenting primary video objects by using Complementary Convolutional Neural Networks (CCNN) and neighborhood reversible flow. The proposed approach first pre-trains CCNN on massive images with manually annotated salient objects in an end-to-end manner, and the trained CCNN has two separate branches that simultaneously handle two complementary tasks, i.e.,...
General human action recognition requires understanding of various visual cues. In this paper, we propose a network architecture that computes and integrates the most important visual cues for action recognition: pose, motion, and the raw images. For the integration, we introduce a Markov chain model which adds cues successively. The resulting approach is efficient and applicable to action classification...
Unsupervised learning from visual data is one of the most difficult challenges in computer vision. It is essential for understanding how visual recognition works. Learning from unsupervised input has an immense practical value, as huge quantities of unlabeled videos can be collected at low cost. Here we address the task of unsupervised learning to detect and segment foreground objects in single images...
Convolutional Neural Networks (CNNs) have shown great success in solving key artificial vision challenges such as image segmentation. Training these networks, however, normally requires plenty of labeled data, while data labeling is an expensive and time-consuming task, due to the significant human effort involved. In this paper we propose two pixel-level domain adaptation methods, introducing a training...
Indoor object recognition is a key task for mobile robot indoor navigation. In this paper, we propose a pipeline for indoor object detection based on a convolutional neural network (CNN). With the proposed method, we first pre-train an off-line CNN model using both the public Indoor Dataset and a private frames of videos (FoV) dataset. This is then followed by a selective search process to extract a region...
Automatic segmentation of the left ventricle (LV) can become a useful tool in echocardiography, for instance to provide automatic ejection fraction measurements or to initialize deformation imaging algorithms. Deep neural networks have recently shown very promising results for improving image classification and segmentation. These methods learn using only a set of input and output data, but require...
Automatic segmentation of the left ventricle (LV) can become a useful tool in echocardiography. Deep convolutional neural networks (CNNs) have shown promising results for image classification and segmentation in several domains; however, CNNs seem to require a lot of training data. In this work, CNNs are investigated for LV ultrasound image segmentation. We study if the need for manual annotation can...
To improve the accuracy of surface defect detection, a defect inspection approach based on visual saliency maps and a Support Vector Machine (SVM) is proposed. Monochrome fabric defect images are taken as examples in this paper. By analyzing the visual saliency maps of these images, the global associated value and the background associated value are extracted as the two features. After being normalized,...
Cardiac function is of paramount importance for both prognosis and treatment of different pathologies such as mitral regurgitation, ischemia, dyssynchrony and myocarditis. Cardiac behavior is determined by structural and functional features. In both cases, the analysis of medical imaging studies requires detecting and segmenting the myocardium. Nowadays, magnetic resonance imaging (MRI) is one of the...
Every organism emits energy around it, comprising UV radiation, EM radiation, infrared and thermal radiation. This energy around the human body represents the health condition of the subject under study. These energy fields are called the aura of the body under consideration. Several types of equipment exist to capture such energy. A Kirlian camera captures the distribution of energy radiation around...
In this paper, we propose an automatic brain tumor segmentation algorithm based on a 22-layer deep, three-dimensional Convolutional Neural Network (CNN) for the challenging problem of glioma segmentation. To correct the bias field distortion of MRI images, we apply the N4ITK method before intensity normalization. The use of several cascaded convolution layers with small kernels allows building...
Forensic odontology is a method of determining the identity of individuals based on dental identification. Teeth can provide information about an individual's identity because they are distinctive. Currently, forensic identification through dental radiography is performed manually, so it takes a long time to match the teeth with a human identity. Therefore, we need a system...
Command extraction from human beings becomes easier for a machine if it can analyze nonverbal ways of communication such as emotions. This paper focuses on improving the efficiency of extracting emotion from human facial expression images. The features extracted in this experiment were obtained from the JAFFE (Japanese Female Facial Expression) database, which includes 213 images of different...