The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Finding mines in Sonar imagery is a significant problem with a great deal of relevance for seafaring military and commercial endeavors. Unfortunately, the lack of enormous Sonar image data sets has prevented automatic target recognition (ATR) algorithms from some of the same advances seen in other computer vision fields. Namely, the boom in convolutional neural nets (CNNs) which have been able to...
In this paper, we address interesting questions about how feng shui influences house price from a data perspective. First, is feng shui likely to influence house price? Second, how do different feng shui features, e.g., house shape, master bedroom location, and other interior room arrangements, influence the price? Third, can we automatically diagnose the feng shui problems of a house? From a dataset...
In this paper, a unified deep convolutional architecture is proposed to address the problems in the person re-identification task. The proposed method adaptively learns the discriminative deep mid-level features of a person and constructs the correspondence features between an image pair in a data-driven manner. The previous Siamese structure deep learning approaches focus only on pair-wise matching...
Support vector data description (SVDD) is a popular technique for detecting anomalies. The SVDD classifier partitions the whole space into an inlier region, which consists of the region near the training data, and an outlier region, which consists of points away from the training data. The computation of the SVDD classifier requires a kernel function, and the Gaussian kernel is a common choice for...
With the rapid adoption of smartphones and tablets, more and more remote medical diagnostic applications have mushroomed. Tongue Diagnosis (TD) is a kind of noninvasive diagnostic technique, which offers significant information for health conditions. However, it is rather tough to extract the tongue from a high-quality image, in which there is a definite large area of the tongue, to say nothing of...
Stereo matching is a fundamental task in vision applications. we propose an adaptive cross-scale aggregation method for stereo matching, which is introduced by solving an optimization problem. Unlike the original approach which introduces the same regularization term based on the inter-scale regularizer parameter to control the cost consistency among the multi-scales for all regions of the input images...
Human re-identification is an important component in many application domains especially the automatic surveillance system. This paper proposes a robust method to re-identify persons using their face shapes based on the Active Shape Model (ASM) and the Procrustes Shape Analysis (PSA). The ASM-based technique is used to extract landmark points of each face image, as the feature. Then, the Procrustes...
Identification of the correct medicinal plants that goes in to the preparation of a medicine is very important in ayurvedic medicinal industry. The main features required to identify a medicinal plant is its leaf shape, colour and texture. Colour and texture from both sides of the leaf contain deterministic parameters to identify the species. This paper explores feature vectors from both the front...
To solve the problem of training rate decline in neural network caused by too much noise in the traditional image, a new method of expression recognition based on CNN was proposed. First, in order to narrow the face range, face image could be detected from the original image by using the AdaBoost cascade classifier. Then, the coordinates of the eye, mouth and other key parts and brow, nasolabial and...
An open question in facial landmark localization in video is whether one should perform tracking or tracking-by-detection (i.e. face alignment). Tracking produces fittings of high accuracy but is prone to drifting. Tracking-by-detection is drift-free but results in low accuracy fittings. To provide a solution to this problem, we describe the very first, to the best of our knowledge, synergistic approach...
We propose a robust hand pose estimation method by learning hand articulations from depth features and auxiliary modality features. As an additional modality to depth data, we present a function of geometric properties on the surface of the hand described by heat diffusion. The proposed heat distribution descriptor is robust to identify the keypoints on the surface as it incorporates both the local...
We propose a novel measure of visual similarity for image retrieval that incorporates both structural and aesthetic (style) constraints. Our algorithm accepts a query as sketched shape, and a set of one or more contextual images specifying the desired visual aesthetic. A triplet network is used to learn a feature embedding capable of measuring style similarity independent of structure, delivering...
The amount of data produced every day on the internet increases every day and with the increasing popularity of the social networks the number of published photos are huge, and those pictures contain several implicit or explicit brand logos. Detecting this logos in natural images can provide information about how widespread is a brand, discover unwanted copyright distribution, analyze marketing campaigns,...
Hand Gesture Recognition is completed on top-view hand images observed by a Time of Flight(ToF) camera in a car. The work attempts to solve two important problems of touchless interactions inside a car. First, low latency identification of the gestures which are unobtrusive for the driver. Second, reducing the labelled data required to train learning based solutions, this is particularly important...
We propose a method for generating caustic images in real time using a deep/convolutional neural network (CNN). To do so, training images are first rendered using photon mapping, and the CNN learns the correspondences between the depth images and caustic images. After learning, the CNN generates a caustic image from a depth image within 55 milliseconds. In addition, the similarity between the generated...
Text segmentation is an important problem in document analysis related applications. We address the problem of classifying connected components of a document image as text or non-text. Inspired from previous works in the literature, besides common size and shape related features extracted from the components, we also consider component images, without and with context information, as inputs of the...
Active shape model is widely used for facial feature localization. Regarding the traditional ASM algorithm can't describe the object shape precisely, an improved ASM algorithm is proposed. At first, we establish shape model and use PCA (Principle Component Analysis) to transform high-dimensional data to lower dimensions. Another work is to establish local texture model giving sample points with different...
The frequent occurrence of road congestion and traffic accidents has affected people's travel efficiency and travel safety. Traffic sign recognition has become one of the key research objects in intelligent transportation system. This paper studies the identification of road traffic signs based on video images. First of all, collected image will be image preprocessing with image reduction, brightness...
In this paper we present a skeleton-free Kinect system to estimate body mass index (BMI) of human bodies. Unlike other systems in the literature, the proposed system does not require a scale to measure the weight. The weight of observed subjects are estimated using body surface area (BSA) regression. The proposed system employs the state-of-the-art deep residual network to extract meaningful features...
In order to reduce the number of accidents caused by the call when the driver was driving, this paper uses the computer vision technology to dectet the behavior of the driver. Based on the constrained local models (CLM) to detect the characteristic changes of the mouth area, combine the HSV color space and the template matching to detect the hand characteristics to judge whether the driver has the...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.