The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
An open question in facial landmark localization in video is whether one should perform tracking or tracking-by-detection (i.e. face alignment). Tracking produces fittings of high accuracy but is prone to drifting. Tracking-by-detection is drift-free but results in low accuracy fittings. To provide a solution to this problem, we describe the very first, to the best of our knowledge, synergistic approach...
We propose a robust hand pose estimation method by learning hand articulations from depth features and auxiliary modality features. As an additional modality to depth data, we present a function of geometric properties on the surface of the hand described by heat diffusion. The proposed heat distribution descriptor is robust to identify the keypoints on the surface as it incorporates both the local...
We propose a novel measure of visual similarity for image retrieval that incorporates both structural and aesthetic (style) constraints. Our algorithm accepts a query as sketched shape, and a set of one or more contextual images specifying the desired visual aesthetic. A triplet network is used to learn a feature embedding capable of measuring style similarity independent of structure, delivering...
The amount of data produced every day on the internet increases every day and with the increasing popularity of the social networks the number of published photos are huge, and those pictures contain several implicit or explicit brand logos. Detecting this logos in natural images can provide information about how widespread is a brand, discover unwanted copyright distribution, analyze marketing campaigns,...
Hand Gesture Recognition is completed on top-view hand images observed by a Time of Flight(ToF) camera in a car. The work attempts to solve two important problems of touchless interactions inside a car. First, low latency identification of the gestures which are unobtrusive for the driver. Second, reducing the labelled data required to train learning based solutions, this is particularly important...
We propose a method for generating caustic images in real time using a deep/convolutional neural network (CNN). To do so, training images are first rendered using photon mapping, and the CNN learns the correspondences between the depth images and caustic images. After learning, the CNN generates a caustic image from a depth image within 55 milliseconds. In addition, the similarity between the generated...
Text segmentation is an important problem in document analysis related applications. We address the problem of classifying connected components of a document image as text or non-text. Inspired from previous works in the literature, besides common size and shape related features extracted from the components, we also consider component images, without and with context information, as inputs of the...
Active shape model is widely used for facial feature localization. Regarding the traditional ASM algorithm can't describe the object shape precisely, an improved ASM algorithm is proposed. At first, we establish shape model and use PCA (Principle Component Analysis) to transform high-dimensional data to lower dimensions. Another work is to establish local texture model giving sample points with different...
The frequent occurrence of road congestion and traffic accidents has affected people's travel efficiency and travel safety. Traffic sign recognition has become one of the key research objects in intelligent transportation system. This paper studies the identification of road traffic signs based on video images. First of all, collected image will be image preprocessing with image reduction, brightness...
In this paper we present a skeleton-free Kinect system to estimate body mass index (BMI) of human bodies. Unlike other systems in the literature, the proposed system does not require a scale to measure the weight. The weight of observed subjects are estimated using body surface area (BSA) regression. The proposed system employs the state-of-the-art deep residual network to extract meaningful features...
In order to reduce the number of accidents caused by the call when the driver was driving, this paper uses the computer vision technology to dectet the behavior of the driver. Based on the constrained local models (CLM) to detect the characteristic changes of the mouth area, combine the HSV color space and the template matching to detect the hand characteristics to judge whether the driver has the...
The success of various applications including robotics, digital content creation, and visualization demand a structured and abstract representation of the 3D world from limited sensor data. Inspired by the nature of human perception of 3D shapes as a collection of simple parts, we explore such an abstract shape representation based on primitives. Given a single depth image of an object, we present...
This paper presents improvements in terms of accuracy for shape object classification using a new low complexity method compared to previous implementation [1]. The method is using echoes generated by a JAVA platform capable of emulate sound propagation in a controlled 2D virtual environment [2][3]. Echoes originate from the ultrasonic waves generated inside a virtual environment which contains geometrical...
Body surface area is an important measure in many clinical trials. It is a critical parameter that is used in estimating radiation and substance doses for human trials. Traditionally, these trials relied on skin-fold tests which are very invasive and uncomfortable to the subjects. In this paper we present a skeleton-free Kinect system to estimate body surface area of human bodies. The proposed system...
Gender recognition from face images is a challenging problem with applications in various knowledge domains, such as biometrics, security and surveillance, human-computer interaction, among others. In this work, we propose and evaluate a novel method for gender recognition based on a geometric descriptor constructed from a pre-defined face shape model. The proposed approach, tested on four different...
This paper presents a multiple classifier system (MCS) to identify plants species based on the texture and shape features extracted from leaf images. A diverse pool of SVM and Neural Network classifiers is trained on four different feature sets, namely, Local Binary Pattern (LBP), Histogram of Gradients (HOG), Speed of Robust Features (SURF) and Zernike Moments (ZM). Then, a static classifier selection...
This paper deals with classification algorithms as one of the basic principles of pattern recognition. We analyze their effect to a feature space and compare the type and the shape of the separating and decision surface, respectively. We proposed a novel classification approach based on Cumulative Fuzzy Membership Function that creates a decision surface in a different way as an MF ARTMAP neural network...
Pattern classification in electroencephalography (EEG) signals is an important problem in biomedical engineering since it enables the detection of brain activity, in particular the early detection of epileptic seizures. In this paper we propose a k-nearest neighbors classification for epileptic EEG signals based on an t-location-scale statistical representation to detect spike-and-waves. The proposed...
Person re-identification in public areas (such as airports, train stations and shopping malls) has recently received increased attention within computer vision research due, in part, to the demand for enhanced levels of security. Re-identifying subjects within non-overlapped camera networks can be considered as a challenging task. Illumination changes in different scenes, variations in camera resolutions,...
This paper describes the development of an encountered-type haptic interface that can generate the physical characteristics, such as shape and rigidity, of three-dimensional (3D) virtual objects using an array of newly developed non-expandable balloons. To alter the rigidity of each non-expandable balloon, the volume of air in it is controlled through a linear actuator and a pressure sensor based...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.