Man-machine dialogue is very difficult to put in place, but it remains an important issue for helping people with speech problems. To this end, we have oriented our work around a control interface that includes multiple communication tools: videoconferencing between a doctor and a patient, using a camera via Skype, through which communication can be established and by which the doctor can make an initial...
With the fast development of Geographic Information Systems, visual global localization has gained a lot of attention due to the low price of cameras and its practical implications. In this paper, we leverage Google Street View and a monocular camera to develop refined and continuous positioning in urban environments: namely, topological visual place recognition followed by 6 DoF pose estimation...
When customers visit a bookstore and find a book that interests them, they want to know its content, because they need to know whether the book contains the information they need. In reality, however, almost all of the books sold in bookstores are sealed. This condition usually makes customers cancel their purchase of the book. Even if there is any information...
We present a novel pipeline for augmenting a 3D eyeglass mesh onto a person's face. While doing so, we take care of the proper fit of the glasses in terms of pupillary distance, which is computed automatically and is therefore more user-friendly than standard marker-based approaches. Our method also performs rigid eyeglass temple correction during augmentation, followed by tracking, to present realistic rendering...
In this paper, a complete indoor navigation assistance system for visually impaired people is introduced. Different multimedia technologies are integrated in a single system in order to provide a precise, safe and friendly navigation service. First, the environment is modeled and represented. After that, the user location is determined by combining Wi-Fi and vision information. This combination offers...
We present a publicly available benchmark database for the problem of hand posture recognition from noisy depth data and fused RGB-D data obtained from low-cost time-of-flight (ToF) sensors. The database is the most extensive database of this kind containing over a million data samples (point clouds) recorded from 35 different individuals for ten different static hand postures. This captures a great...
In this paper, we analyze some of our real-world deployments of face recognition (FR) systems for various applications and discuss the gaps between the expectations of the user and what the system can deliver. We evaluate some of the existing algorithms with modifications for applications including FR on wearable devices (like Google Glass) for improving social interactions, monitoring of elderly people...
This paper presents metric global localization in urban environments using only a monocular camera and the Google Street View database. We fully leverage the abundant sources from Street View and benefit from its topo-metric structure to build a coarse-to-fine positioning, namely a topological place recognition process followed by a metric pose estimation by local bundle adjustment. Our method...
Light fields captured by a new generation of digital photo cameras are processed in order to render different representations of the same scene. Evaluating the quality of experience in light field imaging applications requires publicly available databases in order to allow the research community to fairly compare the obtained results. This paper presents an open access database of light field images...
In this paper, we present MO-SLAM, a novel visual SLAM system that is capable of detecting duplicate objects in the scene during run-time without requiring an offline training stage to pre-populate a database of objects. Instead, we propose a novel method to detect landmarks that belong to duplicate objects. Further, we show how landmarks belonging to duplicate objects can be converted to first-order...
Spatial affordance can be defined as the functionality a space, or place, lends to human activity. Different places afford different activity possibilities - sleeping is mostly done in the bedroom, and cooking is mostly done in the kitchen. Semantic place labels like kitchen and bedroom, therefore, provide context with which a robot can better infer human activity. Real rooms, however, often defy...
Image-Based Rendering (IBR) allows good-quality free-viewpoint navigation in urban scenes, but suffers from artifacts on poorly reconstructed objects, e.g., reflective surfaces such as cars. To alleviate this problem, we propose a method that automatically identifies stock 3D models, aligns them in the 3D scene and performs morphing to better capture image contours. We do this by first adapting learning-based...
Visual localization is the process of finding the location of a camera from the appearance of the images it captures. In this work, we propose an observation model that allows the use of images for particle filter localization. To achieve this, we exploit the capabilities of Gaussian Processes to calculate the likelihood of the observation for any given pose, in contrast to methods which restrict...
It has been shown that local magnetic field (MF) anomalies can be used in accurate global self-localisation with fingerprinting. However, MF anomalies can only affect limited areas, and the low discernibility of received local MF signals may result in many positions having the same MF-Location information in areas far away from disturbances. This is mainly due to the sensitivity limitations of sensors...
Periocular characteristics are being used as a supplementary feature for biometric systems employing iris characteristics, to mitigate the effects of a noisy iris on authentication performance. Along the same lines, ocular characteristics are also used to enhance the performance of face-based systems under the impact of pose, expression and illumination. However, the iris and face systems are operated...
This study validates the potential of a tabletop system to enhance the quality and intensity of students' argumentation when engaging in co-located collaborative design activities. Twenty-four undergraduate students participated in a between-subjects design where one group used the proposed system and the other group used a paper-based approach. Overall, students using the tabletop system over exceeded...
Biometric recognition of an individual can be based on physiological characteristics such as the face, palmprint, vein, etc., and on behavioural characteristics, which include gait, keystroke, mouse dynamics, etc. Among the various biometric modalities, wrist vein recognition is known for high accuracy, stability and resistance to spoofing. The success of wrist vein biometrics strongly correlates...
More and more pedestrians own devices (such as smartphones) that integrate a wide array of low-cost sensors (camera, IMU, magnetometer and GNSS receiver). GNSS is usually used for pedestrian localization in urban environments, but the signal suffers from an inaccuracy of several meters. In order to achieve more accurate localization and improve pedestrian navigation and urban mobility, we present a method for...
The paper presents a thorough evaluation of two representative visual place recognition algorithms that can be applied to the problem of indoor localization of a person equipped with a modern smartphone. The evaluation focuses on comparing two different state-of-the-art approaches: single image-based place recognition, represented by the FAB-MAP algorithm, and recognition based on a sequence of images,...
The eye is the main sensing organ of the human body, so it can contribute much to user interfaces. Eye tracking has been widely used in human-computer interaction. Real-time face image capture combined with eye gaze tracking can provide a means of user input to the computer. Here we combine an efficient method for tracking the eye and detecting blinks to produce an environment by which physically disabled persons...