The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Analyzing fish and fish schools behavior can help in studying fish-fish interaction, analyzing characteristics of fish species, studying prey avoidance maneuvers of fish schools, etc. Such analysis requires the estimation of each fish's 3D location, 3D pose, and 3D shape over time. Moreover if we are interested in studying the interaction of fish by injecting visual / acoustic stimuli artificially...
Omnidirectional, also referred to as 360º, visual content provides an immersive experience since it allows users to view a visual scene from different directions. The overall content typically covers a full sphere, and omnidirectional videos or images are processed to obtain a projection on a 2D plane of a fraction of the sphere (aka viewport), which is shown to the user. Therefore, users can look...
In this paper, a unified deep convolutional architecture is proposed to address the problems in the person re-identification task. The proposed method adaptively learns the discriminative deep mid-level features of a person and constructs the correspondence features between an image pair in a data-driven manner. The previous Siamese structure deep learning approaches focus only on pair-wise matching...
This study describes a method for using a camera to automatically recognize the speed limits on speed-limit signs. This method consists of the following three processes: first (1) a method of detecting the speed-limit signs with a machine learning method utilizing the local binary pattern (LBP) feature quantities as information helpful for identification, then (2) an image processing method using...
Rigid structure-from-motion (RSfM) and non-rigid structure-from-motion (NRSfM) have long been treated in the literature as separate (different) problems. Inspired by a previous work which solved directly for 3D scene structure by factoring the relative camera poses out, we revisit the principle of “maximizing rigidity” in structure-from-motion literature, and develop a unified theory which is applicable...
One of the solutions of depth imaging of moving scene is to project a static pattern on the object and use just a single image for reconstruction. However, if the motion of the object is too fast with respect to the exposure time of the image sensor, patterns on the captured image are blurred and reconstruction fails. In this paper, we impose multiple projection patterns into each single captured...
Videos taken in the wild sometimes contain unexpected rain streaks, which brings difficulty in subsequent video processing tasks. Rain streak removal in a video (RSRV) is thus an important issue and has been attracting much attention in computer vision. Different from previous RSRV methods formulating rain streaks as a deterministic message, this work first encodes the rains in a stochastic manner,...
Based on the realization that more than 100 Tokyo children have died after falling from balconies during the last five years, this paper proposes a new system for detecting potentially dangerous situations using red green blue-depth (RGB-D) cameras, and thus aims to help prevent children from falling from balconies. The system, which is based on the results of our investigation into how residents...
Hand Gesture Recognition is completed on top-view hand images observed by a Time of Flight(ToF) camera in a car. The work attempts to solve two important problems of touchless interactions inside a car. First, low latency identification of the gestures which are unobtrusive for the driver. Second, reducing the labelled data required to train learning based solutions, this is particularly important...
Intelligent automobiles and advanced driver assistance systems (ADAS) are some of the major technological developments that affect human daily life. Today, many studies are being generated to develop state of the art transportation systems. The general objective in these studies is to cope with negative effects of traffic. In this work, our aim is to contribute to the development of ADAS by determining...
In this paper we present a skeleton-free Kinect system to estimate body mass index (BMI) of human bodies. Unlike other systems in the literature, the proposed system does not require a scale to measure the weight. The weight of observed subjects are estimated using body surface area (BSA) regression. The proposed system employs the state-of-the-art deep residual network to extract meaningful features...
In this article, the efficiency of Kalman's filter to classify moving parts is shown. A SCARA robot working cell was designed and built to accomplish this task. The main goal is to classify parts that are being carried on a conveyor belt at constant speed according to their shape or color. The operation of the Kalman's filter is described, as well as the mathematical equations to develop the algorithm...
Facial alignment involves finding a set of landmark points on an image with a known semantic meaning. However, this semantic meaning of landmark points is often lost in 2D approaches where landmarks are either moved to visible boundaries or ignored as the pose of the face changes. In order to extract consistent alignment points across large poses, the 3D structure of the face must be considered in...
Our objective is to create a system that enables users to interact with surrounding surfaces by using touch interactions. In this work, we propose a touch detection method that utilizes the shadows of a finger for use with a system featuring an infrared (IR) camera and two IR lights. Since the shape of a finger's shadow varies depending on the distance between the surface and the finger, the system...
Nowadays, the reading and interpretation of antibiogram tests is a frequently performed task by doctors, researchers and technicians at hospitals and laboratories. An antibiogram is a test of the sensitivity of a microorganism to given antibiotics. Reading an interpretation of the antibiogram results is usually performed manually, which leads to human errors and a great waste of time. There are few...
Because visually impaired persons are not able to confirm the appearance of their own face, they are afraid of and uneasy about makeup. We have been developing a system that assists makeup application through verbal feedback according to the appearance of the user's face. The system encourages social communication by helping the user feel confident. In this paper, we introduce a new method of using...
Body surface area is an important measure in many clinical trials. It is a critical parameter that is used in estimating radiation and substance doses for human trials. Traditionally, these trials relied on skin-fold tests which are very invasive and uncomfortable to the subjects. In this paper we present a skeleton-free Kinect system to estimate body surface area of human bodies. The proposed system...
Snow detection and removal from video images is very challenging. Normally the snowflakes affect only on a very small region of an image, hence the confusion to determine which region should be considered and which one should not. In this paper, a frame difference method with five successive frames is first presented to detect the snow pixels from image background, but the method didn't work well...
An emerging problem in computer vision is the reconstruction of 3D shape and pose of an object from a single image. Hitherto, the problem has been addressed through the application of canonical deep learning methods to regress from the image directly to the 3D shape and pose labels. These approaches, however, are problematic from two perspectives. First, they are minimizing the error between 3D shapes...
We propose the use of a light-weight setup consisting of a collocated camera and light source – commonly found on mobile devices – to reconstruct surface normals and spatially-varying BRDFs of near-planar material samples. A collocated setup provides only a 1-D “univariate” sampling of a 3-D isotropic BRDF. We show that a univariate sampling is sufficient to estimate parameters of commonly used analytical...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.