The Infona portal uses cookies, i.e. strings of text saved by a browser on the user's device. The portal can access those files and use them to remember the user's data, such as their chosen settings (screen view, interface language, etc.), or their login data. By using the Infona portal the user accepts automatic saving and using this information for portal operation purposes. More information on the subject can be found in the Privacy Policy and Terms of Service. By closing this window the user confirms that they have read the information on cookie usage, and they accept the privacy policy and the way cookies are used by the portal. You can change the cookie settings in your browser.
Visual Sonification is the process of converting visual properties of objects into sound signals. This paper describes the Michigan Visual Sonification System (MVSS) that utilizes this process to assist the visually impaired in distinguishing different objects in their surroundings. MVSS uses depth information to first segment and localize salient objects and then represents an object's appearance...
Ability to effectively and efficiently detect planes is very useful for mobile robot navigation. This is because plane structures are abundant in man-made environment. Existing methods for plane detection are derived for different kinds of input data. In this paper, we propose a plane detection method for image sequences from Kinect. The algorithm takes into account the limitation of the Kinect's...
Clonal Selection Algorithm (CLONALG) and Particle Swarm Optimization (PSO) have been applied for wide spectrum of computer vision problems. However, their applications to 3D object recognition receive only little attention. In this paper, CLONALG and PSO algorithms for recognition of 3D object are discussed. Instead of using any predefined model to extract the geometrical information, the 3D object...
We propose a joint learning method for object classification and localization using 3D color texture features and geometry-based segmentation from weakly-labeled 3D color datasets. Recently, new consumer cameras such as Microsoft's Kinect produce not only color images but also depth images. These reduce the difficulty of object detection dramatically for the following reasons: (a) reasonable candidates...
Effective robotic interaction with household objects requires the ability to recognize both object instances and object categories. The former are often characterized by locally discriminative texture cues (e.g., instances with prominent brand names and logos), and the latter by salient global shape properties (plates, bowls, pots). We describe experiments with both types of cues, combining a template-and-deformable-parts...
In this paper, we propose a new method for three dimensional rotation-free recognition of characters in scene. In the proposed method, we employ the Modified Quadratic Discriminant Function (MQDF) classifier trained with samples generated by three-dimensional rotation process in a computer. We assume that when recognizing individual characters, considering three-dimensional rotation can approximately...
Current standard quantitative 3D spectral-domain optical coherence tomography (SD-OCT) analyses of various ocular diseases is limited in detecting structural damage at early pathologic stages. This is mostly because only a small fraction of the 3D data is used in the current method of quantifying the structure of interest. This paper presents a novel SD-OCT data analysis technique, taking full advantage...
In this paper we propose a Multi-frame Marked Point Process model for automatic target detection and tracking in Inverse Synthetic Aperture Radar (ISAR) image sequences. For purposes of dealing with high ISAR noise, we obtain the optimal target sequence by an energy minimization process, which simultaneously considers the observed image data and prior geometric interaction constraints between the...
The paper presents an active vision system for human posture recognition, which is an important function of any assisted living system, suitable to be employed in indoor environments. Both hardware and software architectures are defined in order to meet constraints typically imposed by AAL (Ambient Assisted Living) contexts such as compactness, low-power consumption, installation simplicity, privacy...
We propose a new methodology to detect parts of interest inside of complex objects using multiple X-ray views. Our method consists of two steps: ‘structure estimation’, to obtain a geometric model of the multiple views from the object itself, and ‘parts detection’, to detect the object parts of interest. The geometric model is estimated by a bundle adjustment algorithm on stable SIFT keypoints across...
In order to construct the city three dimensional (3D) building model, building contours must be extracted in fast and accurately. Traditional engineering survey and photogrammetry method is low efficient to do this. The Light Detect and Ranging (LiDAR), so called Airborne Laser Scanning, works on the same principle as radar except using light waves instead of radio waves. It can measure distance and...
Stereo video object segmentation is a critical technology of the new generation of video coding, video retrieval, Internet and other emerging interactive multimedia field. This paper puts forward a redundant wavelet transform based stereo video object segmentation algorithm. First, the algorithm obtains the disparity map by redundant wavelet transform and then uses the disparity map to do video object...
This paper presents a novel approach for the generation of 3D building model from IKONOS satellite image data. The main idea of 3D modeling is based on the grouping of 3D line segments. The divergence-based centroid neural network is employed in the grouping process. Prior to the grouping process, 3D line segments are extracted with the aid of the elevation information obtained by using area-based...
Stereo vision refers to the ability to infer information on the 3D structure of scene from two or more images taken from different viewpoints. This paper describes procedure for depth map creating using rectified stereo images and segmentation algorithm belief propagation (BP). Very necessary steps to creating depth map are camera calibration and image rectification of the image pairs. Calibration...
Recently, there are many autonomous navigation applications done in outdoor environment. However, safe navigation is still a daunting challenge in terrain containing vegetation. Thus, a study on vegetation detection for outdoor automobile navigation is investigated in this work. At the early state of our research, we focused on the segmentation of LADAR data into two classes by using local three-dimensional...
In this paper we propose a method that exploits 3D motion-based features between frames of 3D facial geometry sequences for dynamic facial expression recognition. An expressive sequence is modeled to contain an onset followed by an apex and an offset. Feature selection methods are applied in order to extract features for each of the onset and offset segments of the expression. These features are then...
We present a method for improving human segmentation results in calibrated, multi-view environments using features derived from both pixel (image) and voxel (volume) space. The main focus of this work is to develop a low-cost, vision-based system for passive activity monitoring of older adults in the home, to capture early signs of illness and functional decline and allow seniors to live independently...
One of the most remarkable facts of the human visual system is that it rapidly and accurately understands the characteristics of the complex visual world - the relative depth with respect to different objects in the scene, occluded objects in the scene, etc. due to prior experience and knowledge about the scene. The various types of tasks related to understanding what we see in a visual scene is called...
This study presents a computer-aided detection (CADe) system of hepatocellular carcinoma (HCC) using sequential forward floating selection (SFFS) method with linear discriminant analysis (LDA). We extracted morphologic and texture features from the segmented HCC candidate regions from the arterial phase (AP) images of the contrast-enhanced hepatic CT images. To select the most discriminatory features...
Network structures formed by actin filaments are present in many kinds of fluorescence microscopy images. In order to quantify the conformations and dynamics of such actin filaments, we propose a fully automated method to extract actin networks from images and analyze network topology. The method handles well intersecting filaments and, to some extent, overlapping filaments. First we automatically...
Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.