Search results

chapter

Learning an AUV docking maneuver with a convolutional neural network

Albert Sans-Muntadas, Kristin Y. Pettersen, Edmund Brekke, Eleni Kelasidi

OCEANS 2017 – Anchorage > 1 - 5

This paper proposes and implements a convolutional neural network (CNN) that maps images from a camera to an error signal to guide and control an autonomous underwater vehicle into the entrance of a docking station. The paper proposes to use an external positioning system synchronized with the vehicle to obtain a dataset of images matched with the position and orientation of the vehicle. By using...

chapter

End-to-End Correspondence and Relationship Learning of Mid-Level Deep Features for Person Re-Identification

Shan Lin, Chang-Tsun Li

2017 International Conference on Digital Image Computing: Techniques and Applications (DICTA) > 1 - 6

In this paper, a unified deep convolutional architecture is proposed to address the problems in the person re-identification task. The proposed method adaptively learns the discriminative deep mid-level features of a person and constructs the correspondence features between an image pair in a data-driven manner. The previous Siamese structure deep learning approaches focus only on pair-wise matching...

chapter

Seam tracking and welding bead geometry analysis for autonomous welding robot

Luciane B. Soares, Atila A. Weis, Ricardo N. Rodrigues, Paulo L. J. Drews, more

2017 Latin American Robotics Symposium (LARS) and 2017 Brazilian Symposium on Robotics (SBR) > 1 - 6

Welding is a process known for laborious work and the hazardous environment in which it takes place, but it is an important process in many industrial settings, such as the shipbuilding industry. The use of robots has been increasing in recent years, reducing the human intervention necessary for the process. This paper proposes a system for automated seam tracking and a geometric welding bead analysis...

chapter

Automated segmentation of gingival diseases from oral images

Aman Rana, Gregory Yauney, Lawrence C. Wong, Otkrist Gupta, more

2017 IEEE Healthcare Innovations and Point of Care Technologies (HI-POCT) > 144 - 147

Periodontal diseases are the largest cause of tooth loss among people of all ages and are also correlated with systemic diseases such as endocarditis. Advanced periodontal disease comprises degradation of surrounding tooth structures, severe inflammation and gingival bleeding. Inflammation is an early indicator of periodontal disease. Early detection and preventive measures can help prevent serious...

chapter

A mobile robot platform for supervised machine learning applications

Frazer K. Noble

2017 24th International Conference on Mechatronics and Machine Vision in Practice (M2VIP) > 1 - 6

In supervised machine learning applications, a data set of training and validation features and labels is required to train a neural network. In this paper, we present a remote-controlled, mobile robot and describe software used to generate a data set for vision-based, supervised machine learning applications. We present results from an experiment, which validates the developed platform, and also...

chapter

Viewpoint Invariant RGB-D Human Action Recognition

Jain Liu, Naveed Akhtar, Ajmal Mian

2017 International Conference on Digital Image Computing: Techniques and Applications (DICTA) > 1 - 8

Viewpoint variation is a major challenge in video-based human action recognition. We exploit the simultaneous RGB and Depth sensing of RGB-D cameras to address this problem. Our technique capitalizes on the complementary spatio-temporal information in RGB and Depth frames of the RGB-D videos to achieve viewpoint-invariant action recognition. We extract view-invariant features from the dense trajectories...

chapter

Recent Advances of Deep Learning for Sign Language Recognition

Lihong Zheng, Bin Liang, Ailian Jiang

2017 International Conference on Digital Image Computing: Techniques and Applications (DICTA) > 1 - 7

To assist the social interaction of deaf and hearing-impaired people, efficient interactive communication tools are needed. With growing research interest in action and gesture recognition in recent years, many successful applications for sign language recognition incorporate new types of sensors, including low-cost depth cameras, and advanced machine learning technologies. In this paper, we present...

chapter

Closed and Open-World Person Re-Identification and Verification

Solene Chan-Lang, Quoc-Cuong Pham, Catherine Achard

2017 International Conference on Digital Image Computing: Techniques and Applications (DICTA) > 1 - 8

In recent years, remarkable breakthroughs have been achieved in person re-identification (Re-ID). However, most methods are only tested in the closed-world setting, where the probe person is assumed to be one of the gallery people. In this paper, we tackle a more realistic problem, open-world Re-ID, which requires determining whether the probe person is among the gallery or not and, if so, who that person is...

chapter

Dynamic texture using deep learning

Rishabh Bansal, Arun Singh Pundir, Balasubramanian Raman

TENCON 2017 - 2017 IEEE Region 10 Conference > 2609 - 2614

Identifying objects in a dynamic scene is one of the main problems in computer vision. This is directly related to solving the recognition problem for dynamic textures. Recognizing dynamic textures has become a fundamental problem in understanding natural video content. It is a powerful technique for recognizing natural scenes such as fire, waves and smoke. Existing methods suffer from various problems...

chapter

SurfaceNet: An End-to-End 3D Neural Network for Multiview Stereopsis

Mengqi Ji, Juergen Gall, Haitian Zheng, Yebin Liu, more

2017 IEEE International Conference on Computer Vision (ICCV) > 2326 - 2334

This paper proposes an end-to-end learning framework for multiview stereopsis. We term the network SurfaceNet. It takes a set of images and their corresponding camera parameters as input and directly infers the 3D model. The key advantage of the framework is that both photo-consistency and geometric relations of the surface structure can be directly learned for the purpose of multiview stereopsis...

chapter

Stepwise Metric Promotion for Unsupervised Video Person Re-identification

Zimo Liu, Dong Wang, Huchuan Lu

2017 IEEE International Conference on Computer Vision (ICCV) > 2448 - 2457

The intensive annotation cost and the rich but unlabeled data contained in videos motivate us to propose an unsupervised video-based person re-identification (re-ID) method. We start from two assumptions: 1) different video tracklets typically contain different persons, given that the tracklets are taken at distinct places or with long intervals; 2) within each tracklet, the frames are mostly of the...

chapter

Image-Based Localization Using LSTMs for Structured Feature Correlation

F. Walch, C. Hazirbas, L. Leal-Taixe, T. Sattler, more

2017 IEEE International Conference on Computer Vision (ICCV) > 627 - 637

In this work we propose a new CNN+LSTM architecture for camera pose regression for indoor and outdoor scenes. CNNs allow us to learn suitable feature representations for localization that are robust against motion blur and illumination changes. We make use of LSTM units on the CNN output, which play the role of a structured dimensionality reduction on the feature vector, leading to drastic improvements...

chapter

Cross-View Asymmetric Metric Learning for Unsupervised Person Re-Identification

Hong-Xing Yu, Ancong Wu, Wei-Shi Zheng

2017 IEEE International Conference on Computer Vision (ICCV) > 994 - 1002

While metric learning is important for person re-identification (RE-ID), a significant problem in visual surveillance for cross-view pedestrian matching, existing metric models for RE-ID are mostly based on supervised learning that requires quantities of labeled samples in all pairs of camera views for training. However, this limits their scalability to realistic applications, in which a large amount...

chapter

Learning to Estimate 3D Hand Pose from Single RGB Images

Christian Zimmermann, Thomas Brox

2017 IEEE International Conference on Computer Vision (ICCV) > 4913 - 4921

Low-cost consumer depth cameras and deep learning have enabled reasonable 3D hand pose estimation from single depth images. In this paper, we present an approach that estimates 3D hand pose from regular RGB images. This task has far more ambiguities due to the missing depth information. To this end, we propose a deep network that learns a network-implicit 3D articulation prior. Together with detected...

chapter

Visible-light based gaze tracking with image enhancement pre-processing for wearable eye trackers

Ting-Lun Liu, Chih-Peng Fan

2017 IEEE 6th Global Conference on Consumer Electronics (GCCE) > 1 - 2

In this study, a fast visible-light gaze tracking scheme based on iris ellipse fitting is developed for wearable eye trackers. First, after image enhancement pre-processing of the eye images, a two-level binarization identifies the iris contour, and the candidate points for ellipse fitting are selected from the binarized iris profile. Next, by fast Random Sample Consensus (RANSAC) ellipse fitting,...

chapter

[POSTER] A Probabilistic Combination of CNN and RNN Estimates for Hand Gesture Based Interaction in Car

Aditya Tewari, Bertram Taetz, Frederic Grandidier, Didier Stricker

2017 IEEE International Symposium on Mixed and Augmented Reality (ISMAR-Adjunct) > 1 - 6

Hand gesture recognition is performed on top-view hand images observed by a Time-of-Flight (ToF) camera in a car. The work attempts to solve two important problems of touchless interaction inside a car: first, low-latency identification of gestures that are unobtrusive for the driver; second, reducing the labelled data required to train learning-based solutions, which is particularly important...

chapter

[POSTER] Decision Forest For Efficient and Robust Camera Relocalization

Amine Kacete, Thomas Wentz, Jerome Royan

2017 IEEE International Symposium on Mixed and Augmented Reality (ISMAR-Adjunct) > 20 - 24

To robustly estimate the pose, classical methods make some geometric and temporal assumptions (SfM: Structure from Motion, SLAM: Simultaneous Localization and Mapping). These approaches take a pair of images as input and establish correspondences based on a global strategy (using the whole image information) or a sparse strategy (using key-point features). These correspondences allow solving a set...

chapter

Real-Time Brazilian License Plate Detection and Recognition Using Deep Convolutional Neural Networks

Sergio Montazzolli Silva, Claudio Rosito Jung

2017 30th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI) > 55 - 62

Automatic License Plate Recognition (ALPR) is an important task with many applications in Intelligent Transportation and Surveillance systems. As in other computer vision tasks, Deep Learning (DL) methods have been recently applied in the context of ALPR, focusing on country-specific plates, such as American or European, Chinese, Indian and Korean. However, either they are not a complete DL-ALPR pipeline,...

chapter

A skeleton-free kinect system for body mass index assessment using deep neural networks

D. Nahavandi, A. Abobakr, H. Haggag, M. Hossny, more

2017 IEEE International Systems Engineering Symposium (ISSE) > 1 - 6

In this paper, we present a skeleton-free Kinect system to estimate the body mass index (BMI) of human bodies. Unlike other systems in the literature, the proposed system does not require a scale to measure the weight. The weight of observed subjects is estimated using body surface area (BSA) regression. The proposed system employs the state-of-the-art deep residual network to extract meaningful features...

chapter

Neural network for the detection of misplaced and missing regions in images

Jin Siang Tan, Rosmiwati Mohd-Mokhtar

2017 IEEE 2nd International Conference on Automatic Control and Intelligent Systems (I2CACIS) > 134 - 139

This paper presents a neural-network-based approach for the detection of misplaced and missing regions in images. The main objective of this project is to develop an intelligent system that can identify a misplaced or missing region of a tested image. The system can be used to detect misplaced and missing components of printed circuit boards during the manufacturing process. Jigsaw puzzle pieces can...

INFONA - science communication portal
