Search results

chapter

3DCNN-DQN-RNN: A Deep Reinforcement Learning Framework for Semantic Parsing of Large-Scale 3D Point Clouds

Fangyu Liu, Shuaipeng Li, Liqiang Zhang, Chenghu Zhou, more

2017 IEEE International Conference on Computer Vision (ICCV) > 5679 - 5688

2017 IEEE International Conference on Computer Vision (ICCV)

Semantic parsing of large-scale 3D point clouds is an important research topic in computer vision and remote sensing fields. Most existing approaches utilize hand-crafted features for each modality independently and combine them in a heuristic manner. They often fail to consider the consistency and complementary information among features adequately, which makes them difficult to capture high-level...

chapter

Image2song: Song Retrieval via Bridging Image Content and Lyric Words

Xuelong Li, Di Hu, Xiaoqiang Lu

2017 IEEE International Conference on Computer Vision (ICCV) > 5650 - 5659

2017 IEEE International Conference on Computer Vision (ICCV)

Image is usually taken for expressing some kinds of emotions or purposes, such as love, celebrating Christmas. There is another better way that combines the image and relevant song to amplify the expression, which has drawn much attention in the social network recently. Hence, the automatic selection of songs should be expected. In this paper, we propose to retrieve semantic relevant songs just by...

chapter

Makeup-Go: Blind Reversion of Portrait Edit

Ying-Cong Chen, Xiaoyong Shen, Jiaya Jia

2017 IEEE International Conference on Computer Vision (ICCV) > 4511 - 4519

2017 IEEE International Conference on Computer Vision (ICCV)

Virtual face beautification (or markup) becomes common operations in camera or image processing Apps, which is actually deceiving. In this paper, we propose the task of restoring a portrait image from this process. As the first attempt along this line, we assume unknown global operations on human faces and aim to tackle the two issues of skin smoothing and skin color change. These two tasks, intriguingly,...

chapter

Learning to Disambiguate by Asking Discriminative Questions

Yining Li, Chen Huang, Xiaoou Tang, Chen Change Loy

2017 IEEE International Conference on Computer Vision (ICCV) > 3439 - 3448

2017 IEEE International Conference on Computer Vision (ICCV)

The ability to ask questions is a powerful tool to gather information in order to learn about the world and resolve ambiguities. In this paper, we explore a novel problem of generating discriminative questions to help disambiguate visual instances. Our work can be seen as a complement and new extension to the rich research studies on image captioning and question answering. We introduce the first...

chapter

Detailed Surface Geometry and Albedo Recovery from RGB-D Video under Natural Illumination

Xinxin Zuo, Sen Wang, Jiangbin Zheng, Ruigang Yang

2017 IEEE International Conference on Computer Vision (ICCV) > 3152 - 3161

2017 IEEE International Conference on Computer Vision (ICCV)

In this paper we present a novel approach for depth map enhancement from an RGB-D video sequence. The basic idea is to exploit the photometric information in the color sequence. Instead of making any assumption about surface albedo or controlled object motion and lighting, we use the lighting variations introduced by casual object movement. We are effectively calculating photometric stereo from a...

chapter

Understanding Low- and High-Level Contributions to Fixation Prediction

Matthias Kummerer, Thomas S.A. Wallis, Leon A. Gatys, Matthias Bethge

2017 IEEE International Conference on Computer Vision (ICCV) > 4799 - 4808

2017 IEEE International Conference on Computer Vision (ICCV)

Understanding where people look in images is an important problem in computer vision. Despite significant research, it remains unclear to what extent human fixations can be predicted by low-level (contrast) compared to highlevel (presence of objects) image features. Here we address this problem by introducing two novel models that use different feature spaces but the same readout architecture. The...

chapter

A Two Stream Siamese Convolutional Neural Network for Person Re-identification

Dahjung Chung, Khalid Tahboub, Edward J. Delp

2017 IEEE International Conference on Computer Vision (ICCV) > 1992 - 2000

2017 IEEE International Conference on Computer Vision (ICCV)

Person re-identification is an important task in video surveillance systems. It can be formally defined as establishing the correspondence between images of a person taken from different cameras at different times. In this paper, we present a two stream convolutional neural network where each stream is a Siamese network. This architecture can learn spatial and temporal information separately. We also...

chapter

View Adaptive Recurrent Neural Networks for High Performance Human Action Recognition from Skeleton Data

Pengfei Zhang, Cuiling Lan, Junliang Xing, Wenjun Zeng, more

2017 IEEE International Conference on Computer Vision (ICCV) > 2136 - 2145

2017 IEEE International Conference on Computer Vision (ICCV)

Skeleton-based human action recognition has recently attracted increasing attention due to the popularity of 3D skeleton data. One main challenge lies in the large view variations in captured human actions. We propose a novel view adaptation scheme to automatically regulate observation viewpoints during the occurrence of an action. Rather than re-positioning the skeletons based on a human defined...

chapter

Video Fill In the Blank Using LR/RL LSTMs with Spatial-Temporal Attentions

Amir Mazaheri, Dong Zhang, Mubarak Shah

2017 IEEE International Conference on Computer Vision (ICCV) > 1416 - 1425

2017 IEEE International Conference on Computer Vision (ICCV)

Given a video and a description sentence with one missing word, “source sentence”, Video-Fill-In-the-Blank (VFIB) problem is to find the missing word automatically. The contextual information of the sentence, as well as visual cues from the video, are important to infer the missing word accurately. Since the source sentence is broken into two fragments: the sentence’s left fragment (before the blank)...

chapter

Super-Trajectory for Video Segmentation

Wenguan Wang, Jianbing Shen, Jianwen Xie, Fatih Porikli

2017 IEEE International Conference on Computer Vision (ICCV) > 1680 - 1688

2017 IEEE International Conference on Computer Vision (ICCV)

We introduce a novel semi-supervised video segmentation approach based on an efficient video representation, called as “super-trajectory”. Each super-trajectory corresponds to a group of compact trajectories that exhibit consistent motion patterns, similar appearance and close spatiotemporal relationships. We generate trajectories using a probabilistic model, which handles occlusions and drifts in...

chapter

Color constancy method based on local chromaticity distribution and illuminant influence for hue angle

Ji-Hoon Yoo, Yeong-Ho Ha, Shibudas Kattakkalil Subhashdas

2017 IEEE 6th Global Conference on Consumer Electronics (GCCE) > 1 - 5

2017 IEEE 6th Global Conference on Consumer Electronics (GCCE)

This paper proposed robust color constancy method for changing illuminant by using local chromaticity distribution and analysis of illuminant influence for each hue angle. First, changing in chromaticity distribution direction for each color with respect to various illuminant is analyzed using principal component analysis. Next, change in standard deviation of chromaticity distribution with respect...

chapter

Conformal mapping applied to encoding and decoding of images

Alan H. F. Silva, Uyara F. Silva, Wesley P. Calixto, Alana S. Magalhaes, more

2017 CHILEAN Conference on Electrical, Electronics Engineering, Information and Communication Technologies (CHILECON) > 1 - 5

2017 CHILEAN Conference on Electrical, Electronics Engineering, Information and Communication Technologies (CHILECON)

This work presents images encoding and decoding using the theory of conformal mapping. The conformal mapping theory made changes in the domain of problems without modifying physical characteristics between the domains. Images were utilized and are transported between domains using transformation functions like encrypt keys. Developed method showed to be able to preserve original images characteristics...

chapter

On the Performance of Visual Semantics for Improving Texture-Based Blind Image Quality Assessment

Pedro Garcia Freitas, Mylene Christine Queiroz De Farias

2017 30th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI) > 330 - 337

2017 30th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI)

Blind image quality assessment (BIQA) methods aim to estimate the quality of a given test image without referring to the corresponding reference (original) image. Most BIQA methods use visual sensitivity models, which take into consideration intrinsic image characteristics (e.g. contrast, luminance, and texture) to identify degradations and estimate quality. For example, texture-based BIQA methods...

chapter

Color-Based and Recursive Fiducial Marker for Augmented Reality

Douglas Tybusch, Gilseone Rosa De Moraes, Osmar M. dos Santos, Andrei Piccinini Legg, more

2017 30th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI) > 254 - 261

2017 30th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI)

The popularity of applications using Augmented Reality, especially due to the dissemination of smartphones with high processing power, introduces the need for Fiducial Markers that can be detected quickly, with good accuracy and can deal with partial occlusion. Fiducial Markers can have different shapes, sizes, structure and colors, and are inserted into a scene to facilitate the detection and consequent...

chapter

[POSTER] Lightning Markers: Synchronization-free Single-shot Detection of Imperceptible AR Markers Embedded in a High-Speed Video Display

Tsutomu Kusanagi, Shingo Kagami, Koichi Hashimoto

2017 IEEE International Symposium on Mixed and Augmented Reality (ISMAR-Adjunct) > 229 - 234

2017 IEEE International Symposium on Mixed and Augmented Reality (ISMAR-Adjunct)

This paper proposes a method of embedding AR (augmented reality) markers in a high-speed video sequence so that they are imperceptible to human eyes. The embedded markers appear for very short periods and keep changing their positions at lightning speed. By carefully designing the timings of marker display, a camera with a sufficiently short exposure time running at any frame rate is able to detect...

chapter

[POSTER] Visualizing In-Organ Tumors in Augmented Monocular Laparoscopy

Erol Ozgur, Alexis Lafont, Adrien Bartoli

2017 IEEE International Symposium on Mixed and Augmented Reality (ISMAR-Adjunct) > 46 - 51

2017 IEEE International Symposium on Mixed and Augmented Reality (ISMAR-Adjunct)

One of the important goals of medical augmented reality is to reveal the hidden anatomy, such as a tumor in an organ. However, conveying a hidden tumor's depth to the user effortlessly and precisely is still an unsolved problem. This is especially difficult in monocular laparoscopy. First, the number of available depth cues is in practice limited to only two: occlusion and relative size. Second, exploiting...

chapter

TETRIS: Smartphone-to-Smartphone Screen-Based Visible Light Communication

Matthew Stafford, Adriana Rogers, Shela Wu, Charles Carver, more

2017 IEEE 14th International Conference on Mobile Ad Hoc and Sensor Systems (MASS) > 570 - 574

2017 IEEE 14th International Conference on Mobile Ad Hoc and Sensor Systems (MASS)

With the extensive use of smartphones, technology improving secure communication between smartphones is a growing field of research. As a form of Visible Light Communication, a color video barcode system creates a smartphone-tosmartphone communication channel. This color video barcode system, effectively an evolved form of QR codes, provides a secure alternative to WiFi, Bluetooth, and Near Field...

chapter

Improving Face Detection Performance by Skin Detection Post-Processing

Oeslle Lucena, Italo De P. Oliveira, Luciana Veloso, Eanes Pereira

2017 30th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI) > 300 - 307

2017 30th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI)

Face detection is already incorporated in many biometrics and surveillance applications. Therefore, the reduction of false detections is a priority in those systems. However, face detection is still challenging. Many factors, such as pose variation and complex backgrounds, contribute to false detections. Besides, the fidelity of a true detection, measured by precision rate, is a concern in content-based...

chapter

Diagnosing Leukemia in Blood Smear Images Using an Ensemble of Classifiers and Pre-Trained Convolutional Neural Networks

Luis Henrique Silva Vogado, Rodrigo De Melo Souza Veras, Alan Ribeiro Andrade, Flavio Henrique Duarte De Araujo, more

2017 30th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI) > 367 - 373

2017 30th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI)

Leukemia is a worldwide disease. In this paper we demonstrate that it is possible to build an automated, efficient and rapid leukemia diagnosis system. We demonstrate that it is possible to improve the precision of current techniques from the literature using the description power of well-known Convolutional Neural Networks (CNNs). We extract features from a blood smear image using pre-trained CNNs...

chapter

Single Image Super-Resolution Using Multiple Extreme Learning Machine Regressors

Daniel Luis Cosmo, Fernando Kentaro Inaba, Evandro Ottoni Teatini Salles

2017 30th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI) > 397 - 404

2017 30th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI)

This paper presents a new technique to solve the single image super resolution reconstruction problem based on multiple extreme learning machine regressors, called here MELM. The MELM employs a feature space of low resolution images, divided in subspaces, and one regressor is trained for each one. In the training task, we employ a color dataset containing 91 images, with approximately 5.3 million...

INFONA - science communication portal

Search results

3DCNN-DQN-RNN: A Deep Reinforcement Learning Framework for Semantic Parsing of Large-Scale 3D Point Clouds

Image2song: Song Retrieval via Bridging Image Content and Lyric Words

Makeup-Go: Blind Reversion of Portrait Edit

Learning to Disambiguate by Asking Discriminative Questions

Detailed Surface Geometry and Albedo Recovery from RGB-D Video under Natural Illumination

Understanding Low- and High-Level Contributions to Fixation Prediction

A Two Stream Siamese Convolutional Neural Network for Person Re-identification

View Adaptive Recurrent Neural Networks for High Performance Human Action Recognition from Skeleton Data

Video Fill In the Blank Using LR/RL LSTMs with Spatial-Temporal Attentions

Super-Trajectory for Video Segmentation

Color constancy method based on local chromaticity distribution and illuminant influence for hue angle

Conformal mapping applied to encoding and decoding of images

On the Performance of Visual Semantics for Improving Texture-Based Blind Image Quality Assessment

Color-Based and Recursive Fiducial Marker for Augmented Reality

[POSTER] Lightning Markers: Synchronization-free Single-shot Detection of Imperceptible AR Markers Embedded in a High-Speed Video Display

[POSTER] Visualizing In-Organ Tumors in Augmented Monocular Laparoscopy

TETRIS: Smartphone-to-Smartphone Screen-Based Visible Light Communication

Improving Face Detection Performance by Skin Detection Post-Processing

Diagnosing Leukemia in Blood Smear Images Using an Ensemble of Classifiers and Pre-Trained Convolutional Neural Networks

Single Image Super-Resolution Using Multiple Extreme Learning Machine Regressors

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options