Search results

chapter

Deep Image Retrieval Applied on Kotenseki Ancient Japanese Literature

Chairath Sirirattanapol, Yusuke Matsui, Shin'ichi Satoh, Kuninori Matsuda, more

2017 IEEE International Symposium on Multimedia (ISM) > 495 - 499

2017 IEEE International Symposium on Multimedia (ISM)

Kotenseki is a collection of classical and ancient Japanese literature. It is comprised of image books that express Japanese stories by using comic drawings of different characters, such as humans, nature, and animals. To effectively store them for posterity, a search system is important. We propose an efficient CBIR system to assist the users in easily accessing the information and have an enjoyable...

chapter

Deep affordance learning for single- and multiple-instance object detection

Jian-Gang Wang, Prabhu Shankar Mahendran, Eam-Khwang Teoh

TENCON 2017 - 2017 IEEE Region 10 Conference > 321 - 326

TENCON 2017 - 2017 IEEE Region 10 Conference

Affordance learning in general, is to identify the purpose, use, and ways to interact with an object, based on information gained from observing the object. Most of the existing affordance learning approaches assume the object target has been cropped individually from images. However, the object could not be easily separated from others due to occlusion or noise. Actually, two or more neighboring...

chapter

PPR-FCN: Weakly Supervised Visual Relation Detection via Parallel Pairwise R-FCN

Hanwang Zhang, Zawlin Kyaw, Jinyang Yu, Shih-Fu Chang

2017 IEEE International Conference on Computer Vision (ICCV) > 4243 - 4251

2017 IEEE International Conference on Computer Vision (ICCV)

We aim to tackle a novel vision task called Weakly Supervised Visual Relation Detection (WSVRD) to detect “subject-predicate-object” relations in an image with object relation groundtruths available only at the image level. This is motivated by the fact that it is extremely expensive to label the combinatorial relations between objects at the instance level. Compared to the extensively studied problem,...

chapter

Cut, Paste and Learn: Surprisingly Easy Synthesis for Instance Detection

Debidatta Dwibedi, Ishan Misra, Martial Hebert

2017 IEEE International Conference on Computer Vision (ICCV) > 1310 - 1319

2017 IEEE International Conference on Computer Vision (ICCV)

A major impediment in rapidly deploying object detection models for instance detection is the lack of large annotated datasets. For example, finding a large labeled dataset containing instances in a particular kitchen is unlikely. Each new environment with new instances requires expensive data collection and annotation. In this paper, we propose a simple approach to generate large annotated instance...

chapter

Identification of autonomous landing sign for unmanned aerial vehicle based on faster regions with convolutional neural network

Junjie Chen, Xiren Miao, Hao Jiang, Jing Chen, more

2017 Chinese Automation Congress (CAC) > 2109 - 2114

2017 Chinese Automation Congress (CAC)

In order to realize autonomous landing of the unmanned aerial vehicle (UAV) in power patrolling, a visual method vision based on Faster Regions with Convolutional Neural Network (Faster R-CNN) for UAVs is studied. In this paper, we design the landing sign of the combination of concentric circles and pentagon, and propose the Faster R-CNN recognition algorithm which can be used to identify the target...

chapter

What looks good with my sofa: Multimodal search engine for interior design

Ivona Tautkute, Aleksandra Mozejko, Wojciech Stokowiec, Tomasz Trzcinski, more

2017 Federated Conference on Computer Science and Information Systems (FedCSIS) > 1275 - 1282

2017 Federated Conference on Computer Science and Information Systems (FedCSIS)

In this paper, we propose a multi-modal search engine for interior design that combines visual and textual queries. The goal of our engine is to retrieve interior objects, e.g. furniture or wall clocks, that share visual and aesthetic similarities with the query. Our search engine allows the user to take a photo of a room and retrieve with a high recall a list of items identical or visually similar...

chapter

Dense Captioning with Joint Inference and Visual Context

Linjie Yang, Kevin Tang, Jianchao Yang, Li-Jia Li

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1978 - 1987

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Dense captioning is a newly emerging computer vision topic for understanding images with dense language descriptions. The goal is to densely detect visual concepts (e.g., objects, object parts, and interactions between them) from images, labeling each with a short descriptive phrase. We identify two key challenges of dense captioning that need to be properly addressed when tackling the problem. First,...

chapter

Polyhedral Conic Classifiers for Visual Object Detection and Classification

Hakan Cevikalp, Bill Triggs

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4114 - 4122

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose a family of quasi-linear discriminants that outperform current large-margin methods in sliding window visual object detection and open set recognition tasks. In these tasks the classification problems are both numerically imbalanced – positive (object class) training and test windows are much rarer than negative (non-class) ones – and geometrically asymmetric –...

chapter

Mining Object Parts from CNNs via Active Question-Answering

Quanshi Zhang, Ruiming Cao, Ying Nian Wu, Song-Chun Zhu

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3890 - 3899

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Given a convolutional neural network (CNN) that is pre-trained for object classification, this paper proposes to use active question-answering to semanticize neural patterns in conv-layers of the CNN and mine part concepts. For each part concept, we mine neural patterns in the pre-trained CNN, which are related to the target part, and use these patterns to construct an And-Or graph (AOG) to represent...

chapter

ViP-CNN: Visual Phrase Guided Convolutional Neural Network

Yikang Li, Wanli Ouyang, Xiaogang Wang, Xiao'Ou Tang

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 7244 - 7253

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

As the intermediate level task connecting image captioning and object detection, visual relationship detection started to catch researchers attention because of its descriptive power and clear structure. It detects the objects and captures their pair-wise interactions with a subject-predicate-object triplet, e.g. person-ride-horse. In this paper, each visual relationship is considered as a phrase...

chapter

Weighted Hough voting for multi-view car detection

Tao Xiang, Zuomei Lai, Wensheng Qiao, Tao Li

2017 20th International Conference on Information Fusion (Fusion) > 1 - 7

2017 20th International Conference on Information Fusion (Fusion)

Hough voting based methods for object detection work by means of allowing local image patches to vote for the center of the object according to the trained visual words. They are effective for object with small local varieties, but incapable of solving multi-view detection problem. The traditional way is training visual words for each subcategory that has similar view. However, limited training data...

chapter

Learn local priors by transferring training masks for salient object detection

Dan Wang, Canxiang Yan, Quan Zhou

2017 IEEE International Conference on Multimedia and Expo (ICME) > 1141 - 1146

2017 IEEE International Conference on Multimedia and Expo (ICME)

In this paper, we present a novel framework to incorporate high-level guidance and low-level features to automatically identify salient objects based on two ideas. The first one considers the specific location prior to encode visual saliency, while the second one estimates image saliency using contrast with respect to background regions. The proposed framework consists of the following three steps:...

chapter

An intelligent mannequin based system with real-time view of regional ophthalmic blocks

Nimal J. Kumar, Boby George, Mohanasankar Sivaprakasam

2017 IEEE International Instrumentation and Measurement Technology Conference (I2MTC) > 1 - 6

2017 IEEE International Instrumentation and Measurement Technology Conference (I2MTC)

In this paper, an ophthalmic anesthetic training system with two cameras integrated in it to provide a real-time visual feedback to the trainee is presented. The mannequin developed uses anatomically accurate ocular structures and the trainee is able to see the needle and ocular structures in real-time on a monitor, during the training. Other than the mannequin with integrated camera system, a virtual...

chapter

Automatic Privacy Prediction to Accelerate Social Image Sharing

Zhenzhong Kuang, Zongmin Li, Dan Lin, Jianping Fan

2017 IEEE Third International Conference on Multimedia Big Data (BigMM) > 197 - 200

2017 IEEE Third International Conference on Multimedia Big Data (BigMM)

The manual process for privacy setting could be very time-consuming and challenging for common users. By assuming that there are hidden correlations between the visual properties of images (i.e., visual features) or object classes and the privacy settings for image sharing, an effective algorithm is developed in this paper to achieve automatic prediction of image privacy, so that the best-matching...

chapter

Real-Time Target Detection and Recognition with Deep Convolutional Networks for Intelligent Visual Surveillance

Wen Xu, Jing He, Hao Lan Zhang, Bo Mao, more

2016 IEEE/ACM 9th International Conference on Utility and Cloud Computing (UCC) > 321 - 326

2016 IEEE/ACM 9th International Conference on Utility and Cloud Computing (UCC)

Moving target detection and tracking, recognition, behaviours analysis are the key issues in the intelligent visual surveillance system (IVSS). The challenge is how to process the real-time video stream in an effective way in case that we could find the interested objects for analysis. However, the traditional video surveillance technology often does not meet the needs of real-time key frame recognition...

chapter

Learn How to Choose: Independent Detectors Versus Composite Visual Phrases

Guy Rosenthal, Ariel Shamir, Leonid Sigal

2017 IEEE Winter Conference on Applications of Computer Vision (WACV) > 382 - 390

2017 IEEE Winter Conference on Applications of Computer Vision (WACV)

Most approaches for scene parsing, recognition or retrieval use detectors that are either (i) independently trained or (ii) jointly trained for conjunctions of object-object or object-attribute phrases. We posit that neither of these two extremes is uniformly optimal, in terms of performance, across all categories and conjunctions. The choice of whether one should train an independent or composite...

chapter

A systemic approach to automatic metadata extraction from multimedia content

Christos Varytimidis, Georgios Tsatiris, Konstantinos Rapantzikos, Stefanos Kollias

2016 IEEE Symposium Series on Computational Intelligence (SSCI) > 1 - 7

2016 IEEE Symposium Series on Computational Intelligence (SSCI)

There is a need for automatic processing and extracting of meaningful metadata from multimedia information, especially in the audiovisual industry. This higher level information is used in a variety of practices, such as enriching multimedia content with external links, clickable objects and useful related information in general. This paper presents a system for efficient multimedia content analysis...

chapter

MedianStruck for long-term tracking applications

Florian Baumann, Enes Dayangac, Josep Aulinas, Matthias Zobel

2016 Sixth International Conference on Image Processing Theory, Tools and Applications (IPTA) > 1 - 6

2016 Sixth International Conference on Image Processing Theory, Tools and Applications (IPTA)

In this paper, we propose a mutual framework that combines two state-of-the-art visual object tracking algorithms. Both trackers benefit from each other's advantage leading to an efficient visual tracking approach. Many state-of-the-art trackers have poor performance due to rain, fog or occlusion in real-world scenarios. Often, after several frames, objects are getting lost, only leading to a short-term...

chapter

Visualization of object image database by using logistic discriminant analysis

Kenji Watanabe, Kazunori Nomoto, Shin Kato

2016 IEEE 19th International Conference on Intelligent Transportation Systems (ITSC) > 1661 - 1665

2016 IEEE 19th International Conference on Intelligent Transportation Systems (ITSC)

For object detection, large-scale databases obtained via an in-vehicle camera have been proposed. The databases are generally used to evaluate object detection methods and/or to train classifiers in these methods. When proposing a new database, we should evaluate the characteristics of a large number of the samples in the database to improve the usability of the proposed database. In the evaluation,...

chapter

ConvNets and AGMM based real-time human detection under fisheye camera for embedded surveillance

Van Tuan Nguyen, Thanh Binh Nguyen, Sun-Tae Chung

2016 International Conference on Information and Communication Technology Convergence (ICTC) > 840 - 845

2016 International Conference on Information and Communication Technology Convergence (ICTC)

Human detection is an essential task in so many applications, especially surveillance systems. Recently, ConvNets (Convolutional Neural Networks)-based YOLO model is a successful method applied for object (including human) detection. It is one of the fastest way to detect directly objects from the input image. However, compared to the ConvNets-based state-of-the-art object detection methods, YOLO...

INFONA - science communication portal

Search results

Deep Image Retrieval Applied on Kotenseki Ancient Japanese Literature

Deep affordance learning for single- and multiple-instance object detection

PPR-FCN: Weakly Supervised Visual Relation Detection via Parallel Pairwise R-FCN

Cut, Paste and Learn: Surprisingly Easy Synthesis for Instance Detection

Identification of autonomous landing sign for unmanned aerial vehicle based on faster regions with convolutional neural network

What looks good with my sofa: Multimodal search engine for interior design

Dense Captioning with Joint Inference and Visual Context

Polyhedral Conic Classifiers for Visual Object Detection and Classification

Mining Object Parts from CNNs via Active Question-Answering

ViP-CNN: Visual Phrase Guided Convolutional Neural Network

Weighted Hough voting for multi-view car detection

Learn local priors by transferring training masks for salient object detection

An intelligent mannequin based system with real-time view of regional ophthalmic blocks

Automatic Privacy Prediction to Accelerate Social Image Sharing

Real-Time Target Detection and Recognition with Deep Convolutional Networks for Intelligent Visual Surveillance

Learn How to Choose: Independent Detectors Versus Composite Visual Phrases

A systemic approach to automatic metadata extraction from multimedia content

MedianStruck for long-term tracking applications

Visualization of object image database by using logistic discriminant analysis

ConvNets and AGMM based real-time human detection under fisheye camera for embedded surveillance

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options