Convnets have enabled significant progress in pedestrian detection recently, but there are still open questions regarding suitable architectures and training data. We revisit CNN design and point out key adaptations, enabling plain FasterRCNN to obtain state-of-the-art results on the Caltech dataset. To achieve further improvement from more and better data, we introduce CityPersons, a new set of person...
Existing zero-shot learning (ZSL) models typically learn a projection function from a feature space to a semantic embedding space (e.g. attribute space). However, such a projection function is only concerned with predicting the semantic representations of the seen training classes (e.g. attribute prediction) or with classification. When applied to test data, which in the context of ZSL contains different (unseen)...
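As a rough sketch of the projection-based recipe this abstract describes (not the specific model proposed in the paper), the snippet below learns a linear least-squares mapping from visual features to class attribute vectors and then labels unseen-class samples by their nearest attribute prototype; the ridge regulariser and all variable names are illustrative assumptions.

    import numpy as np

    # Illustrative projection-based zero-shot learning: learn features -> attributes,
    # then classify samples from unseen classes by nearest attribute prototype.

    def fit_projection(X_train, A_train, reg=1.0):
        """X_train: (n, d) visual features; A_train: (n, k) attribute targets."""
        d = X_train.shape[1]
        # Ridge-regression solution of min ||X W - A||^2 + reg ||W||^2
        return np.linalg.solve(X_train.T @ X_train + reg * np.eye(d), X_train.T @ A_train)

    def predict_unseen(X_test, W, unseen_prototypes):
        """unseen_prototypes: (c, k) attribute vectors of the unseen classes."""
        A_pred = X_test @ W
        dists = np.linalg.norm(A_pred[:, None, :] - unseen_prototypes[None, :, :], axis=2)
        return dists.argmin(axis=1)  # index of the predicted unseen class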
We present an end-to-end, multimodal, fully convolutional network for extracting semantic structures from document images. We consider document semantic structure extraction as a pixel-wise segmentation task, and propose a unified model that classifies pixels based not only on their visual appearance, as in the traditional page segmentation task, but also on the content of underlying text. Moreover,...
Semantic image inpainting is a challenging task where large missing regions have to be filled based on the available visual data. Existing methods which extract information from only a single image generally produce unsatisfactory results due to the lack of high level context. In this paper, we propose a novel method for semantic image inpainting, which generates the missing content by conditioning...
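One common way to realise "conditioning a generative model on the available visual data" is to search the latent space of a pre-trained generator for a sample that agrees with the observed pixels; the sketch below illustrates that idea with a toy generator and illustrative loss weights, and is not the paper's exact formulation.

    import torch

    # Toy stand-in for a pre-trained generator mapping a latent code to a 32x32 image.
    G = torch.nn.Sequential(torch.nn.Linear(64, 32 * 32), torch.nn.Tanh())

    def inpaint(image, mask, steps=200, lr=0.1):
        """image, mask: flattened (32*32,) tensors; mask is 1 on observed pixels, 0 in the hole."""
        z = torch.randn(64, requires_grad=True)
        opt = torch.optim.Adam([z], lr=lr)
        for _ in range(steps):
            opt.zero_grad()
            gen = G(z)
            context_loss = ((gen - image) ** 2 * mask).sum()  # match known pixels only
            prior_loss = 0.01 * (z ** 2).sum()                # keep z close to the latent prior
            (context_loss + prior_loss).backward()
            opt.step()
        with torch.no_grad():
            gen = G(z)
        return image * mask + gen * (1 - mask)                # paste generated content into the hole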
We investigate a principled way to progressively mine discriminative object regions using classification networks to address the weakly-supervised semantic segmentation problem. Classification networks respond only to small and sparse discriminative regions of the object of interest, which deviates from the requirement of the segmentation task to localize dense, interior and integral...
The role of semantics in zero-shot learning is considered. The effectiveness of previous approaches is analyzed according to the form of supervision provided. While some learn semantics independently, others only supervise the semantic subspace explained by training classes. Thus, the former is able to constrain the whole space but lacks the ability to model semantic correlations. The latter addresses...
Zero-shot learning (ZSL) aims to recognize objects of unseen classes with available training data from another set of seen classes. Existing solutions are focused on exploring knowledge transfer via an intermediate semantic embedding (e.g., attributes) shared between seen and unseen classes. In this paper, we propose a novel projection framework based on matrix tri-factorization with manifold regularizations...
Multi-label image classification is a fundamental but challenging task in computer vision. Great progress has been achieved by exploiting semantic relations between labels in recent years. However, conventional approaches are unable to model the underlying spatial relations between labels in multi-label images, because spatial annotations of the labels are generally not provided. In this paper, we...
Recent deep convolutional networks (CNNs) have achieved great success in many computer vision tasks, owing to their strong learning capacity and the availability of large-scale labeled data. However, as obtaining per-pixel annotations is expensive, the performance of CNNs in semantic image segmentation has not been fully exploited. This work significantly increases the segmentation accuracy of CNNs...
We propose a weakly supervised semantic segmentation algorithm that uses image tags for supervision. We apply the tags in queries to collect three sets of web images, which encode the clean foregrounds, the common backgrounds, and realistic scenes of the classes. We introduce a novel three-stage training pipeline to progressively learn semantic segmentation models. We first train and refine a class-specific...
There have been remarkable improvements in the semantic labelling task in recent years. However, state-of-the-art methods rely on large-scale pixel-level annotations. This paper studies the problem of training a pixel-wise semantic labelling network from image-level annotations of the object classes present. Recently, it has been shown that high quality seeds indicating discriminative object...
Convolutional networks for image classification progressively reduce resolution until the image is represented by tiny feature maps in which the spatial structure of the scene is no longer discernible. Such loss of spatial acuity can limit image classification accuracy and complicate the transfer of the model to downstream applications that require detailed scene understanding. These problems can...
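To make the loss of spatial acuity concrete, the back-of-the-envelope check below counts the downsampling of a typical strided classification backbone; the stride schedule is an assumption (roughly ResNet-like), not taken from the paper.

    # Resolution left after a stack of stride-2 stages (illustrative schedule).
    input_size = 224
    stage_strides = [2, 2, 2, 2, 2]        # stem plus four strided stages
    size = input_size
    for s in stage_strides:
        size //= s
    print(size)  # 7 -> only a 7x7 feature map remains, so scene layout is largely lost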
This paper addresses the problem of depth estimation from a single still image. Inspired by recent works on multi-scale convolutional neural networks (CNN), we propose a deep model which fuses complementary information derived from multiple CNN side outputs. Different from previous methods, the integration is obtained by means of continuous Conditional Random Fields (CRFs). In particular, we propose...
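For intuition, a continuous CRF over depth with quadratic unary and pairwise terms has a closed-form MAP solution, so fusing side outputs amounts to solving a linear system. The sketch below uses a plain average of the side outputs as the unary term and a 4-neighbourhood smoothness term; these are simplifying assumptions rather than the paper's learned potentials, and dense matrices are used only for clarity on small images.

    import numpy as np

    def crf_fuse(side_outputs, pairwise_weight=1.0):
        """side_outputs: list of (H, W) depth maps predicted at different CNN scales."""
        unary = np.mean(side_outputs, axis=0)       # simple unary from the side outputs
        H, W = unary.shape
        n = H * W
        A = np.eye(n)                               # unary term (d_i - unary_i)^2 gives the identity
        b = unary.ravel().copy()
        idx = lambda y, x: y * W + x
        for y in range(H):
            for x in range(W):
                for dy, dx in ((0, 1), (1, 0)):     # right and down neighbours
                    if y + dy < H and x + dx < W:
                        i, j = idx(y, x), idx(y + dy, x + dx)
                        A[i, i] += pairwise_weight
                        A[j, j] += pairwise_weight
                        A[i, j] -= pairwise_weight
                        A[j, i] -= pairwise_weight
        return np.linalg.solve(A, b).reshape(H, W)  # MAP estimate of the quadratic energy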
We introduce a novel strategy for learning to extract semantically meaningful features from aerial imagery. Instead of manually labeling the aerial imagery, we propose to predict (noisy) semantic features automatically extracted from co-located ground imagery. Our network architecture takes an aerial image as input, extracts features using a convolutional neural network, and then applies an adaptive...
Given a convolutional neural network (CNN) that is pre-trained for object classification, this paper proposes to use active question-answering to semanticize neural patterns in conv-layers of the CNN and mine part concepts. For each part concept, we mine neural patterns in the pre-trained CNN, which are related to the target part, and use these patterns to construct an And-Or graph (AOG) to represent...
Recent advances in the joint processing of images have clearly shown its advantages over individual processing. Different from existing works geared towards co-segmentation or co-localization, in this paper we explore a new joint processing topic: co-skeletonization, which is defined as joint skeleton extraction of common objects in a set of semantically similar images. Object skeletonization...
We address the problem of transferring motion between captured 4D models. We focus in particular on human subjects, for which the ability to automatically augment 4D datasets by propagating movements between subjects is of interest in many recent vision applications that build on a human visual corpus. Given 4D training sets for two subjects for which a sparse set of corresponding keyposes...
Referring expressions are natural language constructions used to identify particular objects within a scene. In this paper, we propose a unified framework for the tasks of referring expression comprehension and generation. Our model is composed of three modules: speaker, listener, and reinforcer. The speaker generates referring expressions, the listener comprehends referring expressions, and the reinforcer...
Multi-label classification is a vital problem, as it has numerous applications in computer vision, such as automatic image annotation. The label set for each instance is typically assumed to be given in its complete form. However, missing labels often occur, because manual labelling is time-consuming and labour-intensive for large amounts of data. The incompleteness of labels can certainly...
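A common baseline for coping with incomplete label sets is simply to mask unknown entries out of the loss so they contribute no gradient; the sketch below shows that baseline (the -1 "unknown" convention is an assumption for illustration, not this paper's method).

    import torch

    def masked_bce(logits, labels):
        """logits, labels: (batch, num_labels); labels are 1, 0, or -1 for missing."""
        known = labels != -1                                   # ignore missing entries
        return torch.nn.functional.binary_cross_entropy_with_logits(
            logits[known], labels[known].float())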
Recently, there has been a lot of interest in automatically generating descriptions for an image. Most existing language-model based approaches for this task learn to generate an image description word by word in its original word order. However, for humans, it is more natural to locate the objects and their relationships first, and then elaborate on each object, describing notable attributes. We...