Search results

Items from 141 to 160 out of 1,266 results

chapter

Semantically Consistent Regularization for Zero-Shot Recognition

Pedro Morgado, Nuno Vasconcelos

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2037 - 2046

The role of semantics in zero-shot learning is considered. The effectiveness of previous approaches is analyzed according to the form of supervision provided. While some learn semantics independently, others only supervise the semantic subspace explained by training classes. Thus, the former is able to constrain the whole space but lacks the ability to model semantic correlations. The latter addresses...

chapter

Matrix Tri-Factorization with Manifold Regularizations for Zero-Shot Learning

Xing Xu, Fumin Shen, Yang Yang, Dongxiang Zhang, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2007 - 2016

Zero-shot learning (ZSL) aims to recognize objects of unseen classes with available training data from another set of seen classes. Existing solutions are focused on exploring knowledge transfer via an intermediate semantic embedding (e.g., attributes) shared between seen and unseen classes. In this paper, we propose a novel projection framework based on matrix tri-factorization with manifold regularizations...

chapter

Learning Spatial Regularization with Image-Level Supervisions for Multi-label Image Classification

Feng Zhu, Hongsheng Li, Wanli Ouyang, Nenghai Yu, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2027 - 2036

Multi-label image classification is a fundamental but challenging task in computer vision. Great progress has been achieved by exploiting semantic relations between labels in recent years. However, conventional approaches are unable to model the underlying spatial relations between labels in multi-label images, because spatial annotations of the labels are generally not provided. In this paper, we...

chapter

Learning Object Interactions and Descriptions for Semantic Image Segmentation

Guangrun Wang, Ping Luo, Liang Lin, Xiaogang Wang

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5235 - 5243

Recent advanced deep convolutional networks (CNNs) achieved great successes in many computer vision tasks, because of their compelling learning complexity and the presences of large-scale labeled data. However, as obtaining per-pixel annotations is expensive, performances of CNNs in semantic image segmentation are not fully exploited. This work significantly increases segmentation accuracy of CNNs...

chapter

Webly Supervised Semantic Segmentation

Bin Jin, Maria V. Ortiz Segovia, Sabine Susstrunk

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1705 - 1714

We propose a weakly supervised semantic segmentation algorithm that uses image tags for supervision. We apply the tags in queries to collect three sets of web images, which encode the clean foregrounds, the common backgrounds, and realistic scenes of the classes. We introduce a novel three-stage training pipeline to progressively learn semantic segmentation models. We first train and refine a class-specific...

chapter

Exploiting Saliency for Object Segmentation from Image Level Labels

Seong Joon Oh, Rodrigo Benenson, Anna Khoreva, Zeynep Akata, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5038 - 5047

There have been remarkable improvements in the semantic labelling task in the recent years. However, the state of the art methods rely on large-scale pixel-level annotations. This paper studies the problem of training a pixel-wise semantic labeller network from image-level annotations of the present object classes. Recently, it has been shown that high quality seeds indicating discriminative object...

chapter

Dilated Residual Networks

Fisher Yu, Vladlen Koltun, Thomas Funkhouser

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 636 - 644

Convolutional networks for image classification progressively reduce resolution until the image is represented by tiny feature maps in which the spatial structure of the scene is no longer discernible. Such loss of spatial acuity can limit image classification accuracy and complicate the transfer of the model to downstream applications that require detailed scene understanding. These problems can...

chapter

Multi-scale Continuous CRFs as Sequential Deep Networks for Monocular Depth Estimation

Dan Xu, Elisa Ricci, Wanli Ouyang, Xiaogang Wang, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 161 - 169

This paper addresses the problem of depth estimation from a single still image. Inspired by recent works on multi-scale convolutional neural networks (CNN), we propose a deep model which fuses complementary information derived from multiple CNN side outputs. Different from previous methods, the integration is obtained by means of continuous Conditional Random Fields (CRFs). In particular, we propose...

chapter

Predicting Ground-Level Scene Layout from Aerial Imagery

Menghua Zhai, Zachary Bessinger, Scott Workman, Nathan Jacobs

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4132 - 4140

We introduce a novel strategy for learning to extract semantically meaningful features from aerial imagery. Instead of manually labeling the aerial imagery, we propose to predict (noisy) semantic features automatically extracted from co-located ground imagery. Our network architecture takes an aerial image as input, extracts features using a convolutional neural network, and then applies an adaptive...

chapter

Mining Object Parts from CNNs via Active Question-Answering

Quanshi Zhang, Ruiming Cao, Ying Nian Wu, Song-Chun Zhu

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3890 - 3899

Given a convolutional neural network (CNN) that is pre-trained for object classification, this paper proposes to use active question-answering to semanticize neural patterns in conv-layers of the CNN and mine part concepts. For each part concept, we mine neural patterns in the pre-trained CNN, which are related to the target part, and use these patterns to construct an And-Or graph (AOG) to represent...

chapter

Object Co-skeletonization with Co-segmentation

Koteswar Rao Jerripothula, Jianfei Cai, Jiangbo Lu, Junsong Yuan

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3881 - 3889

Recent advances in the joint processing of images have certainly shown its advantages over the individual processing. Different from the existing works geared towards co-segmentation or co-localization, in this paper, we explore a new joint processing topic: co-skeletonization, which is defined as joint skeleton extraction of common objects in a set of semantically similar images. Object skeletonization...

chapter

Surface Motion Capture Transfer with Gaussian Process Regression

Adnane Boukhayma, Jean-Sebastien Franco, Edmond Boyer

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3558 - 3566

We address the problem of transferring motion between captured 4D models. We particularly focus on human subjects for which the ability to automatically augment 4D datasets, by propagating movements between subjects, is of interest in a great deal of recent vision applications that builds on human visual corpus. Given 4D training sets for two subjects for which a sparse set of corresponding keyposes...

chapter

A Joint Speaker-Listener-Reinforcer Model for Referring Expressions

Licheng Yu, Hao Tan, Mohit Bansal, Tamara L. Berg

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3521 - 3529

Referring expressions are natural language constructions used to identify particular objects within a scene. In this paper, we propose a unified framework for the tasks of referring expression comprehension and generation. Our model is composed of three modules: speaker, listener, and reinforcer. The speaker generates referring expressions, the listener comprehends referring expressions, and the reinforcer...

chapter

Multi-label classification for images with missing labels

Jianghong Ma, Jicong Fan, Wei Wang

2017 IEEE 15th International Conference on Industrial Informatics (INDIN) > 1050 - 1055

Multi-label classification is a vital problem, as it has numerous applications in computer vision, such as automatic image annotation. The label set for each instance is always assumed to be in the original whole form. However, missing labels often occur because manual labelling is a time-consuming and label-intensive work in the case of large amount of data. The incompleteness of labels can certainly...

chapter

Skeleton Key: Image Captioning by Skeleton-Attribute Decomposition

Yufei Wang, Zhe Lin, Xiaohui Shen, Scott Cohen, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 7378 - 7387

Recently, there has been a lot of interest in automatically generating descriptions for an image. Most existing language-model based approaches for this task learn to generate an image description word by word in its original word order. However, for humans, it is more natural to locate the objects and their relationships first, and then elaborate on each object, describing notable attributes. We...

chapter

Learning Detection with Diverse Proposals

Samaneh Azadi, Jiashi Feng, Trevor Darrell

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 7369 - 7377

To predict a set of diverse and informative proposals with enriched representations, this paper introduces a differentiable Determinantal Point Process (DPP) layer that is able to augment the object detection architectures. Most modern object detection architectures, such as Faster R-CNN, learn to localize objects by minimizing deviations from the ground truth, but ignore correlation between multiple...

chapter

Network Dissection: Quantifying Interpretability of Deep Visual Representations

David Bau, Bolei Zhou, Aditya Khosla, Aude Oliva, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3319 - 3327

We propose a general framework called Network Dissection for quantifying the interpretability of latent representations of CNNs by evaluating the alignment between individual hidden units and a set of semantic concepts. Given any CNN model, the proposed method draws on a data set of concepts to score the semantics of hidden units at each intermediate convolutional layer. The units with semantics are...

chapter

Learning Multifunctional Binary Codes for Both Category and Attribute Oriented Retrieval Tasks

Haomiao Liu, Ruiping Wang, Shiguang Shan, Xilin Chen

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6259 - 6268

In this paper we propose a unified framework to address multiple realistic image retrieval tasks concerning both category and attributes. Considering the scale of modern datasets, hashing is favorable for its low complexity. However, most existing hashing methods are designed to preserve one single kind of similarity, thus incapable of dealing with the different tasks simultaneously. To overcome this...

chapter

Not All Pixels Are Equal: Difficulty-Aware Semantic Segmentation via Deep Layer Cascade

Xiaoxiao Li, Ziwei Liu, Ping Luo, Chen Change Loy, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6459 - 6468

We propose a novel deep layer cascade (LC) method to improve the accuracy and speed of semantic segmentation. Unlike the conventional model cascade (MC) that is composed of multiple independent models, LC treats a single deep model as a cascade of several sub-models. Earlier sub-models are trained to handle easy and confident regions, and they progressively feed-forward harder regions to the next...

chapter

Convolutional Random Walk Networks for Semantic Image Segmentation

Gedas Bertasius, Lorenzo Torresani, Stella X. Yu, Jianbo Shi

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6137 - 6145

Most current semantic segmentation methods rely on fully convolutional networks (FCNs). However, their use of large receptive fields and many pooling layers cause low spatial resolution inside the deep layers. This leads to predictions with poor localization around the boundaries. Prior work has attempted to address this issue by post-processing predictions with CRFs or MRFs. But such models often...

Data set: ieee
Keywords: TRAINING, SEMANTICS

INFONA - science communication portal
