Search results

Items from 121 to 140 out of 1,054 results

1 ...
4
5
6
7
8
9
10

chapter

Learning Detection with Diverse Proposals

Samaneh Azadi, Jiashi Feng, Trevor Darrell

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 7369 - 7377

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

To predict a set of diverse and informative proposals with enriched representations, this paper introduces a differentiable Determinantal Point Process (DPP) layer that is able to augment the object detection architectures. Most modern object detection architectures, such as Faster R-CNN, learn to localize objects by minimizing deviations from the ground truth, but ignore correlation between multiple...

chapter

Network Dissection: Quantifying Interpretability of Deep Visual Representations

David Bau, Bolei Zhou, Aditya Khosla, Aude Oliva, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3319 - 3327

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose a general framework called Network Dissection for quantifying the interpretability of latent representations of CNNs by evaluating the alignment between individual hidden units and a set of semantic concepts. Given any CNN model, the proposed method draws on a data set of concepts to score the semantics of hidden units at each intermediate convolutional layer. The units with semantics are...

chapter

Learning Multifunctional Binary Codes for Both Category and Attribute Oriented Retrieval Tasks

Haomiao Liu, Ruiping Wang, Shiguang Shan, Xilin Chen

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6259 - 6268

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this paper we propose a unified framework to address multiple realistic image retrieval tasks concerning both category and attributes. Considering the scale of modern datasets, hashing is favorable for its low complexity. However, most existing hashing methods are designed to preserve one single kind of similarity, thus incapable of dealing with the different tasks simultaneously. To overcome this...

chapter

Not All Pixels Are Equal: Difficulty-Aware Semantic Segmentation via Deep Layer Cascade

Xiaoxiao Li, Ziwei Liu, Ping Luo, Chen Change Loy, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6459 - 6468

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose a novel deep layer cascade (LC) method to improve the accuracy and speed of semantic segmentation. Unlike the conventional model cascade (MC) that is composed of multiple independent models, LC treats a single deep model as a cascade of several sub-models. Earlier sub-models are trained to handle easy and confident regions, and they progressively feed-forward harder regions to the next...

chapter

Convolutional Random Walk Networks for Semantic Image Segmentation

Gedas Bertasius, Lorenzo Torresani, Stella X. Yu, Jianbo Shi

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6137 - 6145

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Most current semantic segmentation methods rely on fully convolutional networks (FCNs). However, their use of large receptive fields and many pooling layers cause low spatial resolution inside the deep layers. This leads to predictions with poor localization around the boundaries. Prior work has attempted to address this issue by post-processing predictions with CRFs or MRFs. But such models often...

chapter

From Zero-Shot Learning to Conventional Supervised Classification: Unseen Visual Data Synthesis

Yang Long, Li Liu, Ling Shao, Fumin Shen, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6165 - 6174

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Robust object recognition systems usually rely on powerful feature extraction mechanisms from a large number of real images. However, in many realistic applications, collecting sufficient images for ever-growing new classes is unattainable. In this paper, we propose a new Zero-shot learning (ZSL) framework that can synthesise visual features for unseen classes without acquiring real images. Using...

chapter

Learning a Deep Embedding Model for Zero-Shot Learning

Li Zhang, Tao Xiang, Shaogang Gong

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3010 - 3019

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Zero-shot learning (ZSL) models rely on learning a joint embedding space where both textual/semantic description of object classes and visual representation of object images can be projected to for nearest neighbour search. Despite the success of deep neural networks that learn an end-to-end model between text and images in other vision problems such as image captioning, very few deep ZSL model exists...

chapter

Low-Rank Embedded Ensemble Semantic Dictionary for Zero-Shot Learning

Zhengming Ding, Ming Shao, Yun Fu

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6005 - 6013

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Zero-shot learning for visual recognition has received much interest in the most recent years. However, the semantic gap across visual features and their underlying semantics is still the biggest obstacle in zero-shot learning. To fight off this hurdle, we propose an effective Low-rank Embedded Semantic Dictionary learning (LESD) through ensemble strategy. Specifically, we formulate a novel framework...

chapter

Generative Face Completion

Yijun Li, Sifei Liu, Jimei Yang, Ming-Hsuan Yang

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5892 - 5900

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this paper, we propose an effective face completion algorithm using a deep generative model. Different from well-studied background completion, the face completion task is more challenging as it often requires to generate semantically new pixels for the missing key components (e.g., eyes and mouths) that contain large appearance variations. Unlike existing nonparametric algorithms that search for...

chapter

Asynchronous Temporal Fields for Action Recognition

Gunnar A. Sigurdsson, Santosh Divvala, Ali Farhadi, Abhinav Gupta

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5650 - 5659

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Actions are more than just movements and trajectories: we cook to eat and we hold a cup to drink from it. A thorough understanding of videos requires going beyond appearance modeling and necessitates reasoning about the sequence of activities, as well as the higher-level constructs such as intentions. But how do we model and reason about these? We propose a fully-connected temporal CRF model for reasoning...

chapter

Simple Does It: Weakly Supervised Instance and Semantic Segmentation

Anna Khoreva, Rodrigo Benenson, Jan Hosang, Matthias Hein, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1665 - 1674

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Semantic labelling and instance segmentation are two tasks that require particularly costly annotations. Starting from weak supervision in the form of bounding box detection annotations, we propose a new approach that does not require modification of the segmentation training procedure. We show that when carefully designing the input labels from given bounding boxes, even a single round of training...

chapter

Captioning Images with Diverse Objects

Subhashini Venugopalan, Lisa Anne Hendricks, Marcus Rohrbach, Raymond Mooney, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1170 - 1178

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Recent captioning models are limited in their ability to scale and describe concepts unseen in paired image-text corpora. We propose the Novel Object Captioner (NOC), a deep visual semantic captioning model that can describe a large number of object categories not present in existing image-caption datasets. Our model takes advantage of external sources – labeled images from object recognition...

chapter

UberNet: Training a Universal Convolutional Neural Network for Low-, Mid-, and High-Level Vision Using Diverse Datasets and Limited Memory

Iasonas Kokkinos

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5454 - 5463

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this work we train in an end-to-end manner a convolutional neural network (CNN) that jointly handles low-, mid-, and high-level vision tasks in a unified architecture. Such a network can act like a swiss knife for vision tasks, we call it an UberNet to indicate its overarching nature. The main contribution of this work consists in handling challenges that emerge when scaling up to many tasks. We...

chapter

One-Shot Video Object Segmentation

S. Caelles, K. -K. Maninis, J. Pont-Tuset, L. Leal-Taixe, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5320 - 5329

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper tackles the task of semi-supervised video object segmentation, i.e., the separation of an object from the background in a video, given the mask of the first frame. We present One-Shot Video Object Segmentation (OSVOS), based on a fully-convolutional neural network architecture that is able to successively transfer generic semantic information, learned on ImageNet, to the task of foreground...

chapter

Surveillance Video Parsing with Single Frame Supervision

Si Liu, Changhu Wang, Ruihe Qian, Han Yu, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1013 - 1021

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Surveillance video parsing, which segments the video frames into several labels, e.g., face, pants, left-leg, has wide applications [41, 8]. However, pixel-wisely annotating all frames is tedious and inefficient. In this paper, we develop a Single frame Video Parsing (SVP) method which requires only one labeled frame per video in training stage. To parse one particular frame, the video segment preceding...

chapter

Semantic Regularisation for Recurrent Image Annotation

Feng Liu, Tao Xiang, Timothy M. Hospedales, Wankou Yang, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4160 - 4168

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

The CNN-RNN design pattern is increasingly widely applied in a variety of image annotation tasks including multi-label classification and captioning. Existing models use the weakly semantic CNN hidden layer or its transform as the image embedding that provides the interface between the CNN and RNN. This leaves the RNN overstretched with two jobs: predicting the visual concepts and modelling their...

chapter

Learning to Detect Salient Objects with Image-Level Supervision

Lijun Wang, Huchuan Lu, Yifan Wang, Mengyang Feng, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3796 - 3805

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Deep Neural Networks (DNNs) have substantially improved the state-of-the-art in salient object detection. However, training DNNs requires costly pixel-level annotations. In this paper, we leverage the observation that image-level tags provide important cues of foreground salient objects, and develop a weakly supervised learning method for saliency detection using image-level tags only. The Foreground...

chapter

Adversarial neural networks for basal membrane segmentation of microinvasive cervix carcinoma in histopathology images

Du Wang, Chaochen Gu, Kaijie Wu, Xinping Guan

2017 International Conference on Machine Learning and Cybernetics (ICMLC) > 2 > 385 - 389

2017 International Conference on Machine Learning and Cybernetics (ICMLC)

In this paper, a deep learning based method is proposed to perform the basal membrane segmentation, which is a crucial prerequisite step in the diagnosis of Microinvasive Carcinoma of the Cervix (MIC). We regard this as an object contour segmentaion problem and modify the recently proposed contour segmentation deep learning model with adversarial training strategy. Traditional adversarial networks...

chapter

Combining Bottom-Up, Top-Down, and Smoothness Cues for Weakly Supervised Image Segmentation

Anirban Roy, Sinisa Todorovic

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 7282 - 7291

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper addresses the problem of weakly supervised semantic image segmentation. Our goal is to label every pixel in a new image, given only image-level object labels associated with training images. Our problem statement differs from common semantic segmentation, where pixel-wise annotations are typically assumed available in training. We specify a novel deep architecture which fuses three distinct...

chapter

STD2P: RGBD Semantic Segmentation Using Spatio-Temporal Data-Driven Pooling

Yang He, Wei-Chen Chiu, Margret Keuper, Mario Fritz

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 7158 - 7167

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose a novel superpixel-based multi-view convolutional neural network for semantic image segmentation. The proposed network produces a high quality segmentation of a single image by leveraging information from additional views of the same scene. Particularly in indoor videos such as captured by robotic platforms or handheld and bodyworn RGBD cameras, nearby video frames provide diverse viewpoints...

1 ...
4
5
6
7
8
9
10

Keywords:
TRAINING
SEMANTICS

Publication date

Set your own date range

Content availability

Available (1,047)
None (7)

Keywords

FEATURE EXTRACTION (286)
VISUALIZATION (185)
IMAGE SEGMENTATION (127)
SUPPORT VECTOR MACHINES (119)
ACCURACY (111)
COMPUTATIONAL MODELING (97)
CONTEXT (97)
NEURAL NETWORKS (85)
VECTORS (77)
MACHINE LEARNING (76)
NATURAL LANGUAGE PROCESSING (69)
DATA MINING (67)
TESTING (65)
LABELING (63)
TRAINING DATA (61)
CLASSIFICATION ALGORITHMS (59)
CORRELATION (58)
IMAGE RETRIEVAL (58)
ONTOLOGIES (55)
DATA MODELS (54)
SYNTACTICS (54)
IMAGE COLOR ANALYSIS (52)
HIDDEN MARKOV MODELS (49)
DICTIONARIES (48)
DATABASES (47)
STANDARDS (47)
KERNEL (45)
PREDICTIVE MODELS (42)
MATHEMATICAL MODEL (41)
TAGGING (40)
LEARNING (ARTIFICIAL INTELLIGENCE) (39)
MEASUREMENT (39)
DETECTORS (35)
IMAGE CLASSIFICATION (35)
TEXT ANALYSIS (35)
DEEP LEARNING (34)
INTERNET (33)
INFORMATION RETRIEVAL (32)
TEXT CATEGORIZATION (30)
ALGORITHM DESIGN AND ANALYSIS (29)
SENTIMENT ANALYSIS (29)
ENCODING (28)
OPTIMIZATION (28)
ADAPTATION MODELS (27)
COMPUTER ARCHITECTURE (27)
COMPUTER VISION (27)
SPEECH (27)
SUPPORT VECTOR MACHINE CLASSIFICATION (27)
CONVOLUTION (26)
OBJECT DETECTION (26)
PROBABILISTIC LOGIC (25)
VOCABULARY (25)
CONFERENCES (24)
CONTEXT MODELING (24)
HISTOGRAMS (24)
IMAGE ANNOTATION (24)
MULTIMEDIA COMMUNICATION (23)
PROBABILITY (23)
BENCHMARK TESTING (22)
DECODING (22)
EDUCATIONAL INSTITUTIONS (22)
HUMANS (22)
BUILDINGS (21)
IMAGE RECOGNITION (21)
SHAPE (21)
COMPUTERS (20)
ENTROPY (20)
KNOWLEDGE BASED SYSTEMS (20)
NEURONS (20)
PATTERN CLASSIFICATION (20)
THREE-DIMENSIONAL DISPLAYS (20)
CLUSTERING ALGORITHMS (19)
ESTIMATION (19)
PROPOSALS (19)
ROBOTS (19)
TEXT CLASSIFICATION (19)
ANALYTICAL MODELS (18)
CAMERAS (18)
CONTENT-BASED RETRIEVAL (18)
IMAGE EDGE DETECTION (18)
NOISE MEASUREMENT (18)
SPEECH RECOGNITION (18)
INDEXES (17)
MOTION PICTURES (17)
PRAGMATICS (17)
ARTIFICIAL NEURAL NETWORKS (16)
IMAGE RECONSTRUCTION (16)
LINEAR PROGRAMMING (16)
NATURAL LANGUAGES (16)
ROBUSTNESS (16)
SUPERVISED LEARNING (16)
TWITTER (16)
GOLD (15)
MANUALS (15)
MEDIA (15)
PRINCIPAL COMPONENT ANALYSIS (15)
ROADS (15)
SEARCH ENGINES (15)
more

INFONA - science communication portal

Search results

Learning Detection with Diverse Proposals

Network Dissection: Quantifying Interpretability of Deep Visual Representations

Learning Multifunctional Binary Codes for Both Category and Attribute Oriented Retrieval Tasks

Not All Pixels Are Equal: Difficulty-Aware Semantic Segmentation via Deep Layer Cascade

Convolutional Random Walk Networks for Semantic Image Segmentation

From Zero-Shot Learning to Conventional Supervised Classification: Unseen Visual Data Synthesis

Learning a Deep Embedding Model for Zero-Shot Learning

Low-Rank Embedded Ensemble Semantic Dictionary for Zero-Shot Learning

Generative Face Completion

Asynchronous Temporal Fields for Action Recognition

Simple Does It: Weakly Supervised Instance and Semantic Segmentation

Captioning Images with Diverse Objects

UberNet: Training a Universal Convolutional Neural Network for Low-, Mid-, and High-Level Vision Using Diverse Datasets and Limited Memory

One-Shot Video Object Segmentation

Surveillance Video Parsing with Single Frame Supervision

Semantic Regularisation for Recurrent Image Annotation

Learning to Detect Salient Objects with Image-Level Supervision

Adversarial neural networks for basal membrane segmentation of microinvasive cervix carcinoma in histopathology images

Combining Bottom-Up, Top-Down, and Smoothness Cues for Weakly Supervised Image Segmentation

STD2P: RGBD Semantic Segmentation Using Spatio-Temporal Data-Driven Pooling

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options