Items from 101 to 120 out of 2,015 results

chapter

CityPersons: A Diverse Dataset for Pedestrian Detection

Shanshan Zhang, Rodrigo Benenson, Bernt Schiele

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4457 - 4465

Convnets have enabled significant progress in pedestrian detection recently, but there are still open questions regarding suitable architectures and training data. We revisit CNN design and point out key adaptations, enabling plain FasterRCNN to obtain state-of-the-art results on the Caltech dataset. To achieve further improvement from more and better data, we introduce CityPersons, a new set of person...

chapter

Learning to Extract Semantic Structure from Documents Using Multimodal Fully Convolutional Neural Networks

Xiao Yang, Ersin Yumer, Paul Asente, Mike Kraley, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4342 - 4351

We present an end-to-end, multimodal, fully convolutional network for extracting semantic structures from document images. We consider document semantic structure extraction as a pixel-wise segmentation task, and propose a unified model that classifies pixels based not only on their visual appearance, as in the traditional page segmentation task, but also on the content of underlying text. Moreover,...

chapter

Learning Video Object Segmentation from Static Images

Federico Perazzi, Anna Khoreva, Rodrigo Benenson, Bernt Schiele, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3491 - 3500

Inspired by recent advances of deep learning in instance segmentation and object tracking, we introduce the concept of convnet-based guidance applied to video object segmentation. Our model proceeds on a per-frame basis, guided by the output of the previous frame towards the object of interest in the next frame. We demonstrate that highly accurate object segmentation in videos can be enabled by using...

chapter

Object Region Mining with Adversarial Erasing: A Simple Classification to Semantic Segmentation Approach

Yunchao Wei, Jiashi Feng, Xiaodan Liang, Ming-Ming Cheng, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6488 - 6496

We investigate a principled way to progressively mine discriminative object regions using classification networks to address the weakly supervised semantic segmentation problem. Classification networks are only responsive to small and sparse discriminative regions from the object of interest, which deviates from the requirement of the segmentation task that needs to localize dense, interior and integral...

chapter

ShapeOdds: Variational Bayesian Learning of Generative Shape Models

Shireen Elhabian, Ross Whitaker

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2185 - 2196

Shape models provide a compact parameterization of a class of shapes, and have been shown to be important to a variety of vision problems, including object detection, tracking, and image segmentation. Learning generative shape models from grid-structured representations, aka silhouettes, is usually hindered by (1) data likelihoods with intractable marginals and posteriors, (2) high-dimensional shape...

chapter

Improving RANSAC-Based Segmentation through CNN Encapsulation

Dustin Morley, Hassan Foroosh

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2661 - 2670

In this work, we present a method for improving a random sample consensus (RANSAC) based image segmentation algorithm by encapsulating it within a convolutional neural network (CNN). The improvements are gained by gradient descent training on the set of pre-RANSAC filtering and thresholding operations using a novel RANSAC-based loss function, which is geared toward optimizing the strength of the correct...

chapter

Learning Object Interactions and Descriptions for Semantic Image Segmentation

Guangrun Wang, Ping Luo, Liang Lin, Xiaogang Wang

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5235 - 5243

Recent deep convolutional networks (CNNs) have achieved great success in many computer vision tasks because of their compelling learning complexity and the presence of large-scale labeled data. However, as obtaining per-pixel annotations is expensive, the performance of CNNs in semantic image segmentation has not been fully exploited. This work significantly increases the segmentation accuracy of CNNs...

chapter

SPFTN: A Self-Paced Fine-Tuning Network for Segmenting Objects in Weakly Labelled Videos

Dingwen Zhang, Le Yang, Deyu Meng, Dong Xu, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5340 - 5348

Object segmentation in weakly labelled videos is an interesting yet challenging task, which aims at learning to perform category-specific video object segmentation using only video-level tags. Existing work in this area still has some limitations, e.g., a lack of effective DNN-based learning frameworks, under-exploration of context information, and the need to leverage the unstable...

chapter

Webly Supervised Semantic Segmentation

Bin Jin, Maria V. Ortiz Segovia, Sabine Susstrunk

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1705 - 1714

We propose a weakly supervised semantic segmentation algorithm that uses image tags for supervision. We apply the tags in queries to collect three sets of web images, which encode the clean foregrounds, the common backgrounds, and realistic scenes of the classes. We introduce a novel three-stage training pipeline to progressively learn semantic segmentation models. We first train and refine a class-specific...

chapter

Exploiting Saliency for Object Segmentation from Image Level Labels

Seong Joon Oh, Rodrigo Benenson, Anna Khoreva, Zeynep Akata, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5038 - 5047

There have been remarkable improvements in the semantic labelling task in recent years. However, state-of-the-art methods rely on large-scale pixel-level annotations. This paper studies the problem of training a pixel-wise semantic labeller network from image-level annotations of the present object classes. Recently, it has been shown that high-quality seeds indicating discriminative object...

chapter

Dilated Residual Networks

Fisher Yu, Vladlen Koltun, Thomas Funkhouser

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 636 - 644

Convolutional networks for image classification progressively reduce resolution until the image is represented by tiny feature maps in which the spatial structure of the scene is no longer discernible. Such loss of spatial acuity can limit image classification accuracy and complicate the transfer of the model to downstream applications that require detailed scene understanding. These problems can...

chapter

Predicting Ground-Level Scene Layout from Aerial Imagery

Menghua Zhai, Zachary Bessinger, Scott Workman, Nathan Jacobs

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4132 - 4140

We introduce a novel strategy for learning to extract semantically meaningful features from aerial imagery. Instead of manually labeling the aerial imagery, we propose to predict (noisy) semantic features automatically extracted from co-located ground imagery. Our network architecture takes an aerial image as input, extracts features using a convolutional neural network, and then applies an adaptive...

chapter

Object Co-skeletonization with Co-segmentation

Koteswar Rao Jerripothula, Jianfei Cai, Jiangbo Lu, Junsong Yuan

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3881 - 3889

Recent advances in the joint processing of images have shown its advantages over individual processing. In contrast to existing works geared towards co-segmentation or co-localization, in this paper we explore a new joint processing topic: co-skeletonization, which is defined as joint skeleton extraction of common objects in a set of semantically similar images. Object skeletonization...

chapter

Network Dissection: Quantifying Interpretability of Deep Visual Representations

David Bau, Bolei Zhou, Aditya Khosla, Aude Oliva, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3319 - 3327

We propose a general framework called Network Dissection for quantifying the interpretability of latent representations of CNNs by evaluating the alignment between individual hidden units and a set of semantic concepts. Given any CNN model, the proposed method draws on a data set of concepts to score the semantics of hidden units at each intermediate convolutional layer. The units with semantics are...

chapter

Not All Pixels Are Equal: Difficulty-Aware Semantic Segmentation via Deep Layer Cascade

Xiaoxiao Li, Ziwei Liu, Ping Luo, Chen Change Loy, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6459 - 6468

We propose a novel deep layer cascade (LC) method to improve the accuracy and speed of semantic segmentation. Unlike the conventional model cascade (MC) that is composed of multiple independent models, LC treats a single deep model as a cascade of several sub-models. Earlier sub-models are trained to handle easy and confident regions, and they progressively feed-forward harder regions to the next...

chapter

Convolutional Random Walk Networks for Semantic Image Segmentation

Gedas Bertasius, Lorenzo Torresani, Stella X. Yu, Jianbo Shi

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6137 - 6145

Most current semantic segmentation methods rely on fully convolutional networks (FCNs). However, their use of large receptive fields and many pooling layers causes low spatial resolution inside the deep layers. This leads to predictions with poor localization around the boundaries. Prior work has attempted to address this issue by post-processing predictions with CRFs or MRFs. But such models often...

chapter

MCMLSD: A Dynamic Programming Approach to Line Segment Detection

Emilio J. Almazan, Ron Tal, Yiming Qian, James H. Elder

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5854 - 5862

Prior approaches to line segment detection typically involve perceptual grouping in the image domain or global accumulation in the Hough domain. Here we propose a probabilistic algorithm that merges the advantages of both approaches. In the first stage, lines are detected using a global probabilistic Hough approach. In the second stage, each detected line is analyzed in the image domain to localize the...

chapter

Simple Does It: Weakly Supervised Instance and Semantic Segmentation

Anna Khoreva, Rodrigo Benenson, Jan Hosang, Matthias Hein, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1665 - 1674

Semantic labelling and instance segmentation are two tasks that require particularly costly annotations. Starting from weak supervision in the form of bounding box detection annotations, we propose a new approach that does not require modification of the segmentation training procedure. We show that when carefully designing the input labels from given bounding boxes, even a single round of training...

chapter

Hidden Layers in Perceptual Learning

Gad Cohen, Daphna Weinshall

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5349 - 5357

Studies in visual perceptual learning investigate the way human performance improves with practice, in the context of relatively simple (and therefore more manageable) visual tasks. Building on the powerful tools currently available for the training of Convolutional Neural Networks (CNNs), networks whose original architecture was inspired by the visual system, we revisited some of the open computational...

chapter

One-Shot Video Object Segmentation

S. Caelles, K.-K. Maninis, J. Pont-Tuset, L. Leal-Taixe, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5320 - 5329

This paper tackles the task of semi-supervised video object segmentation, i.e., the separation of an object from the background in a video, given the mask of the first frame. We present One-Shot Video Object Segmentation (OSVOS), based on a fully-convolutional neural network architecture that is able to successively transfer generic semantic information, learned on ImageNet, to the task of foreground...

INFONA - science communication portal

Advanced search

Advanced search in people

CityPersons: A Diverse Dataset for Pedestrian Detection

Learning to Extract Semantic Structure from Documents Using Multimodal Fully Convolutional Neural Networks

Learning Video Object Segmentation from Static Images

Object Region Mining with Adversarial Erasing: A Simple Classification to Semantic Segmentation Approach

ShapeOdds: Variational Bayesian Learning of Generative Shape Models

Improving RANSAC-Based Segmentation through CNN Encapsulation

Learning Object Interactions and Descriptions for Semantic Image Segmentation

SPFTN: A Self-Paced Fine-Tuning Network for Segmenting Objects in Weakly Labelled Videos

Webly Supervised Semantic Segmentation

Exploiting Saliency for Object Segmentation from Image Level Labels

Dilated Residual Networks

Predicting Ground-Level Scene Layout from Aerial Imagery

Object Co-skeletonization with Co-segmentation

Network Dissection: Quantifying Interpretability of Deep Visual Representations

Not All Pixels Are Equal: Difficulty-Aware Semantic Segmentation via Deep Layer Cascade

Convolutional Random Walk Networks for Semantic Image Segmentation

MCMLSD: A Dynamic Programming Approach to Line Segment Detection

Simple Does It: Weakly Supervised Instance and Semantic Segmentation

Hidden Layers in Perceptual Learning

One-Shot Video Object Segmentation

Filter options

Publication date

Content availability

Publication type

Keywords

Data set

INFONA - science communication portal

Advanced search

Advanced search in people

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Publication type

Keywords

Data set

Reporting an error / abuse

Sending the report failed

Accessibility options