Search results

Items from 141 to 160 out of 1,054 results

1 ...
5
6
7
8
9
10
11

chapter

Look into Person: Self-Supervised Structure-Sensitive Learning and a New Benchmark for Human Parsing

Ke Gong, Xiaodan Liang, Dongyu Zhang, Xiaohui Shen, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6757 - 6765

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Human parsing has recently attracted a lot of research interests due to its huge application potentials. However existing datasets have limited number of images and annotations, and lack the variety of human appearances and the coverage of challenging cases in unconstrained environment. In this paper, we introduce a new benchmark Look into Person (LIP) that makes a significant advance in terms of...

chapter

Commonly Uncommon: Semantic Sparsity in Situation Recognition

Mark Yatskar, Vicente Ordonez, Luke Zettlemoyer, Ali Farhadi

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6335 - 6344

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Semantic sparsity is a common challenge in structured visual classification problems, when the output space is complex, the vast majority of the possible predictions are rarely, if ever, seen in the training set. This paper studies semantic sparsity in situation recognition, the task of producing structured summaries of what is happening in images, including activities, objects and the roles objects...

chapter

Zero-Shot Learning — The Good, the Bad and the Ugly

Yongqin Xian, Bernt Schiele, Zeynep Akata

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3077 - 3086

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Due to the importance of zero-shot learning, the number of proposed approaches has increased steadily recently. We argue that it is time to take a step back and to analyze the status quo of the area. The purpose of this paper is three-fold. First, given the fact that there is no agreed upon zero-shot learning benchmark, we first define a new benchmark by unifying both the evaluation protocols and...

chapter

Budget-Aware Deep Semantic Video Segmentation

Behrooz Mahasseni, Sinisa Todorovic, Alan Fern

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2077 - 2086

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this work, we study a poorly understood trade-off between accuracy and runtime costs for deep semantic video segmentation. While recent work has demonstrated advantages of learning to speed-up deep activity detection, it is not clear if similar advantages will hold for our very different segmentation loss function, which is defined over individual pixels across the frames. In deep video segmentation,...

chapter

Spatial-Semantic Image Search by Visual Feature Synthesis

Long Mai, Hailin Jin, Zhe Lin, Chen Fang, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1121 - 1130

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

The performance of image retrieval has been improved tremendously in recent years through the use of deep feature representations. Most existing methods, however, aim to retrieve images that are visually similar or semantically relevant to the query, irrespective of spatial configuration. In this paper, we develop a spatial-semantic image search technology that enables users to search for images with...

chapter

Semantic Compositional Networks for Visual Captioning

Zhe Gan, Chuang Gan, Xiaodong He, Yunchen Pu, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1141 - 1150

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

A Semantic Compositional Network (SCN) is developed for image captioning, in which semantic concepts (i.e., tags) are detected from the image, and the probability of each tag is used to compose the parameters in a long short-term memory (LSTM) network. The SCN extends each weight matrix of the LSTM to an ensemble of tag-dependent weight matrices. The degree to which each member of the ensemble is...

chapter

Beyond Instance-Level Image Retrieval: Leveraging Captions to Learn a Global Visual Representation for Semantic Retrieval

Albert Gordo, Diane Larlus

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5272 - 5281

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Querying with an example image is a simple and intuitive interface to retrieve information from a visual database. Most of the research in image retrieval has focused on the task of instance-level image retrieval, where the goal is to retrieve images that contain the same object instance as the query image. In this work we move beyond instance-level retrieval and consider the task of semantic image...

chapter

Weakly Supervised Actor-Action Segmentation via Robust Multi-task Ranking

Yan Yan, Chenliang Xu, Dawen Cai, Jason J. Corso

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1022 - 1031

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Fine-grained activity understanding in videos has attracted considerable recent attention with a shift from action classification to detailed actor and action understanding that provides compelling results for perceptual needs of cutting-edge autonomous systems. However, current methods for detailed understanding of actor and action have significant limitations: they require large amounts of finely...

chapter

StyleNet: Generating Attractive Visual Captions with Styles

Chuang Gan, Zhe Gan, Xiaodong He, Jianfeng Gao, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 955 - 964

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose a novel framework named StyleNet to address the task of generating attractive captions for images and videos with different styles. To this end, we devise a novel model component, named factored LSTM, which automatically distills the style factors in the monolingual text corpus. Then at runtime, we can explicitly control the style in the caption generation process so as to produce attractive...

chapter

Colorization as a Proxy Task for Visual Understanding

Gustav Larsson, Michael Maire, Gregory Shakhnarovich

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 840 - 849

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We investigate and improve self-supervision as a drop-in replacement for ImageNet pretraining, focusing on automatic colorization as the proxy task. Self-supervised training has been shown to be more promising for utilizing unlabeled data than other, traditional unsupervised learning methods. We build on this success and evaluate the ability of our self-supervised network in several contexts. On VOC...

chapter

FCSS: Fully Convolutional Self-Similarity for Dense Semantic Correspondence

Seungryong Kim, Dongbo Min, Bumsub Ham, Sangryul Jeon, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 616 - 625

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present a descriptor, called fully convolutional self-similarity (FCSS), for dense semantic correspondence. To robustly match points among different instances within the same object class, we formulate FCSS using local self-similarity (LSS) within a fully convolutional network. In contrast to existing CNN-based descriptors, FCSS is inherently insensitive to intra-class appearance variations because...

chapter

A new semantic segmentation model for remote sensing images

Xin Wei, Yajing Guo, Xin Gao, Menglong Yan, more

2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS) > 1776 - 1779

IGARSS 2017 - 2017 IEEE International Geoscience and Remote Sensing Symposium

Semantic segmentation for remote sensing images is a critical process in the workflow of object-based image analysis. Recently, convolutional neural networks(CNNs) are powerful visual models that yield hierarchies of features. In this paper, we propose a deep convolutional encoder-decoder model for remote sensing images segmentation. Specifically, we rely on the encoder network to extract the high-level...

chapter

Synthesizing remote sensing images by conditional adversarial networks

Dao-Yu Lin, Yang Wang, Guang-Luan Xu, Kun Fu

2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS) > 48 - 50

IGARSS 2017 - 2017 IEEE International Geoscience and Remote Sensing Symposium

Automated annotation of urban areas from overhead imagery plays an essential role in many remote sensing applications. Generative Adversarial Nets (GANs) is one of the most effective ways to handle this problem. In this manuscript, two tricks were added in conditional GANs(cGANs) which learn the mapping from input image to output remote sensing image. All the experimental results demonstrated that...

chapter

Comparison of belief propagation and graph-cut approaches for contextual classification of 3D lidar point cloud data

L. Landrieu, C. Mallet, M. Weinmann

2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS) > 2768 - 2771

IGARSS 2017 - 2017 IEEE International Geoscience and Remote Sensing Symposium

In this paper, we focus on the classification of lidar point cloud data acquired via mobile laser scanning, whereby the classification relies on a context model based on a Conditional Random Field (CRF). We present two approximate inference algorithms based on belief propagation, as well as a graph-cut-based approach not yet applied in this context. To demonstrate the performance of our approach,...

chapter

Can semantic labeling methods generalize to any city? the inria aerial image labeling benchmark

Emmanuel Maggiori, Yuliya Tarabalka, Guillaume Charpiat, Pierre Alliez

2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS) > 3226 - 3229

IGARSS 2017 - 2017 IEEE International Geoscience and Remote Sensing Symposium

New challenges in remote sensing impose the necessity of designing pixel classification methods that, once trained on a certain dataset, generalize to other areas of the earth. This may include regions where the appearance of the same type of objects is significantly different. In the literature it is common to use a single image and split it into training and test sets to train a classifier and assess...

chapter

Toward country scale building detection with convolutional neural network using aerial images

Hsiuhan Lexie Yang, Dalton Lunga, Jiangye Yuan

2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS) > 870 - 873

IGARSS 2017 - 2017 IEEE International Geoscience and Remote Sensing Symposium

Establishing up-to-date nationwide building maps is essential to understand urban dynamics, such as estimating population and urban planning and many other applications. However, an efficient and effective solution is yet to be developed. In this paper, for the first time we evaluate three state-of-the-art CNNs for detecting buildings across entire United States using aerial images. The three CNN...

chapter

A Multi-Label Classification Method on Chinese Temporal Expressions Based on Character Embedding

Baosheng Yin, Bowen Jin

2017 4th International Conference on Information Science and Control Engineering (ICISCE) > 51 - 54

2017 4th International Conference on Information Science and Control Engineering (ICISCE)

Understanding temporal expressions is the important foundation of many NLP tasks. However, the varied representations of temporal expressions is difficulty in analysis and understanding. To parsing expressions, an effective classification method of temporal expressions is significant. A temporal expression may belong to one or more classes, but the classification usually requires manual annotation...

chapter

Loss Max-Pooling for Semantic Image Segmentation

Samuel Rota Bulo, Gerhard Neuhold, Peter Kontschieder

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 7082 - 7091

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We introduce a novel loss max-pooling concept for handling imbalanced training data distributions, applicable as alternative loss layer in the context of deep neural networks for semantic image segmentation. Most real-world semantic segmentation datasets exhibit long tail distributions with few object categories comprising the majority of data and consequently biasing the classifiers towards them...

chapter

Joint height estimation and semantic labeling of monocular aerial images with CNNS

Shivangi Srivastava, Michele Volpi, Devis Tuia

2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS) > 5173 - 5176

IGARSS 2017 - 2017 IEEE International Geoscience and Remote Sensing Symposium

We aim to jointly estimate height and semantically label monocular aerial images. These two tasks are traditionally addressed separately in remote sensing, despite their strong correlation. Therefore, a model learning both height and classes jointly seems advantageous and so, we propose a multitask Convolutional Neural Network (CNN) architecture with two losses: one performing semantic labeling, and...

chapter

Knowledge-guided recurrent neural network learning for task-oriented action prediction

Liang Lin, Lili Huang, Tianshui Chen, Yukang Gan, more

2017 IEEE International Conference on Multimedia and Expo (ICME) > 625 - 630

2017 IEEE International Conference on Multimedia and Expo (ICME)

This paper aims at task-oriented action prediction, i.e., predicting a sequence of actions towards accomplishing a specific task under a certain scene, which is a new problem in computer vision research. The main challenges lie in how to model task-specific knowledge and integrate it in the learning procedure. In this work, we propose to train a recurrent longshort term memory (LSTM) network for handling...

1 ...
5
6
7
8
9
10
11

Keywords:
TRAINING
SEMANTICS

Publication date

Set your own date range

Content availability

Available (1,047)
None (7)

Keywords

FEATURE EXTRACTION (286)
VISUALIZATION (185)
IMAGE SEGMENTATION (127)
SUPPORT VECTOR MACHINES (119)
ACCURACY (111)
COMPUTATIONAL MODELING (97)
CONTEXT (97)
NEURAL NETWORKS (85)
VECTORS (77)
MACHINE LEARNING (76)
NATURAL LANGUAGE PROCESSING (69)
DATA MINING (67)
TESTING (65)
LABELING (63)
TRAINING DATA (61)
CLASSIFICATION ALGORITHMS (59)
CORRELATION (58)
IMAGE RETRIEVAL (58)
ONTOLOGIES (55)
DATA MODELS (54)
SYNTACTICS (54)
IMAGE COLOR ANALYSIS (52)
HIDDEN MARKOV MODELS (49)
DICTIONARIES (48)
DATABASES (47)
STANDARDS (47)
KERNEL (45)
PREDICTIVE MODELS (42)
MATHEMATICAL MODEL (41)
TAGGING (40)
LEARNING (ARTIFICIAL INTELLIGENCE) (39)
MEASUREMENT (39)
DETECTORS (35)
IMAGE CLASSIFICATION (35)
TEXT ANALYSIS (35)
DEEP LEARNING (34)
INTERNET (33)
INFORMATION RETRIEVAL (32)
TEXT CATEGORIZATION (30)
ALGORITHM DESIGN AND ANALYSIS (29)
SENTIMENT ANALYSIS (29)
ENCODING (28)
OPTIMIZATION (28)
ADAPTATION MODELS (27)
COMPUTER ARCHITECTURE (27)
COMPUTER VISION (27)
SPEECH (27)
SUPPORT VECTOR MACHINE CLASSIFICATION (27)
CONVOLUTION (26)
OBJECT DETECTION (26)
PROBABILISTIC LOGIC (25)
VOCABULARY (25)
CONFERENCES (24)
CONTEXT MODELING (24)
HISTOGRAMS (24)
IMAGE ANNOTATION (24)
MULTIMEDIA COMMUNICATION (23)
PROBABILITY (23)
BENCHMARK TESTING (22)
DECODING (22)
EDUCATIONAL INSTITUTIONS (22)
HUMANS (22)
BUILDINGS (21)
IMAGE RECOGNITION (21)
SHAPE (21)
COMPUTERS (20)
ENTROPY (20)
KNOWLEDGE BASED SYSTEMS (20)
NEURONS (20)
PATTERN CLASSIFICATION (20)
THREE-DIMENSIONAL DISPLAYS (20)
CLUSTERING ALGORITHMS (19)
ESTIMATION (19)
PROPOSALS (19)
ROBOTS (19)
TEXT CLASSIFICATION (19)
ANALYTICAL MODELS (18)
CAMERAS (18)
CONTENT-BASED RETRIEVAL (18)
IMAGE EDGE DETECTION (18)
NOISE MEASUREMENT (18)
SPEECH RECOGNITION (18)
INDEXES (17)
MOTION PICTURES (17)
PRAGMATICS (17)
ARTIFICIAL NEURAL NETWORKS (16)
IMAGE RECONSTRUCTION (16)
LINEAR PROGRAMMING (16)
NATURAL LANGUAGES (16)
ROBUSTNESS (16)
SUPERVISED LEARNING (16)
TWITTER (16)
GOLD (15)
MANUALS (15)
MEDIA (15)
PRINCIPAL COMPONENT ANALYSIS (15)
ROADS (15)
SEARCH ENGINES (15)
more

INFONA - science communication portal

Search results

Look into Person: Self-Supervised Structure-Sensitive Learning and a New Benchmark for Human Parsing

Commonly Uncommon: Semantic Sparsity in Situation Recognition

Zero-Shot Learning — The Good, the Bad and the Ugly

Budget-Aware Deep Semantic Video Segmentation

Spatial-Semantic Image Search by Visual Feature Synthesis

Semantic Compositional Networks for Visual Captioning

Beyond Instance-Level Image Retrieval: Leveraging Captions to Learn a Global Visual Representation for Semantic Retrieval

Weakly Supervised Actor-Action Segmentation via Robust Multi-task Ranking

StyleNet: Generating Attractive Visual Captions with Styles

Colorization as a Proxy Task for Visual Understanding

FCSS: Fully Convolutional Self-Similarity for Dense Semantic Correspondence

A new semantic segmentation model for remote sensing images

Synthesizing remote sensing images by conditional adversarial networks

Comparison of belief propagation and graph-cut approaches for contextual classification of 3D lidar point cloud data

Can semantic labeling methods generalize to any city? the inria aerial image labeling benchmark

Toward country scale building detection with convolutional neural network using aerial images

A Multi-Label Classification Method on Chinese Temporal Expressions Based on Character Embedding

Loss Max-Pooling for Semantic Image Segmentation

Joint height estimation and semantic labeling of monocular aerial images with CNNS

Knowledge-guided recurrent neural network learning for task-oriented action prediction

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options