Search results

Items from 61 to 80 out of 805 results

chapter

Learning Spatial Regularization with Image-Level Supervisions for Multi-label Image Classification

Feng Zhu, Hongsheng Li, Wanli Ouyang, Nenghai Yu, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2027 - 2036

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Multi-label image classification is a fundamental but challenging task in computer vision. Great progress has been achieved by exploiting semantic relations between labels in recent years. However, conventional approaches are unable to model the underlying spatial relations between labels in multi-label images, because spatial annotations of the labels are generally not provided. In this paper, we...

chapter

Improving RANSAC-Based Segmentation through CNN Encapsulation

Dustin Morley, Hassan Foroosh

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2661 - 2670

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this work, we present a method for improving a random sample consensus (RANSAC) based image segmentation algorithm by encapsulating it within a convolutional neural network (CNN). The improvements are gained by gradient descent training on the set of pre-RANSAC filtering and thresholding operations using a novel RANSAC-based loss function, which is geared toward optimizing the strength of the correct...

chapter

Adaptive Class Preserving Representation for Image Classification

Jian-Xun Mi, Qiankun Fu, Weisheng Li

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2624 - 2632

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In linear representation-based image classification, an unlabeled sample is represented by the entire training set. To obtain a stable and discriminative solution, regularization on the vector of representation coefficients is necessary. For example, the representation in sparse representation-based classification (SRC) uses L1 norm penalty as regularization, which is equal to lasso. However, lasso...

chapter

PoseAgent: Budget-Constrained 6D Object Pose Estimation via Reinforcement Learning

Alexander Krull, Eric Brachmann, Sebastian Nowozin, Frank Michel, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2566 - 2574

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

State-of-the-art computer vision algorithms often achieve efficiency by making discrete choices about which hypotheses to explore next. This allows allocation of computational resources to promising candidates, however, such decisions are non-differentiable. As a result, these algorithms are hard to train in an end-to-end fashion. In this work we propose to learn an efficient algorithm for the task...

chapter

From Red Wine to Red Tomato: Composition with Context

Ishan Misra, Abhinav Gupta, Martial Hebert

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1160 - 1169

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Compositionality and contextuality are key building blocks of intelligence. They allow us to compose known concepts to generate new and complex ones. However, traditional learning methods do not model both these properties and require copious amounts of labeled data to learn new concepts. A large fraction of existing techniques, e.g., using late fusion, compose concepts but fail to model contextuality...

chapter

Unsupervised Adaptive Re-identification in Open World Dynamic Camera Networks

Rameswar Panda, Amran Bhuiyan, Vittorio Murino, Amit K. Roy-Chowdhury

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1377 - 1386

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Person re-identification is an open and challenging problem in computer vision. Existing approaches have concentrated on either designing the best feature representation or learning optimal matching metrics in a static setting where the number of cameras are fixed in a network. Most approaches have neglected the dynamic and open world nature of the re-identification problem, where a new camera may...

chapter

Deep Hashing Network for Unsupervised Domain Adaptation

Hemanth Venkateswara, Jose Eusebio, Shayok Chakraborty, Sethuraman Panchanathan

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5385 - 5394

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In recent years, deep neural networks have emerged as a dominant machine learning tool for a wide variety of application domains. However, training a deep neural network requires a large amount of labeled data, which is an expensive process in terms of time, labor and human expertise. Domain adaptation or transfer learning algorithms address this challenge by leveraging labeled data in a different,...

chapter

Are You Smarter Than a Sixth Grader? Textbook Question Answering for Multimodal Machine Comprehension

Aniruddha Kembhavi, Minjoon Seo, Dustin Schwenk, Jonghyun Choi, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5376 - 5384

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We introduce the task of Multi-Modal Machine Comprehension (M3C), which aims at answering multimodal questions given a context of text, diagrams and images. We present the Textbook Question Answering (TQA) dataset that includes 1,076 lessons and 26,260 multi-modal questions, taken from middle school science curricula. Our analysis shows that a significant portion of questions require complex parsing...

chapter

SPFTN: A Self-Paced Fine-Tuning Network for Segmenting Objects in Weakly Labelled Videos

Dingwen Zhang, Le Yang, Deyu Meng, Dong Xu, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5340 - 5348

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Object segmentation in weakly labelled videos is an interesting yet challenging task, which aims at learning to perform category-specific video object segmentation by only using video-level tags. Existing works in this research area might still have some limitations, e.g., lack of effective DNN-based learning frameworks, under-exploring the context information, and requiring to leverage the unstable...

chapter

Teaching Compositionality to CNNs

Austin Stone, Huayan Wang, Michael Stark, Yi Liu, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 732 - 741

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Convolutional neural networks (CNNs) have shown great success in computer vision, approaching human-level performance when trained for specific tasks via application-specific loss functions. In this paper, we propose a method for augmenting and training CNNs so that their learned features are compositional. It encourages networks to form representations that disentangle objects from their surroundings...

chapter

Learning Discriminative and Transformation Covariant Local Feature Detectors

Xu Zhang, Felix X. Yu, Svebor Karaman, Shih-Fu Chang

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4923 - 4931

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Robust covariant local feature detectors are important for detecting local features that are (1) discriminative of the image content and (2) can be repeatably detected at consistent locations when the image undergoes diverse transformations. Such detectors are critical for applications such as image search and scene reconstruction. Many learning-based local feature detectors address one of these two...

chapter

Training Object Class Detectors with Click Supervision

Dim P. Papadopoulos, Jasper R. R. Uijlings, Frank Keller, Vittorio Ferrari

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 180 - 189

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Training object class detectors typically requires a large set of images with objects annotated by bounding boxes. However, manually drawing bounding boxes is very time consuming. In this paper we greatly reduce annotation time by proposing center-click annotations: we ask annotators to click on the center of an imaginary bounding box which tightly encloses the object instance. We then incorporate...

chapter

Soft-Margin Mixture of Regressions

Dong Huang, Longfei Han, Fernando De la Torre

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4058 - 4066

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Nonlinear regression is a common statistical tool to solve many computer vision problems (e.g., age estimation, pose estimation). Existing approaches to nonlinear regression fall into two main categories: (1) The universal approach provides an implicit or explicit homogeneous feature mapping (e.g., kernel ridge regression, Gaussian process regression, neural networks). These approaches may fail when...

chapter

Improving Training of Deep Neural Networks via Singular Value Bounding

Kui Jia, Dacheng Tao, Shenghua Gao, Xiangmin Xu

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3994 - 4002

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Deep learning methods achieve great success recently on many computer vision problems. In spite of these practical successes, optimization of deep networks remains an active topic in deep learning research. In this work, we focus on investigation of the network solution properties that can potentially lead to good performance. Our research is inspired by theoretical and empirical results that use...

chapter

Hierarchical random forest for senior action recognition in videos

Runlin Zhao, Yang Zhao, Ruoyu Deng, Fenhua Li

2017 International Conference on Wavelet Analysis and Pattern Recognition (ICWAPR) > 171 - 176

2017 International Conference on Wavelet Analysis and Pattern Recognition (ICWAPR)

Action recognition in videos is a hot research topic in computer vision because of the popularization of application such as human-machine interaction, intelligent monitoring. Recently, with the aging phenomenon of population becoming more and more serious, the analysis of senior actions is becoming more and more important. Random forest has been wildly used in action recognition because of its efficiency...

chapter

Deep Mixture of Linear Inverse Regressions Applied to Head-Pose Estimation

Stephane Lathuiliere, Remi Juge, Pablo Mesejo, Rafael Munoz-Salinas, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 7149 - 7157

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Convolutional Neural Networks (ConvNets) have become the state-of-the-art for many classification and regression problems in computer vision. When it comes to regression, approaches such as measuring the Euclidean distance of target and predictions are often employed as output layer. In this paper, we propose the coupling of a Gaussian mixture of linear inverse regressions with a ConvNet, and we describe...

chapter

Domain Adaptation by Mixture of Alignments of Second-or Higher-Order Scatter Tensors

Piotr Koniusz, Yusuf Tas, Fatih Porikli

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 7139 - 7148

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this paper, we propose an approach to the domain adaptation, dubbed Second-or Higher-order Transfer of Knowledge (So-HoT), based on the mixture of alignments of second-or higher-order scatter statistics between the source and target domains. The human ability to learn from few labeled samples is a recurring motivation in the literature for domain adaptation. Towards this end, we investigate the...

chapter

From Zero-Shot Learning to Conventional Supervised Classification: Unseen Visual Data Synthesis

Yang Long, Li Liu, Ling Shao, Fumin Shen, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6165 - 6174

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Robust object recognition systems usually rely on powerful feature extraction mechanisms from a large number of real images. However, in many realistic applications, collecting sufficient images for ever-growing new classes is unattainable. In this paper, we propose a new Zero-shot learning (ZSL) framework that can synthesise visual features for unseen classes without acquiring real images. Using...

chapter

MIML-FCN+: Multi-Instance Multi-Label Learning via Fully Convolutional Networks with Privileged Information

Hao Yang, Joey Tianyi Zhou, Jianfei Cai, Yew Soon Ong

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5996 - 6004

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Multi-instance multi-label (MIML) learning has many interesting applications in computer visions, including multi-object recognition and automatic image tagging. In these applications, additional information such as bounding-boxes, image captions and descriptions is often available during training phrase, which is referred as privileged information (PI). However, as existing works on learning using...

chapter

Learning Detailed Face Reconstruction from a Single Image

Elad Richardson, Matan Sela, Roy Or-El, Ron Kimmel

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5553 - 5562

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Reconstructing the detailed geometric structure of a face from a given image is a key to many computer vision and graphics applications, such as motion capture and reenactment. The reconstruction task is challenging as human faces vary extensively when considering expressions, poses, textures, and intrinsic geometries. While many approaches tackle this complexity by using additional data to reconstruct...

Keywords:
TRAINING
COMPUTER VISION

Publication date

Set your own date range

Content availability

Available (796)
None (9)

Keywords

FEATURE EXTRACTION (347)
ACCURACY (124)
COMPUTATIONAL MODELING (118)
PATTERN RECOGNITION (117)
OBJECT DETECTION (116)
SUPPORT VECTOR MACHINES (109)
IMAGE CLASSIFICATION (107)
VISUALIZATION (107)
CLASSIFICATION ALGORITHMS (101)
IMAGE SEGMENTATION (101)
IMAGE RECOGNITION (99)
DATABASES (91)
SHAPE (88)
FACE RECOGNITION (85)
LEARNING (ARTIFICIAL INTELLIGENCE) (85)
IMAGE COLOR ANALYSIS (83)
DETECTORS (79)
CAMERAS (77)
CONFERENCES (77)
FACE (76)
PIXEL (75)
TESTING (73)
ARTIFICIAL NEURAL NETWORKS (70)
MACHINE LEARNING (70)
OBJECT RECOGNITION (70)
DATA MINING (69)
HISTOGRAMS (69)
ROBUSTNESS (68)
NEURAL NETWORKS (66)
PRINCIPAL COMPONENT ANALYSIS (65)
ESTIMATION (62)
HUMANS (61)
KERNEL (61)
IMAGE EDGE DETECTION (56)
ALGORITHM DESIGN AND ANALYSIS (55)
IMAGE PROCESSING (53)
IMAGE MOTION ANALYSIS (49)
MATHEMATICAL MODEL (45)
LIGHTING (44)
SIGNAL PROCESSING (44)
HIDDEN MARKOV MODELS (41)
COMPUTER ARCHITECTURE (39)
EQUATIONS (37)
TRAINING DATA (37)
OPTIMIZATION (36)
VECTORS (35)
REAL TIME SYSTEMS (34)
TRANSFORMS (34)
BOOSTING (32)
CONVOLUTION (32)
IMAGE REPRESENTATION (32)
IMAGE SEQUENCES (32)
MACHINE VISION (32)
IMAGE RESOLUTION (31)
CORRELATION (29)
NEURAL NETS (29)
SUPPORT VECTOR MACHINE (29)
VEHICLES (29)
COMPUTERS (28)
IMAGE RECONSTRUCTION (28)
OPTICAL IMAGING (28)
VIDEOS (28)
SEMANTICS (27)
SIGNAL PROCESSING ALGORITHMS (27)
TRACKING (27)
VIDEO SIGNAL PROCESSING (27)
STANDARDS (26)
NEURONS (25)
NOISE (25)
SUPPORT VECTOR MACHINE CLASSIFICATION (25)
THREE DIMENSIONAL DISPLAYS (25)
DATA MODELS (23)
EIGENVALUES AND EIGENFUNCTIONS (23)
IMAGE COLOUR ANALYSIS (23)
POSE ESTIMATION (23)
ARTIFICIAL INTELLIGENCE (22)
DEEP LEARNING (22)
FACE DETECTION (22)
TARGET TRACKING (22)
CLUSTERING ALGORITHMS (21)
IMAGE MATCHING (21)
DICTIONARIES (20)
PEDESTRIAN DETECTION (20)
GESTURE RECOGNITION (19)
IMAGE RETRIEVAL (19)
PATTERN CLASSIFICATION (19)
SOLID MODELING (19)
VIDEO SEQUENCES (19)
ADABOOST (18)
COMPLEXITY THEORY (18)
DECISION TREES (18)
EDUCATIONAL INSTITUTIONS (18)
VISUAL DATABASES (18)
ANALYTICAL MODELS (17)
GEOMETRY (17)
INSPECTION (17)
MEASUREMENT (17)
SVM (17)
more

INFONA - science communication portal

Search results

Learning Spatial Regularization with Image-Level Supervisions for Multi-label Image Classification

Improving RANSAC-Based Segmentation through CNN Encapsulation

Adaptive Class Preserving Representation for Image Classification

PoseAgent: Budget-Constrained 6D Object Pose Estimation via Reinforcement Learning

From Red Wine to Red Tomato: Composition with Context

Unsupervised Adaptive Re-identification in Open World Dynamic Camera Networks

Deep Hashing Network for Unsupervised Domain Adaptation

Are You Smarter Than a Sixth Grader? Textbook Question Answering for Multimodal Machine Comprehension

SPFTN: A Self-Paced Fine-Tuning Network for Segmenting Objects in Weakly Labelled Videos

Teaching Compositionality to CNNs

Learning Discriminative and Transformation Covariant Local Feature Detectors

Training Object Class Detectors with Click Supervision

Soft-Margin Mixture of Regressions

Improving Training of Deep Neural Networks via Singular Value Bounding

Hierarchical random forest for senior action recognition in videos

Deep Mixture of Linear Inverse Regressions Applied to Head-Pose Estimation

Domain Adaptation by Mixture of Alignments of Second-or Higher-Order Scatter Tensors

From Zero-Shot Learning to Conventional Supervised Classification: Unseen Visual Data Synthesis

MIML-FCN+: Multi-Instance Multi-Label Learning via Fully Convolutional Networks with Privileged Information

Learning Detailed Face Reconstruction from a Single Image

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options