Search results

Items from 121 to 140 out of 3,095 results

1 ...
4
5
6
7
8
9
10

chapter

Semantic Autoencoder for Zero-Shot Learning

Elyor Kodirov, Tao Xiang, Shaogang Gong

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4447 - 4456

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Existing zero-shot learning (ZSL) models typically learn a projection function from a feature space to a semantic embedding space (e.g. attribute space). However, such a projection function is only concerned with predicting the training seen class semantic representation (e.g. attribute prediction) or classification. When applied to test data, which in the context of ZSL contains different (unseen)...

chapter

Low-Rank Bilinear Pooling for Fine-Grained Classification

Shu Kong, Charless Fowlkes

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 7025 - 7034

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Pooling second-order local feature statistics to form a high-dimensional bilinear feature has been shown to achieve state-of-the-art performance on a variety of fine-grained classification tasks. To address the computational demands of high feature dimensionality, we propose to represent the covariance features as a matrix and apply a low-rank bilinear classifier. The resulting classifier can be evaluated...

chapter

SST: Single-Stream Temporal Action Proposals

Shyamal Buch, Victor Escorcia, Chuanqi Shen, Bernard Ghanem, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6373 - 6382

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Our paper presents a new approach for temporal detection of human actions in long, untrimmed video sequences. We introduce Single-Stream Temporal Action Proposals (SST), a new effective and efficient deep architecture for the generation of temporal action proposals. Our network can run continuously in a single stream over very long input video sequences, without the need to divide input into short...

chapter

Lean Crowdsourcing: Combining Humans and Machines in an Online System

Steve Branson, Grant Van Horn, Pietro Perona

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6109 - 6118

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We introduce a method to greatly reduce the amount of redundant annotations required when crowdsourcing annotations such as bounding boxes, parts, and class labels. For example, if two Mechanical Turkers happen to click on the same pixel location when annotating a part in a given image–an event that is very unlikely to occur by random chance–, it is a strong indication that the...

chapter

FASON: First and Second Order Information Fusion Network for Texture Recognition

Xiyang Dai, Joe Yue-Hei Ng, Larry S. Davis

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6100 - 6108

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Deep networks have shown impressive performance on many computer vision tasks. Recently, deep convolutional neural networks (CNNs) have been used to learn discriminative texture representations. One of the most successful approaches is Bilinear CNN model that explicitly captures the second order statistics within deep features. However, these networks cut off the first order information flow in the...

chapter

ShapeOdds: Variational Bayesian Learning of Generative Shape Models

Shireen Elhabian, Ross Whitaker

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2185 - 2196

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Shape models provide a compact parameterization of a class of shapes, and have been shown to be important to a variety of vision problems, including object detection, tracking, and image segmentation. Learning generative shape models from grid-structured representations, aka silhouettes, is usually hindered by (1) data likelihoods with intractable marginals and posteriors, (2) high-dimensional shape...

chapter

Semantically Consistent Regularization for Zero-Shot Recognition

Pedro Morgado, Nuno Vasconcelos

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2037 - 2046

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

The role of semantics in zero-shot learning is considered. The effectiveness of previous approaches is analyzed according to the form of supervision provided. While some learn semantics independently, others only supervise the semantic subspace explained by training classes. Thus, the former is able to constrain the whole space but lacks the ability to model semantic correlations. The latter addresses...

chapter

End-to-End Training of Hybrid CNN-CRF Models for Stereo

Patrick Knobelreiter, Christian Reinbacher, Alexander Shekhovtsov, Thomas Pock

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1456 - 1465

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose a novel and principled hybrid CNN+CRF model for stereo estimation. Our model allows to exploit the advantages of both, convolutional neural networks (CNNs) and conditional random fields (CRFs) in an unified approach. The CNNs compute expressive features for matching and distinctive color edges, which in turn are used to compute the unary and binary costs of the CRF. For inference, we apply...

chapter

Training Object Class Detectors with Click Supervision

Dim P. Papadopoulos, Jasper R. R. Uijlings, Frank Keller, Vittorio Ferrari

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 180 - 189

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Training object class detectors typically requires a large set of images with objects annotated by bounding boxes. However, manually drawing bounding boxes is very time consuming. In this paper we greatly reduce annotation time by proposing center-click annotations: we ask annotators to click on the center of an imaginary bounding box which tightly encloses the object instance. We then incorporate...

chapter

A Joint Speaker-Listener-Reinforcer Model for Referring Expressions

Licheng Yu, Hao Tan, Mohit Bansal, Tamara L. Berg

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3521 - 3529

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Referring expressions are natural language constructions used to identify particular objects within a scene. In this paper, we propose a unified framework for the tasks of referring expression comprehension and generation. Our model is composed of three modules: speaker, listener, and reinforcer. The speaker generates referring expressions, the listener comprehends referring expressions, and the reinforcer...

chapter

Attentional Push: A Deep Convolutional Network for Augmenting Image Salience with Shared Attention Modeling in Social Scenes

Siavash Gorji, James J. Clark

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3472 - 3481

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We present a novel visual attention tracking technique based on Shared Attention modeling. By considering the viewer as a participant in the activity occurring in the scene, our model learns the loci of attention of the scene actors and use it to augment image salience. We go beyond image salience and instead of only computing the power of image regions to pull attention, we also consider the strength...

chapter

A novel roller bearing fault diagnosis method based on the wavelet extreme learning machine

Xin Yu, Li Shunming, Wang Jingrui

2017 Prognostics and System Health Management Conference (PHM-Harbin) > 1 - 6

2017 Prognostics and System Health Management Conference (PHM-Harbin)

The safety and reliability of roller bearing always have significant importance in rotating machinery. It is needful to build an efficient and excellent accuracy method to monitoring and diagnosis the baring failure. A novel method is presented in this paper to classify the fault feature by wavelet function and extreme learning machine(ELM) that take into account the high accuracy and efficient. The...

chapter

An improved KNN text classification algorithm based on Simhash

Jie Liu, Ting Jin, Kejia Pan, Yi Yang, more

2017 IEEE 16th International Conference on Cognitive Informatics & Cognitive Computing (ICCI*CC) > 92 - 95

2017 IEEE 16th International Conference on Cognitive Informatics & Cognitive Computing (ICCI*CC)

An improved KNN text classification algorithm based on Simhash has been proposed by introducing Simhash and the average Hamming distance of adjacent texts as a unit, which solves the problems caused by data imbalance and the large computational overhead in the traditional KNN text classification algorithms. Experimental results demonstrate that the proposed algorithm performs a higher precision, a...

chapter

A Dataset and Exploration of Models for Understanding Video Data through Fill-in-the-Blank Question-Answering

Tegan Maharaj, Nicolas Ballas, Anna Rohrbach, Aaron Courville, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 7359 - 7368

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

While deep convolutional neural networks frequently approach or exceed human-level performance in benchmark tasks involving static images, extending this success to moving images is not straightforward. Video understanding is of interest for many applications, including content recommendation, prediction, summarization, event/object detection, and understanding human visual perception. However, many...

chapter

Deep Mixture of Linear Inverse Regressions Applied to Head-Pose Estimation

Stephane Lathuiliere, Remi Juge, Pablo Mesejo, Rafael Munoz-Salinas, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 7149 - 7157

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Convolutional Neural Networks (ConvNets) have become the state-of-the-art for many classification and regression problems in computer vision. When it comes to regression, approaches such as measuring the Euclidean distance of target and predictions are often employed as output layer. In this paper, we propose the coupling of a Gaussian mixture of linear inverse regressions with a ConvNet, and we describe...

chapter

Not All Pixels Are Equal: Difficulty-Aware Semantic Segmentation via Deep Layer Cascade

Xiaoxiao Li, Ziwei Liu, Ping Luo, Chen Change Loy, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6459 - 6468

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose a novel deep layer cascade (LC) method to improve the accuracy and speed of semantic segmentation. Unlike the conventional model cascade (MC) that is composed of multiple independent models, LC treats a single deep model as a cascade of several sub-models. Earlier sub-models are trained to handle easy and confident regions, and they progressively feed-forward harder regions to the next...

chapter

Deep Roots: Improving CNN Efficiency with Hierarchical Filter Groups

Yani Ioannou, Duncan Robertson, Roberto Cipolla, Antonio Criminisi

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5977 - 5986

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We propose a new method for creating computationally efficient and compact convolutional neural networks (CNNs) using a novel sparse connection structure that resembles a tree root. This allows a significant reduction in computational cost and number of parameters compared to state-of-the-art deep CNNs, without compromising accuracy, by exploiting the sparsity of inter-layer filter dependencies. We...

chapter

Learning a Deep Embedding Model for Zero-Shot Learning

Li Zhang, Tao Xiang, Shaogang Gong

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3010 - 3019

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Zero-shot learning (ZSL) models rely on learning a joint embedding space where both textual/semantic description of object classes and visual representation of object images can be projected to for nearest neighbour search. Despite the success of deep neural networks that learn an end-to-end model between text and images in other vision problems such as image captioning, very few deep ZSL model exists...

chapter

Seeing What is Not There: Learning Context to Determine Where Objects are Missing

Jin Sun, David W. Jacobs

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1234 - 1242

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Most of computer vision focuses on what is in an image. We propose to train a standalone object-centric context representation to perform the opposite task: seeing what is not there. Given an image, our context model can predict where objects should exist, even when no object instances are present. Combined with object detection results, we can perform a novel vision task: finding where objects are...

chapter

Hidden Layers in Perceptual Learning

Gad Cohen, Daphna Weinshall

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5349 - 5357

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Studies in visual perceptual learning investigate the way human performance improves with practice, in the context of relatively simple (and therefore more manageable) visual tasks. Building on the powerful tools currently available for the training of Convolution Neural Networks (CNN), networks whose original architecture was inspired by the visual system, we revisited some of the open computational...

1 ...
4
5
6
7
8
9
10

Keywords:
TRAINING
COMPUTATIONAL MODELING

Publication date

Set your own date range

Content availability

Available (3,081)
None (14)

Keywords

DATA MODELS (545)
FEATURE EXTRACTION (466)
MATHEMATICAL MODEL (426)
ARTIFICIAL NEURAL NETWORKS (344)
PREDICTIVE MODELS (315)
HIDDEN MARKOV MODELS (307)
ACCURACY (285)
SUPPORT VECTOR MACHINES (285)
DATA MINING (212)
SOLID MODELING (203)
TESTING (194)
NEURAL NETWORKS (192)
VISUALIZATION (189)
NEURONS (177)
COMPUTERS (172)
LEARNING (ARTIFICIAL INTELLIGENCE) (171)
SHAPE (169)
SPEECH (165)
MACHINE LEARNING (164)
BIOLOGICAL SYSTEM MODELING (163)
TRAINING DATA (163)
DATABASES (151)
OPTIMIZATION (151)
ADAPTATION MODELS (144)
NEURAL NETS (142)
ANALYTICAL MODELS (141)
KERNEL (138)
EDUCATIONAL INSTITUTIONS (137)
IMAGE SEGMENTATION (134)
CLASSIFICATION ALGORITHMS (132)
ALGORITHM DESIGN AND ANALYSIS (127)
SPEECH RECOGNITION (126)
ESTIMATION (123)
COMPUTER VISION (118)
VECTORS (116)
COMPUTER ARCHITECTURE (115)
ADAPTATION MODEL (112)
SOFTWARE (109)
IMAGE COLOR ANALYSIS (103)
CONTEXT (102)
SEMANTICS (97)
CORRELATION (96)
PATTERN RECOGNITION (93)
ROBUSTNESS (92)
OBJECT DETECTION (84)
PROBABILITY (84)
CONFERENCES (82)
VIRTUAL REALITY (82)
GAMES (81)
PATTERN CLASSIFICATION (79)
HUMANS (78)
COMPLEXITY THEORY (77)
ACOUSTICS (76)
DETECTORS (75)
EQUATIONS (75)
SIGNAL PROCESSING (74)
NATURAL LANGUAGE PROCESSING (73)
DEEP LEARNING (72)
ROBOTS (72)
NEURAL NETWORK (71)
FACE (70)
IMAGE RECOGNITION (70)
FACE RECOGNITION (68)
CAMERAS (67)
OBJECT RECOGNITION (66)
SIMULATION (65)
STANDARDS (65)
CLUSTERING ALGORITHMS (63)
PREDICTION ALGORITHMS (62)
NOISE (61)
INTERNET (60)
REGRESSION ANALYSIS (60)
ARTIFICIAL INTELLIGENCE (59)
BAYES METHODS (59)
HISTOGRAMS (59)
GAUSSIAN PROCESSES (58)
INDEXES (58)
THREE DIMENSIONAL DISPLAYS (58)
COMPUTER BASED TRAINING (57)
MEASUREMENT (57)
IMAGE CLASSIFICATION (56)
STATISTICAL ANALYSIS (56)
PRINCIPAL COMPONENT ANALYSIS (55)
SUPPORT VECTOR MACHINE (54)
BACKPROPAGATION (52)
BIOLOGICAL NEURAL NETWORKS (52)
CONVOLUTION (52)
ELECTRONIC MAIL (52)
DECODING (51)
GENETIC ALGORITHMS (51)
PROBABILISTIC LOGIC (51)
REAL TIME SYSTEMS (51)
ATMOSPHERIC MODELING (50)
BAYESIAN METHODS (50)
ENTROPY (50)
CONTEXT MODELING (49)
IMAGE PROCESSING (49)
LABELING (49)
more

INFONA - science communication portal

Search results

Semantic Autoencoder for Zero-Shot Learning

Low-Rank Bilinear Pooling for Fine-Grained Classification

SST: Single-Stream Temporal Action Proposals

Lean Crowdsourcing: Combining Humans and Machines in an Online System

FASON: First and Second Order Information Fusion Network for Texture Recognition

ShapeOdds: Variational Bayesian Learning of Generative Shape Models

Semantically Consistent Regularization for Zero-Shot Recognition

End-to-End Training of Hybrid CNN-CRF Models for Stereo

Training Object Class Detectors with Click Supervision

A Joint Speaker-Listener-Reinforcer Model for Referring Expressions

Attentional Push: A Deep Convolutional Network for Augmenting Image Salience with Shared Attention Modeling in Social Scenes

A novel roller bearing fault diagnosis method based on the wavelet extreme learning machine

An improved KNN text classification algorithm based on Simhash

A Dataset and Exploration of Models for Understanding Video Data through Fill-in-the-Blank Question-Answering

Deep Mixture of Linear Inverse Regressions Applied to Head-Pose Estimation

Not All Pixels Are Equal: Difficulty-Aware Semantic Segmentation via Deep Layer Cascade

Deep Roots: Improving CNN Efficiency with Hierarchical Filter Groups

Learning a Deep Embedding Model for Zero-Shot Learning

Seeing What is Not There: Learning Context to Determine Where Objects are Missing

Hidden Layers in Perceptual Learning

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options