Search results

Items from 1 to 20 out of 189 results

chapter

Dataset Selection for Controlling Swarms by Visual Demonstration

Karan Kumar Budhraja, Tim Oates

2017 IEEE International Conference on Data Mining Workshops (ICDMW) > 932 - 941

Agent-based modeling is a paradigm of modeling dynamic systems of interacting agents that are individually governed by specified behavioral rules. Training a model of such agents to produce an emergent behavior by specification of the emergent (as opposed to agent) behavior is easier from a demonstration perspective. Without the involvement of manual behavior specification via code or reliance on...

chapter

Attributes2Classname: A Discriminative Model for Attribute-Based Unsupervised Zero-Shot Learning

Berkan Demirel, Ramazan Gokberk Cinbis, Nazli Ikizler-Cinbis

2017 IEEE International Conference on Computer Vision (ICCV) > 1241 - 1250

We propose a novel approach for unsupervised zero-shot learning (ZSL) of classes based on their names. Most existing unsupervised ZSL methods aim to learn a model for directly comparing image features and class names. However, this proves to be a difficult task due to dominance of non-visual semantics in underlying vector-space embeddings of class names. To address this issue, we discriminatively...

chapter

Areas of Attention for Image Captioning

Marco Pedersoli, Thomas Lucas, Cordelia Schmid, Jakob Verbeek

2017 IEEE International Conference on Computer Vision (ICCV) > 1251 - 1259

We propose “Areas of Attention”, a novel attention-based model for automatic image captioning. Our approach models the dependencies between image regions, caption words, and the state of an RNN language model, using three pairwise interactions. In contrast to previous attention-based approaches that associate image regions only to the RNN state, our method allows a direct association between caption...

chapter

Bilingualism advantage in handwritten character recognition: A deep learning investigation on Persian and Latin scripts

Zahra Sadeghi, Alberto Testolin, Marco Zorzi

2017 7th International Conference on Computer and Knowledge Engineering (ICCKE) > 27 - 32

In this study, we investigated the effects of mastering multiple scripts in handwritten character recognition by means of computational simulations. In particular, we trained a set of deep neural networks on two different datasets of handwritten characters: the HODA dataset, which is a collection of images of handwritten Persian digits, and the MNIST dataset, which contains Latin handwritten digits...

chapter

A saliency detection model combined local and global features

Pin Wang, Guohui Tian, Huanzhao Chen

2017 Chinese Automation Congress (CAC) > 2863 - 2870

Most present methods of saliency detection place too much emphasis on local contrast while ignoring the global features of the image. The detailed characteristics of the image can be reflected by local comparison. However, the overall saliency of the image cannot be reflected. In this paper, a saliency detection model combining local and global features is proposed. Firstly, a local feature...

chapter

Revisiting Unreasonable Effectiveness of Data in Deep Learning Era

Chen Sun, Abhinav Shrivastava, Saurabh Singh, Abhinav Gupta

2017 IEEE International Conference on Computer Vision (ICCV) > 843 - 852

The success of deep learning in vision can be attributed to: (a) models with high capacity; (b) increased computational power; and (c) availability of large-scale labeled data. Since 2012, there have been significant advances in representation capabilities of the models and computational capabilities of GPUs. But the size of the biggest dataset has surprisingly remained constant. What will happen...

chapter

PUnDA: Probabilistic Unsupervised Domain Adaptation for Knowledge Transfer Across Visual Categories

Behnam Gholami, Ognjen Rudovic, Vladimir Pavlovic

2017 IEEE International Conference on Computer Vision (ICCV) > 3601 - 3610

This paper introduces a probabilistic latent variable model to address unsupervised domain adaptation problems. Specifically, we tackle the task of categorization of visual input from different domains by learning projections from each domain to a latent (shared) space jointly with the classifier in the latent space, which simultaneously minimizes the domain disparity while maximizing the classifier's...

chapter

SORT: Second-Order Response Transform for Visual Recognition

Yan Wang, Lingxi Xie, Chenxi Liu, Siyuan Qiao, et al.

2017 IEEE International Conference on Computer Vision (ICCV) > 1368 - 1377

In this paper, we reveal the importance and benefits of introducing second-order operations into deep neural networks. We propose a novel approach named Second-Order Response Transform (SORT), which appends element-wise product transform to the linear sum of a two-branch network module. A direct advantage of SORT is to facilitate cross-branch response propagation, so that each branch can update its...

chapter

Generative Adversarial Networks for Parallel Vision

Wang Kunfeng, Li Xuan, Yan Lan, Wang Fei-Yue

2017 Chinese Automation Congress (CAC) > 7670 - 7675

Video image datasets play an essential role in the design and evaluation of traffic vision methods. However, there is a longstanding difficulty: manually collecting and annotating large-scale, diversified datasets from real scenes is time-consuming and prone to error. In 2016, we proposed the parallel vision methodology to tackle the issues of conventional vision computing approaches in data collection,...

chapter

Understanding convolutional neural networks using a minimal model for handwritten digit recognition

Matthew Y. W. Teow

2017 IEEE 2nd International Conference on Automatic Control and Intelligent Systems (I2CACIS) > 167 - 172

The contribution of this paper is to bridge the gap on understanding the mathematical structure and the computational implementation of a convolutional neural network (CNN) using a minimal model (Minimal CNN). The proposed minimal CNN is presented using a layering approach. This approach provides a concise and accessible understanding of the main mathematical operations of a CNN. Hence, it benefits...

chapter

WordSup: Exploiting Word Annotations for Character Based Text Detection

Han Hu, Chengquan Zhang, Yuxuan Luo, Yuzhuo Wang, et al.

2017 IEEE International Conference on Computer Vision (ICCV) > 4950 - 4959

Imagery texts are usually organized as a hierarchy of several visual elements, i.e. characters, words, text lines and text blocks. Among these elements, the character is the most basic one for various languages such as Western, Chinese, Japanese, mathematical expressions, etc. It is natural and convenient to construct a common text detection engine based on character detectors. However, training character...

chapter

Modeling component and pattern motion selectivity in the MT area of visual cortex

Anila Gundavarapu, Karthik Soman, V. Srinivasa Chakravarthy

2017 International Conference on Advances in Computing, Communications and Informatics (ICACCI) > 561 - 565

Area V5, or the Middle Temporal (MT) area, of the primate brain is said to be involved in visual motion perception. Physiological studies indicate that the neurons in MT respond selectively to the direction of moving stimuli. However, in response to complex stimuli containing multiple oriented components, one set of MT neurons is selective to the direction of the component motion whereas the other set...

chapter

Growing a Brain: Fine-Tuning by Increasing Model Capacity

Yu-Xiong Wang, Deva Ramanan, Martial Hebert

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3029 - 3038

CNNs have made an undeniable impact on computer vision through the ability to learn high-capacity models with large annotated training sets. One of their remarkable properties is the ability to transfer knowledge from a large source dataset to a (typically smaller) target dataset. This is usually accomplished through fine-tuning a fixed-size network on new target data. Indeed, virtually every contemporary...

chapter

Generative Hierarchical Learning of Sparse FRAME Models

Jianwen Xie, Yifei Xu, Erik Nijkamp, Ying Nian Wu, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1933 - 1941

This paper proposes a method for generative learning of hierarchical random field models. The resulting model, which we call the hierarchical sparse FRAME (Filters, Random field, And Maximum Entropy) model, is a generalization of the original sparse FRAME model by decomposing it into multiple parts that are allowed to shift their locations, scales and rotations, so that the resulting model becomes...

chapter

Deep Unsupervised Similarity Learning Using Partially Ordered Sets

Miguel A. Bautista, Artsiom Sanakoyeu, Bjorn Ommer

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1923 - 1932

Unsupervised learning of visual similarities is of paramount importance to computer vision, particularly due to lacking training data for fine-grained similarities. Deep learning of similarities is often based on relationships between pairs or triplets of samples. Many of these relations are unreliable and mutually contradicting, implying inconsistencies when trained without supervision information...

chapter

Semantic Autoencoder for Zero-Shot Learning

Elyor Kodirov, Tao Xiang, Shaogang Gong

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4447 - 4456

Existing zero-shot learning (ZSL) models typically learn a projection function from a feature space to a semantic embedding space (e.g. attribute space). However, such a projection function is only concerned with predicting the training seen class semantic representation (e.g. attribute prediction) or classification. When applied to test data, which in the context of ZSL contains different (unseen)...

chapter

SST: Single-Stream Temporal Action Proposals

Shyamal Buch, Victor Escorcia, Chuanqi Shen, Bernard Ghanem, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6373 - 6382

Our paper presents a new approach for temporal detection of human actions in long, untrimmed video sequences. We introduce Single-Stream Temporal Action Proposals (SST), a new effective and efficient deep architecture for the generation of temporal action proposals. Our network can run continuously in a single stream over very long input video sequences, without the need to divide input into short...

chapter

Semantically Consistent Regularization for Zero-Shot Recognition

Pedro Morgado, Nuno Vasconcelos

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2037 - 2046

The role of semantics in zero-shot learning is considered. The effectiveness of previous approaches is analyzed according to the form of supervision provided. While some learn semantics independently, others only supervise the semantic subspace explained by training classes. Thus, the former is able to constrain the whole space but lacks the ability to model semantic correlations. The latter addresses...

chapter

A Joint Speaker-Listener-Reinforcer Model for Referring Expressions

Licheng Yu, Hao Tan, Mohit Bansal, Tamara L. Berg

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3521 - 3529

Referring expressions are natural language constructions used to identify particular objects within a scene. In this paper, we propose a unified framework for the tasks of referring expression comprehension and generation. Our model is composed of three modules: speaker, listener, and reinforcer. The speaker generates referring expressions, the listener comprehends referring expressions, and the reinforcer...

chapter

Attentional Push: A Deep Convolutional Network for Augmenting Image Salience with Shared Attention Modeling in Social Scenes

Siavash Gorji, James J. Clark

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3472 - 3481

We present a novel visual attention tracking technique based on Shared Attention modeling. By considering the viewer as a participant in the activity occurring in the scene, our model learns the loci of attention of the scene actors and uses them to augment image salience. We go beyond image salience and instead of only computing the power of image regions to pull attention, we also consider the strength...

Keywords:
VISUALIZATION
TRAINING
COMPUTATIONAL MODELING

Keywords

FEATURE EXTRACTION (60)
COMPUTER VISION (23)
SOLID MODELING (21)
DATA MODELS (19)
IMAGE COLOR ANALYSIS (19)
IMAGE CLASSIFICATION (18)
OBJECT RECOGNITION (18)
SEMANTICS (18)
COMPUTER ARCHITECTURE (14)
IMAGE SEGMENTATION (14)
OBJECT DETECTION (14)
DEEP LEARNING (13)
LEARNING (ARTIFICIAL INTELLIGENCE) (13)
DATABASES (12)
NEURAL NETWORKS (12)
VIRTUAL REALITY (12)
ACCURACY (11)
BIOLOGICAL SYSTEM MODELING (11)
MATHEMATICAL MODEL (11)
NEURONS (11)
PREDICTIVE MODELS (11)
DETECTORS (10)
IMAGE RECOGNITION (10)
IMAGE RETRIEVAL (10)
COMPUTERS (9)
ENCODING (9)
HIDDEN MARKOV MODELS (9)
IMAGE CODING (9)
SUPPORT VECTOR MACHINES (9)
VOCABULARY (9)
ADAPTATION MODELS (8)
PROBABILISTIC LATENT SEMANTIC ANALYSIS (8)
PROBABILITY (8)
SHAPE (8)
TARGET TRACKING (8)
TESTING (8)
VISUAL ATTENTION (8)
ADAPTATION MODEL (7)
CAMERAS (7)
CLASSIFICATION ALGORITHMS (7)
COMPUTER BASED TRAINING (7)
DATA MINING (7)
DATA VISUALISATION (7)
ESTIMATION (7)
GAMES (7)
HISTOGRAMS (7)
IMAGE REPRESENTATION (7)
KERNEL (7)
MACHINE LEARNING (7)
TRAINING DATA (7)
AUTOMATIC IMAGE ANNOTATION (6)
COMPLEXITY THEORY (6)
CONFERENCES (6)
CONVOLUTION (6)
CORRELATION (6)
HAPTIC INTERFACES (6)
OPTIMIZATION (6)
PATTERN RECOGNITION (6)
REVIEWS (6)
ROBOTS (6)
ROBUSTNESS (6)
ARTIFICIAL INTELLIGENCE (5)
ARTIFICIAL NEURAL NETWORKS (5)
BUILDINGS (5)
DISPLAYS (5)
EDUCATIONAL INSTITUTIONS (5)
IMAGE ANNOTATION (5)
IMAGE PROCESSING (5)
IMAGE RECONSTRUCTION (5)
IMAGE SEQUENCES (5)
LABORATORIES (5)
LIGHTING (5)
MEASUREMENT (5)
NAVIGATION (5)
PLSA (5)
PROBABILISTIC LOGIC (5)
PROPOSALS (5)
PROTOTYPES (5)
SOFTWARE (5)
STANDARDS (5)
UNSUPERVISED LEARNING (5)
ANALYTICAL MODELS (4)
ANIMATION (4)
ATMOSPHERIC MODELING (4)
CLUSTERING ALGORITHMS (4)
COLOR (4)
COMPUTER GRAPHICS (4)
CONTEXT (4)
CONVOLUTIONAL NEURAL NETWORK (4)
DEFORMABLE MODELS (4)
DICTIONARIES (4)
DISTANCE MEASUREMENT (4)
ENGINES (4)
IMAGE EDGE DETECTION (4)
IMAGING (4)
INTERNET (4)
MAINTENANCE ENGINEERING (4)

INFONA - science communication portal
