Search results

Items from 21 to 40 out of 844 results

chapter

Parallel Tracking and Verifying: A Framework for Real-Time and High Accuracy Visual Tracking

Heng Fan, Haibin Ling

2017 IEEE International Conference on Computer Vision (ICCV) > 5487 - 5495

Being intensively studied, visual tracking has seen great recent advances in either speed (e.g., with correlation filters) or accuracy (e.g., with deep features). Real-time and high accuracy tracking algorithms, however, remain scarce. In this paper we study the problem from a new perspective and present a novel parallel tracking and verifying (PTAV) framework, by taking advantage of the ubiquity...

chapter

Non-linear Convolution Filters for CNN-Based Learning

Georgios Zoumpourlis, Alexandros Doumanoglou, Nicholas Vretos, Petros Daras

2017 IEEE International Conference on Computer Vision (ICCV) > 4771 - 4779

In recent years, Convolutional Neural Networks (CNNs) have achieved state-of-the-art performance in image classification. Their architectures have largely drawn inspiration from models of the primate visual system. However, while recent neuroscience research results prove the existence of non-linear operations in the response of complex visual cells, little effort has been devoted to extend...

chapter

Leveraging Weak Semantic Relevance for Complex Video Event Classification

Heng Tao Shen, Chao Li, Jiewei Cao, Zi Huang, et al.

2017 IEEE International Conference on Computer Vision (ICCV) > 3667 - 3676

Existing video event classification approaches suffer from limited human-labeled semantic annotations. Weak semantic annotations can be harvested from Web-knowledge without involving any human interaction. However, such weak annotations are noisy and thus cannot be effectively utilized without distinguishing their reliability. In this paper, we propose a novel approach to automatically maximize the utility...

chapter

What will Happen Next? Forecasting Player Moves in Sports Videos

Panna Felsen, Pulkit Agrawal, Jitendra Malik

2017 IEEE International Conference on Computer Vision (ICCV) > 3362 - 3371

A large number of very popular team sports involve the act of one team trying to score a goal against the other. During this game play, defending players constantly try to predict the next move of the attackers to prevent them from scoring, whereas attackers constantly try to predict the next move of the defenders in order to defy them and score. Such behavior is a prime example of the general human...

chapter

Privacy-Preserving Visual Learning Using Doubly Permuted Homomorphic Encryption

Ryo Yonetani, Vishnu Naresh Boddeti, Kris M. Kitani, Yoichi Sato

2017 IEEE International Conference on Computer Vision (ICCV) > 2059 - 2069

We propose a privacy-preserving framework for learning visual classifiers by leveraging distributed private image data. This framework is designed to aggregate multiple classifiers updated locally using private data and to ensure that no private information about the data is exposed during and after its learning procedure. We utilize a homomorphic cryptosystem that can aggregate the local classifiers...

chapter

Toward Perceptually-Consistent Stereo: A Scanline Study

Jialiang Wang, Daniel Glasner, Todd Zickler

2017 IEEE International Conference on Computer Vision (ICCV) > 1557 - 1565

Two types of information exist in a stereo pair: correlation (matching) and decorrelation (half-occlusion). Vision science has shown that both types of information are used in the visual cortex, and that people can perceive depth even when correlation cues are absent or very weak, a capability that remains absent from most computational stereo systems. As a step toward stereo algorithms that are more...

chapter

Visual Transformation Aided Contrastive Learning for Video-Based Kinship Verification

Hamdi Dibeklioglu

2017 IEEE International Conference on Computer Vision (ICCV) > 2478 - 2487

Automatic kinship verification from facial information is a relatively new and open research problem in computer vision. This paper explores the possibility of learning an efficient facial representation for video-based kinship verification by exploiting the visual transformation between facial appearance of kin pairs. To this end, a Siamese-like coupled convolutional encoder-decoder network is proposed...

chapter

Multi-modal Factorized Bilinear Pooling with Co-attention Learning for Visual Question Answering

Zhou Yu, Jun Yu, Jianping Fan, Dacheng Tao

2017 IEEE International Conference on Computer Vision (ICCV) > 1839 - 1848

Visual question answering (VQA) is challenging because it requires a simultaneous understanding of both the visual content of images and the textual content of questions. The approaches used to represent the images and questions in a fine-grained manner and to fuse these multimodal features play key roles in performance. Bilinear pooling based models have been shown to outperform traditional...

chapter

HydraPlus-Net: Attentive Deep Features for Pedestrian Analysis

Xihui Liu, Haiyu Zhao, Maoqing Tian, Lu Sheng, et al.

2017 IEEE International Conference on Computer Vision (ICCV) > 350 - 359

Pedestrian analysis plays a vital role in intelligent video surveillance and is a key component for security-centric computer vision systems. Although convolutional neural networks are remarkable at learning discriminative features from images, the learning of comprehensive features of pedestrians for fine-grained tasks remains an open problem. In this study, we propose a new attention-based...

chapter

DualNet: Learn Complementary Features for Image Recognition

Saihui Hou, Xu Liu, Zilei Wang

2017 IEEE International Conference on Computer Vision (ICCV) > 502 - 510

In this work we propose a novel framework named Dual-Net aiming at learning more accurate representation for image recognition. Here two parallel neural networks are coordinated to learn complementary features and thus a wider network is constructed. Specifically, we logically divide an end-to-end deep convolutional neural network into two functional parts, i.e., feature extractor and image classifier...

chapter

Unsupervised Learning from Video to Detect Foreground Objects in Single Images

Ioana Croitoru, Simion-Vlad Bogolin, Marius Leordeanu

2017 IEEE International Conference on Computer Vision (ICCV) > 4345 - 4353

Unsupervised learning from visual data is one of the most difficult challenges in computer vision. It is essential for understanding how visual recognition works. Learning from unsupervised input has an immense practical value, as huge quantities of unlabeled videos can be collected at low cost. Here we address the task of unsupervised learning to detect and segment foreground objects in single images...

chapter

Sparse decomposition of convolutional features for scene recognition

Lin Xie, Feifei Lee, Yan Yan, Qiu Chen

2017 2nd IEEE International Conference on Computational Intelligence and Applications (ICCIA) > 345 - 348

Scene recognition is an important and challenging problem in the field of computer vision owing to the variations in the same class and the similarities between different classes. This paper presents a novel approach that learns a reasonable dictionary from convolutional features to effectively describe the distinctive and shared properties in scene images. Substantial convolution operations in Deep...

chapter

A grey relational analysis based evaluation metric for image captioning and video captioning

Miao Ma, Bolong Wang

2017 International Conference on Grey Systems and Intelligent Services (GSIS) > 76 - 81

Aiming at the performance evaluation of image captioning and video captioning, this paper discusses the existing performance metrics and then suggests a novel overall performance metric based on grey relational analysis of Grey System Theory. In our metric, all the available performance metrics of each captioning model are used to extract a comparative sequence. Meanwhile, a reference sequence is constructed...

chapter

Readability of the gaze and expressions of a robot museum visitor: Impact of the low level sensory-motor control

Aliaa Moualla, Ali Karaouzene, Sofiane Boucenna, Denis Vidal, et al.

2017 26th IEEE International Symposium on Robot and Human Interactive Communication (RO-MAN) > 712 - 719

In this paper we propose a neural network allowing a mobile robot to learn artwork appreciation. The learning is based on the social referencing approach. The robot acquires its knowledge (artificial taste) from the interaction with humans. We present and analyze specifically the visual system, its impact on the robot behavior, and at the end, we analyze the readability of our robot behavior according...

chapter

A fast algorithm based on human visual system for abnormal event detection

Fengchang Fei, Zhijun Fang, Lei Shu

2017 International Conference on Computer, Information and Telecommunication Systems (CITS) > 185 - 189

Fast abnormal event detection algorithms have high application value, but it is difficult to select an appropriate feature representation to realize fast abnormal event detection. In view of HVS's dual pulse propagation theory and computational complexity, LBP and OF are used as temporal and spatial feature representations of video in this paper. Since human understanding involves the abstraction of the...

chapter

Architecture of vision systems with several fields of view as a part of information support of mobile systems

Sergey M. Sokolov, Andrey A. Boguslavsky

2017 19th International Conference on Transparent Optical Networks (ICTON) > 1 - 9

A unified software architecture for constructing real-time vision systems (VS) with several fields of view is described. The proposed architecture supports both onboard and stationary usage. In stationary usage, it determines the location and movement trajectory of objects in the ordered coordinate system. Principles of open architecture, componential technology and use of standard...

chapter

Self-Supervised Learning of Visual Features through Embedding Images into Text Topic Spaces

Lluis Gomez, Yash Patel, Marcal Rusinol, Dimosthenis Karatzas, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2017 - 2026

End-to-end training from scratch of current deep architectures for new computer vision problems would require Imagenet-scale datasets, and this is not always possible. In this paper we present a method that is able to take advantage of freely available multi-modal content to train computer vision algorithms without human supervision. We put forward the idea of performing self-supervised learning of...

chapter

Deep Unsupervised Similarity Learning Using Partially Ordered Sets

Miguel A. Bautista, Artsiom Sanakoyeu, Bjorn Ommer

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1923 - 1932

Unsupervised learning of visual similarities is of paramount importance to computer vision, particularly due to lacking training data for fine-grained similarities. Deep learning of similarities is often based on relationships between pairs or triplets of samples. Many of these relations are unreliable and mutually contradicting, implying inconsistencies when trained without supervision information...

chapter

Weakly Supervised Dense Video Captioning

Zhiqiang Shen, Jianguo Li, Zhou Su, Minjun Li, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5159 - 5167

This paper focuses on a novel and challenging vision task, dense video captioning, which aims to automatically describe a video clip with multiple informative and diverse caption sentences. The proposed method is trained without explicit annotation of fine-grained sentence to video region-sequence correspondence, but is only based on weak video-level sentence annotations. It differs from existing...

chapter

Scene Parsing through ADE20K Dataset

Bolei Zhou, Hang Zhao, Xavier Puig, Sanja Fidler, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5122 - 5130

Scene parsing, or recognizing and segmenting objects and stuff in an image, is one of the key problems in computer vision. Despite the community's efforts in data collection, there are still few image datasets covering a wide range of scenes and object categories with dense and detailed annotations for scene parsing. In this paper, we introduce and analyze the ADE20K dataset, spanning diverse annotations...

Keywords:
VISUALIZATION
COMPUTER VISION

Content availability

Available (839)
None (5)

Keywords

FEATURE EXTRACTION (276)
CAMERAS (151)
COMPUTATIONAL MODELING (140)
IMAGE COLOR ANALYSIS (125)
TRAINING (107)
OBJECT DETECTION (96)
HUMANS (85)
IMAGE SEGMENTATION (84)
OBJECT RECOGNITION (77)
HISTOGRAMS (76)
PIXEL (73)
CONFERENCES (71)
PATTERN RECOGNITION (68)
DATA MINING (63)
SHAPE (62)
ROBUSTNESS (61)
IMAGE MOTION ANALYSIS (59)
TARGET TRACKING (53)
IMAGE PROCESSING (52)
IMAGE CLASSIFICATION (51)
SEMANTICS (49)
TRACKING (49)
IMAGE EDGE DETECTION (46)
IMAGE RETRIEVAL (45)
IMAGE RECOGNITION (44)
ACCURACY (41)
ESTIMATION (40)
MACHINE VISION (40)
THREE DIMENSIONAL DISPLAYS (39)
DETECTORS (38)
VIDEO SIGNAL PROCESSING (38)
DATABASES (37)
SUPPORT VECTOR MACHINES (37)
MATHEMATICAL MODEL (35)
KERNEL (34)
IMAGE REPRESENTATION (33)
VOCABULARY (33)
IMAGE RESOLUTION (31)
LIGHTING (31)
COMPUTERS (30)
IMAGE COLOUR ANALYSIS (30)
FACE (29)
LEARNING (ARTIFICIAL INTELLIGENCE) (29)
DISTANCE MEASUREMENT (28)
VISUAL TRACKING (28)
ALGORITHM DESIGN AND ANALYSIS (27)
CLASSIFICATION ALGORITHMS (27)
IMAGE MATCHING (27)
CORRELATION (26)
IMAGE SEQUENCES (25)
VISUAL ATTENTION (25)
NAVIGATION (24)
COMPUTER ARCHITECTURE (23)
IMAGE RECONSTRUCTION (23)
MACHINE LEARNING (23)
OPTICAL IMAGING (23)
ROBOTS (23)
THREE-DIMENSIONAL DISPLAYS (23)
VIDEOS (23)
VISUAL PERCEPTION (23)
VEHICLES (22)
NEURAL NETWORKS (21)
VECTORS (21)
COMPUTER GRAPHICS (20)
INSPECTION (20)
STEREO IMAGE PROCESSING (20)
SURVEILLANCE (20)
TRAJECTORY (20)
CLUSTERING ALGORITHMS (19)
EQUATIONS (19)
OBJECT TRACKING (19)
STEREO VISION (19)
BIOLOGICAL SYSTEM MODELING (18)
DATA VISUALISATION (18)
HIDDEN MARKOV MODELS (18)
REAL-TIME SYSTEMS (18)
SENSORS (18)
SOLID MODELING (18)
TRANSFORMS (18)
VIDEO SURVEILLANCE (18)
BRAIN MODELING (17)
CONTEXT (17)
DICTIONARIES (17)
MOBILE ROBOTS (17)
OPTIMIZATION (17)
PARTICLE FILTER (17)
ROBOT VISION (17)
STREAMING MEDIA (17)
COLOR (16)
ELECTRONIC MAIL (16)
ENCODING (16)
ENTROPY (16)
HUMAN VISUAL SYSTEM (16)
NEURONS (16)
OBSERVERS (16)
PARTICLE FILTERING (NUMERICAL METHODS) (16)
REAL TIME SYSTEMS (16)
SALIENCY MAP (16)

INFONA - science communication portal
