This paper presents a performance comparison of several state-of-the-art visual feature extraction algorithms when applied in a poorly-structured environment as found on the planet Mars. So far, no systematic evaluation of feature extraction algorithms in extraterrestrial environments is available. The algorithms in this paper are evaluated using the Devon Island dataset which is said to have one...
We present the Region of Interest Autoencoder (ROIAE), a combined supervised and reconstruction model for the automatic visual detection of objects. More specifically, we augment the detection loss function with a reconstruction loss that targets only foreground examples. This allows us to exploit more effectively the information available in the sparsely populated foreground training data used in...
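The foreground-restricted reconstruction loss described above can be sketched in a few lines. The function name and the flat per-ROI error inputs below are hypothetical simplifications for illustration, not the paper's actual implementation:

```python
def roiae_loss(det_loss, recon_errors, is_foreground, alpha=1.0):
    """Combined objective: detection loss plus a reconstruction loss
    that is restricted to foreground ROIs; background reconstruction
    errors are masked out, so sparse foreground data is used fully."""
    fg = [e for e, f in zip(recon_errors, is_foreground) if f]
    recon_loss = sum(fg) / len(fg) if fg else 0.0
    return det_loss + alpha * recon_loss
```

With `alpha` weighting the auxiliary term, the detector's original loss is untouched when no foreground ROIs are present in a batch.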
An open question in facial landmark localization in video is whether one should perform tracking or tracking-by-detection (i.e. face alignment). Tracking produces fittings of high accuracy but is prone to drifting. Tracking-by-detection is drift-free but results in low accuracy fittings. To provide a solution to this problem, we describe the very first, to the best of our knowledge, synergistic approach...
We propose to help weakly supervised object localization for classes where location annotations are not available, by transferring things and stuff knowledge from a source set with available annotations. The source and target classes might share similar appearance (e.g. bear fur is similar to cat fur) or appear against similar background (e.g. horse and sheep appear against grass). To exploit this,...
This paper introduces a novel approach for modeling visual relations between pairs of objects. We call a relation a triplet of the form (subject; predicate; object), where the predicate is typically a preposition (e.g. 'under', 'in front of') or a verb ('hold', 'ride') that links a pair of objects (subject; object). Learning such relations is challenging as the objects have different spatial configurations...
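To illustrate what a spatial configuration between a subject and an object box looks like in practice, here is one common relative-geometry encoding (a hypothetical sketch, not necessarily the features used in this paper), with boxes given as (x, y, w, h):

```python
import math

def spatial_feature(subj_box, obj_box):
    """Relative offset (normalized by the subject's size) and
    log-scale ratios between a subject box and an object box."""
    sx, sy, sw, sh = subj_box
    ox, oy, ow, oh = obj_box
    return ((ox - sx) / sw,        # horizontal offset
            (oy - sy) / sh,        # vertical offset
            math.log(ow / sw),     # relative width
            math.log(oh / sh))     # relative height
```

Such features let a model distinguish, say, 'under' from 'in front of' for the same object pair.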
Multi-person pose estimation in the wild is challenging. Although state-of-the-art human detectors have demonstrated good performance, small errors in localization and recognition are inevitable. These errors can cause failures for a single-person pose estimator (SPPE), especially for methods that solely depend on human detection results. In this paper, we propose a novel regional multi-person pose...
We present Deeply Supervised Object Detector (DSOD), a framework that can learn object detectors from scratch. State-of-the-art object detectors rely heavily on off-the-shelf networks pre-trained on large-scale classification datasets like ImageNet, which incurs learning bias due to the differences in both the loss functions and the category distributions between classification and detection tasks...
Traditional vehicle detectors typically use a single-template model to represent vehicles, which cannot cover vehicles with different aspect ratios. In this paper, we propose a fast and accurate approach for detecting vehicles that combines classification and aspect-ratio regression. The key idea is to extend the boosted decision trees method to estimate a vehicle's aspect ratio during vehicle detection,...
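The joint prediction idea can be sketched as an ensemble whose weak learners each emit both a classification score and an aspect-ratio offset; the callable weak learners below are hypothetical stand-ins for the paper's boosted trees:

```python
def joint_boost_predict(weak_learners, x):
    """Sum the outputs of weak learners that each return a
    (class_score, aspect_ratio_offset) pair, so the ensemble
    jointly scores vehicle presence and regresses aspect ratio."""
    score, ratio = 0.0, 0.0
    for h in weak_learners:
        s, r = h(x)
        score += s
        ratio += r
    return score, ratio
```

A single template box can then be reshaped by the regressed ratio instead of running one model per aspect ratio.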
We propose “Areas of Attention”, a novel attention-based model for automatic image captioning. Our approach models the dependencies between image regions, caption words, and the state of an RNN language model, using three pairwise interactions. In contrast to previous attention-based approaches that associate image regions only to the RNN state, our method allows a direct association between caption...
We introduce scGAN, a novel extension of conditional Generative Adversarial Networks (GAN) tailored for the challenging problem of shadow detection in images. Previous methods for shadow detection focus on learning the local appearance of shadow regions, while using limited local context reasoning in the form of pairwise potentials in a Conditional Random Field. In contrast, the proposed adversarial...
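The adversarial training underlying a conditional GAN can be illustrated with the standard per-example discriminator and generator losses; this scalar sketch uses hypothetical names and is not the paper's exact objective:

```python
import math

def gan_losses(d_real, d_fake, eps=1e-12):
    """Per-example GAN losses: the discriminator is pushed to output
    1 on real shadow masks and 0 on generated ones, while the
    generator is pushed to make d_fake approach 1."""
    d_loss = -(math.log(d_real + eps) + math.log(1.0 - d_fake + eps))
    g_loss = -math.log(d_fake + eps)
    return d_loss, g_loss
```

Conditioning on the input image (not shown here) is what lets the discriminator enforce globally consistent shadow masks rather than purely local decisions.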
We aim for zero-shot localization and classification of human actions in video. Where traditional approaches rely on global attribute or object classification scores for their zero-shot knowledge transfer, our main contribution is a spatial-aware object embedding. To arrive at spatial awareness, we build our embedding on top of freely available actor and object detectors. Relevance of objects is determined...
Detecting pedestrians that are partially occluded remains a challenging problem due to variations and uncertainties of partial occlusion patterns. Following a commonly used framework of handling partial occlusions by part detection, we propose a multi-label learning approach to jointly learn part detectors to capture partial occlusion patterns. The part detectors share a set of decision trees via...
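The idea of part detectors sharing decision trees can be illustrated with a single shared tree whose leaves store one score per part label; the dict-based tree layout below is a hypothetical simplification of the paper's model:

```python
def multilabel_tree_predict(tree, x):
    """Traverse one shared decision tree; each leaf stores a score
    vector with one entry per part detector, so all part detectors
    reuse the same split structure."""
    node = tree
    while "leaf" not in node:
        node = node["left"] if x[node["feat"]] < node["thr"] else node["right"]
    return node["leaf"]
```

Sharing the splits means the cost of evaluating many part detectors is close to the cost of evaluating one.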
The requirement for large amounts of annotated training data has become a common constraint on various deep learning systems. In this paper, we propose a weakly supervised scene text detection method (WeText) that trains robust and accurate scene text detection models by learning from unannotated or weakly annotated data. With a "light" supervised model trained on a small fully annotated dataset,...
We present a novel method for detecting 3D model instances and estimating their 6D poses from RGB data in a single shot. To this end, we extend the popular SSD paradigm to cover the full 6D pose space and train on synthetic model data only. Our approach competes with or surpasses current state-of-the-art methods that leverage RGB-D data on multiple challenging datasets. Furthermore, our method produces...
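A 6D pose is commonly parameterized as a rotation plus a translation; converting a unit quaternion into a rotation matrix, a standard formula rather than anything specific to this paper, looks like:

```python
import math

def quat_to_rot(q):
    """Convert a quaternion (w, x, y, z) to a 3x3 rotation matrix,
    normalizing first so non-unit network outputs are handled."""
    w, x, y, z = q
    n = math.sqrt(w * w + x * x + y * y + z * z)
    w, x, y, z = w / n, x / n, y / n, z / n
    return [
        [1 - 2 * (y * y + z * z), 2 * (x * y - w * z),     2 * (x * z + w * y)],
        [2 * (x * y + w * z),     1 - 2 * (x * x + z * z), 2 * (y * z - w * x)],
        [2 * (x * z - w * y),     2 * (y * z + w * x),     1 - 2 * (x * x + y * y)],
    ]
```

The remaining three degrees of freedom of the 6D pose are the translation vector.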
In this paper, we propose a real-time detection algorithm using an MCT AdaBoost classifier that detects two-wheelers in the blind spot. The proposed algorithm uses a cascade classifier generated by AdaBoost learning based on the MCT feature vector. The MCT AdaBoost classifier is composed of as many weak classifiers as there are pixels in the detection window, and each pixel becomes a weak classifier...
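The MCT (Modified Census Transform) feature the classifier builds on is, in its common form, a 9-bit code per pixel that compares a 3x3 neighborhood against its mean; a sketch assuming that standard definition:

```python
def mct(patch):
    """9-bit Modified Census Transform code for a 3x3 patch: each
    bit is set where the pixel value exceeds the patch mean, making
    the code robust to monotonic illumination changes."""
    cells = [v for row in patch for v in row]
    mean = sum(cells) / len(cells)
    code = 0
    for v in cells:
        code = (code << 1) | (1 if v > mean else 0)
    return code
```

Each pixel's MCT code can then serve as the input to one weak classifier, matching the one-weak-classifier-per-pixel structure described above.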
There is large demand in the area of video surveillance, especially for people detection, which has driven a large increase in research and resources in this field. As training images and annotations are not always available, it is important to consider the cost involved in creating detector models. For example, for elderly-people detection, the detector must take into account...
Action recognition is still a challenging problem. To obtain an effective, compact representation of action sequences, discriminative dictionaries can be learned by sparse coding. However, sparse coding is required in both the training and testing phases of the classifier framework, and the adoption of an l1-norm sparsity constraint on the representation coefficients is also time-consuming...
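The cost of the l1-norm constraint comes from iterative solvers (e.g. ISTA-style methods) whose core step is the soft-thresholding proximal operator; a minimal sketch of that operator:

```python
def soft_threshold(v, lam):
    """Proximal operator of the l1 norm: shrink each coefficient
    toward zero by lam, zeroing any coefficient smaller than lam.
    This is the per-iteration step that makes codes sparse."""
    def shrink(x):
        mag = max(abs(x) - lam, 0.0)
        return mag if x > 0 else -mag if x < 0 else 0.0
    return [shrink(x) for x in v]
```

Because this must run iteratively for every training and test sample, sparse-coding pipelines pay this cost at inference time as well, which motivates the abstract's concern.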
Pedestrian detection is an important topic in object detection. Compared with other object detectors, YOLOv2 achieves high accuracy and fast speed for general object detection; however, its accuracy degrades when detecting crowded pedestrians. In this paper, by incorporating the skip structure of FCN, we tailor the YOLOv2 network to improve accuracy in detecting small pedestrians that appear in groups...
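The passthrough/skip structure that carries fine-grained features forward in YOLOv2 relies on a space-to-depth rearrangement so high-resolution features can be concatenated with low-resolution ones; a sketch of that standard reorg operation (not this paper's full network):

```python
import numpy as np

def space_to_depth(x, block=2):
    """Rearrange an (H, W, C) feature map into
    (H/block, W/block, C*block*block), preserving every value so
    fine spatial detail survives concatenation with coarser maps."""
    h, w, c = x.shape
    x = x.reshape(h // block, block, w // block, block, c)
    x = x.transpose(0, 2, 1, 3, 4)
    return x.reshape(h // block, w // block, c * block * block)
```

For small pedestrians, keeping this fine-grained information is what the skip connection contributes.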
We present an Automatic License Plate Recognition system designed around Convolutional Neural Networks (CNNs) and trained over synthetic plate images. We first design CNNs suitable for plate and character detection, sharing a common architecture and training procedure. Then, we generate synthetic images that account for the varying illumination and pose conditions encountered with real plate images...
Many methods have been proposed to solve the problem of automatic face detection. Due to variations in background, illumination, pose and facial expressions, machine face detection is a complex problem. Recently, deep learning approaches have achieved impressive performance on face detection. In this paper, a model named Multi-Scale Fusion Convolutional...