Search results

Items from 1 to 20 out of 114 results

chapter

Increasing CNN Robustness to Occlusions by Reducing Filter Support

Elad Osherov, Michael Lindenbaum

2017 IEEE International Conference on Computer Vision (ICCV) > 550 - 561

Convolutional neural networks (CNNs) provide the current state of the art in visual object classification, but they are far less accurate when classifying partially occluded objects. A straightforward way to improve classification under occlusion conditions is to train the classifier using partially occluded object examples. However, training the network on many combinations of object instances and...

chapter

Understanding convolutional neural networks using a minimal model for handwritten digit recognition

Matthew Y. W. Teow

2017 IEEE 2nd International Conference on Automatic Control and Intelligent Systems (I2CACIS) > 167 - 172

The contribution of this paper is to bridge the gap on understanding the mathematical structure and the computational implementation of a convolutional neural network (CNN) using a minimal model (Minimal CNN). The proposed minimal CNN is presented using a layering approach. This approach provides a concise and accessible understanding of the main mathematical operations of a CNN. Hence, it benefits...

chapter

Generalized Orderless Pooling Performs Implicit Salient Matching

Marcel Simon, Yang Gao, Trevor Darrell, Joachim Denzler, et al.

2017 IEEE International Conference on Computer Vision (ICCV) > 4970 - 4979

Most recent CNN architectures use average pooling as a final feature encoding step. In the field of fine-grained recognition, however, recent global representations like bilinear pooling offer improved performance. In this paper, we generalize average and bilinear pooling to “α-pooling”, allowing for learning the pooling strategy during training. In addition, we present a novel way to visualize decisions...

chapter

Ordered minimum distance bag-of-words approach for aerial object identification

Eren Unlu, Emmanuel Zenou, Nicolas Riviere

2017 14th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS) > 1 - 6

Detecting potential aerial threats like drones with computer vision is of paramount interest for the protection of critical locations. Such a system should efficiently prevent false alarms caused by non-malign objects, such as birds, that intrude into the image plane. In this paper, we propose an improved version of a previously presented Speeded-up Robust Feature Transform (SURF) based...

chapter

Robust visual tracking based on kernelized correlation filters

Min Jiang, Jianyu Shen, Jun Kong, Benxuan Wang

2017 IEEE International Conference on Information and Automation (ICIA) > 110 - 115

Recently, kernelized correlation filter-based trackers have aroused the interest of many researchers and achieved good results in the field of tracking. However, current tracking models based on kernelized correlation filters cannot deal effectively with changes in target appearance and scale. Therefore, in this paper, we intend to solve these two problems and improve the robustness of...

chapter

Kernel Pooling for Convolutional Neural Networks

Yin Cui, Feng Zhou, Jiang Wang, Xiao Liu, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3049 - 3058

Convolutional Neural Networks (CNNs) with Bilinear Pooling, initially in their full form and later using compact representations, have yielded impressive performance gains on a wide range of visual tasks, including fine-grained visual categorization, visual question answering, face recognition, and description of texture and style. The key to their success lies in the spatially invariant modeling...

chapter

Learning Spatial Regularization with Image-Level Supervisions for Multi-label Image Classification

Feng Zhu, Hongsheng Li, Wanli Ouyang, Nenghai Yu, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2027 - 2036

Multi-label image classification is a fundamental but challenging task in computer vision. Great progress has been achieved by exploiting semantic relations between labels in recent years. However, conventional approaches are unable to model the underlying spatial relations between labels in multi-label images, because spatial annotations of the labels are generally not provided. In this paper, we...

chapter

Borrowing Treasures from the Wealthy: Deep Transfer Learning through Selective Joint Fine-Tuning

Weifeng Ge, Yizhou Yu

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 10 - 19

Deep neural networks require a large amount of labeled training data during supervised learning. However, collecting and labeling so much data might be infeasible in many cases. In this paper, we introduce a deep transfer learning scheme, called selective joint fine-tuning, for improving the performance of deep learning tasks with insufficient training data. In this scheme, a target learning task...

chapter

Recognition of Affect in the Wild Using Deep Neural Networks

Dimitrios Kollias, Mihalis A. Nicolaou, Irene Kotsia, Guoying Zhao, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 1972 - 1979

In this paper we utilize the first large-scale "in-the-wild" (Aff-Wild) database, which is annotated in terms of the valence-arousal dimensions, to train and test an end-to-end deep neural architecture for the estimation of continuous emotion dimensions based on visual cues. The proposed architecture is based on jointly training convolutional (CNN) and recurrent neural network (RNN) layers,...

chapter

Kernalised Multi-resolution Convnet for Visual Tracking

Di Wu, Wenbin Zou, Xia Li, Yong Zhao

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 2241 - 2248

Visual tracking is intrinsically a temporal problem. Discriminative Correlation Filters (DCF) have demonstrated excellent performance for high-speed generic visual object tracking. Built upon their seminal work, there has been a plethora of recent improvements relying on a convolutional neural network (CNN) pretrained on ImageNet as a feature extractor for visual tracking. However, most of their works...

chapter

Multi-modal Score Fusion and Decision Trees for Explainable Automatic Job Candidate Screening from Video CVs

Heysem Kaya, Furkan Gurpinar, Albert Ali Salah

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 1651 - 1659

We describe an end-to-end system for explainable automatic job candidate screening from video CVs. In this application, audio, face and scene features are first computed from an input video CV, using rich feature sets. These multiple modalities are fed into modality-specific regressors to predict apparent personality traits and a variable that predicts whether the subject will be invited to the interview...

chapter

The human detection in images using the depth map

Dmitriy Tatarenkov, Dmitry Podolsky

2017 Systems of Signal Synchronization, Generating and Processing in Telecommunications (SINKHROINFO) > 1 - 4

In today's world, the need for autonomous mobile robots and vehicles is increasing. Safe autonomous movement demands reliable and fast detection algorithms. Histogram of Oriented Gradients (HOG) descriptors significantly outperform the existing feature sets for human detection. However, this method produces many type I errors. The number of these errors can be decreased...

chapter

Automatic Privacy Prediction to Accelerate Social Image Sharing

Zhenzhong Kuang, Zongmin Li, Dan Lin, Jianping Fan

2017 IEEE Third International Conference on Multimedia Big Data (BigMM) > 197 - 200

The manual process for privacy setting could be very time-consuming and challenging for common users. By assuming that there are hidden correlations between the visual properties of images (i.e., visual features) or object classes and the privacy settings for image sharing, an effective algorithm is developed in this paper to achieve automatic prediction of image privacy, so that the best-matching...

chapter

One-class slab support vector machine

Victor Fragoso, Walter Scheirer, Joao Hespanha, Matthew Turk

2016 23rd International Conference on Pattern Recognition (ICPR) > 420 - 425

This work introduces the one-class slab SVM (OCSSVM), a one-class classifier that aims at improving the performance of the one-class SVM. The proposed strategy reduces the false positive rate and increases the accuracy of detecting instances from novel classes. To this end, it uses two parallel hyperplanes to learn the normal region of the decision scores of the target class. OCSSVM extends one-class...

chapter

Multimodal fusion of audio, scene, and face features for first impression estimation

Furkan Gurpinar, Heysem Kaya, Albert Ali Salah

2016 23rd International Conference on Pattern Recognition (ICPR) > 43 - 48

Affective computing, particularly emotion and personality trait recognition, is of increasing interest in many research disciplines. The interplay of emotion and personality shows itself in the first impression left on other people. Moreover, the ambient information, e.g. the environment and objects surrounding the subject, also affects these impressions. In this work, we employ pre-trained Deep Convolutional...

chapter

Multi-scale kernelized least squares for visual tracking

Junbin Liu, Weixin Xie, Liangqun Li

2016 IEEE 13th International Conference on Signal Processing (ICSP) > 914 - 918

In order to cope with the complex variation of target appearance during visual tracking, a robust tracking algorithm based on multi-scale kernelized least squares (KLS) is proposed. First, by showing that the dense sampling set of translated patches is circulant, using the well-established theory of circulant matrices, kernelized least squares is efficiently computed with fast Fourier transform (FFT)...

chapter

Scale-adaptive visual tracking with occlusion detection

Yulong Xu, Jiabao Wang, Yang Li, Zhuang Miao, et al.

2016 IEEE 13th International Conference on Signal Processing (ICSP) > 938 - 942

Occlusion is a challenging problem in visual object tracking. Most state-of-the-art trackers may learn the appearance of the occluding target when it becomes occluded by other objects in the scene. This paper proposes a novel approach of detecting occlusion by dividing the target into several patches and computing the peak-to-sidelobe ratio of every response map. Furthermore, our method can calculate...

chapter

A Two-Stage Outdoor - Indoor Scene Classification Framework: Experimental Study for the Outdoor Stage

Mana Shahriari, Robert Bergevin

2016 International Conference on Digital Image Computing: Techniques and Applications (DICTA) > 1 - 8

The state-of-the-art image classification methods require an intensive learning stage and a considerable amount of training images. Recently, with the introduction of these models (and in particular convolutional neural network (CNN)), it is believed that the best solution to achieve a system with high performance on scene classification is to learn deep scene features using CNN. While this can be...

chapter

Complex convolution Kernel for deep networks

Kaizhou Li, Hong Shi, Qinghua Hu

2016 8th International Conference on Wireless Communications & Signal Processing (WCSP) > 1 - 5

Deep Convolutional Neural Network (CNN) is one of the most popular methods for image processing and recognition. There are many research works to improve the performance of CNNs. However, as an important part of CNNs, convolution kernel has rarely been discussed. As one Original Convolution Kernel (OCK) can only detect one type of visual feature with a fixed deformation, the networks using OCKs may...

chapter

Image-based localization using Gaussian Processes

Manuel Lopez-Antequera, Nicolai Petkov, Javier Gonzalez-Jimenez

2016 International Conference on Indoor Positioning and Indoor Navigation (IPIN) > 1 - 7

Visual localization is the process of finding the location of a camera from the appearance of the images it captures. In this work, we propose an observation model that allows the use of images for particle filter localization. To achieve this, we exploit the capabilities of Gaussian Processes to calculate the likelihood of the observation for any given pose, in contrast to methods which restrict...

Data set:
ieee
Keywords:
KERNEL
VISUALIZATION
TRAINING
Publication type:
book

Content availability

Available (113)
None (1)

Keywords

SUPPORT VECTOR MACHINES (53)
FEATURE EXTRACTION (43)
IMAGE CLASSIFICATION (25)
HISTOGRAMS (18)
COMPUTER VISION (14)
LEARNING (ARTIFICIAL INTELLIGENCE) (13)
VECTORS (13)
IMAGE REPRESENTATION (12)
OBJECT RECOGNITION (11)
CORRELATION (10)
OBJECT DETECTION (10)
SEMANTICS (10)
ACCURACY (8)
DATA MINING (8)
OPTIMIZATION (8)
COMPUTATIONAL MODELING (7)
DETECTORS (7)
FACE (7)
IMAGE COLOR ANALYSIS (7)
TARGET TRACKING (7)
IMAGE RETRIEVAL (6)
MEASUREMENT (6)
SHAPE (6)
VOCABULARY (6)
APPROXIMATION METHODS (5)
CONTEXT (5)
DATABASES (5)
IMAGE RECOGNITION (5)
IMAGE SEGMENTATION (5)
INTERNET (5)
MACHINE LEARNING (5)
NEURAL NETWORKS (5)
ROBOTS (5)
ROBUSTNESS (5)
STANDARDS (5)
SUPPORT VECTOR MACHINE (5)
ENCODING (4)
ESTIMATION (4)
PATTERN CLUSTERING (4)
TESTING (4)
VIDEO SIGNAL PROCESSING (4)
VISUAL TRACKING (4)
CAMERAS (3)
CLASSIFICATION ALGORITHMS (3)
CONFERENCES (3)
CONTENT-BASED RETRIEVAL (3)
CONVOLUTION (3)
DECISION TREES (3)
DEEP LEARNING (3)
DICTIONARIES (3)
DISTANCE MEASUREMENT (3)
DOMAIN ADAPTATION (3)
FACE RECOGNITION (3)
IMAGE CATEGORIZATION (3)
IMAGE CODING (3)
IMAGE RECONSTRUCTION (3)
INDEXING (3)
JOINTS (3)
LAYOUT (3)
MANIFOLDS (3)
OBJECT CATEGORIZATION (3)
PATTERN CLASSIFICATION (3)
PIXEL (3)
REGRESSION ANALYSIS (3)
SEARCH ENGINES (3)
SUPERVISED LEARNING (3)
SUPPORT VECTOR MACHINE CLASSIFICATION (3)
TAGGING (3)
TENSILE STRESS (3)
TRAINING DATA (3)
ADAPTATION MODELS (2)
BAG-OF-WORDS MODEL (2)
BENCHMARK TESTING (2)
BIRDS (2)
BOOSTING (2)
BUILDINGS (2)
CATEGORY THEORY (2)
CBIR (2)
CLASSIFICATION (2)
CLUSTERING ALGORITHMS (2)
COLLABORATION (2)
COMPUTATIONAL COMPLEXITY (2)
COMPUTERS (2)
CONVOLUTIONAL NEURAL NETWORK (2)
CORRELATION FILTER (2)
DATA MODELS (2)
ELECTROENCEPHALOGRAPHY (2)
EQUATIONS (2)
FEATURE DESCRIPTORS (2)
GRASPING (2)
HIDDEN MARKOV MODELS (2)
IMAGE EDGE DETECTION (2)
IMAGE MATCHING (2)
IMAGE PROCESSING (2)
INCREMENTAL LEARNING (2)
KERNEL DISCRIMINANT ANALYSIS (2)
LABELED IMAGES (2)