Search results

Items from 21 to 40 out of 805 results

chapter

Understanding convolutional neural networks using a minimal model for handwritten digit recognition

Matthew Y. W. Teow

2017 IEEE 2nd International Conference on Automatic Control and Intelligent Systems (I2CACIS) > 167 - 172

The contribution of this paper is to bridge the gap on understanding the mathematical structure and the computational implementation of a convolutional neural network (CNN) using a minimal model (Minimal CNN). The proposed minimal CNN is presented using a layering approach. This approach provides a concise and accessible understanding of the main mathematical operations of a CNN. Hence, it benefits...

chapter

Scale-Adaptive Convolutions for Scene Parsing

Rui Zhang, Sheng Tang, Yongdong Zhang, Jintao Li, et al.

2017 IEEE International Conference on Computer Vision (ICCV) > 2050 - 2058

Many existing scene parsing methods adopt Convolutional Neural Networks with fixed-size receptive fields, which frequently result in inconsistent predictions of large objects and invisibility of small objects. To tackle this issue, we propose a scale-adaptive convolution to acquire flexible-size receptive fields during scene parsing. Through adding a new scale regression layer, we can dynamically infer...

chapter

Deep Metric Learning with Angular Loss

Jian Wang, Feng Zhou, Shilei Wen, Xiao Liu, et al.

2017 IEEE International Conference on Computer Vision (ICCV) > 2612 - 2620

The modern image search system requires semantic understanding of image, and a key yet under-addressed problem is to learn a good metric for measuring the similarity between images. While deep metric learning has yielded impressive performance gains by extracting high level abstractions from image data, a proper objective loss function becomes the central issue to boost the performance. In this paper,...

chapter

Unsupervised Action Discovery and Localization in Videos

Khurram Soomro, Mubarak Shah

2017 IEEE International Conference on Computer Vision (ICCV) > 696 - 705

This paper is the first to address the problem of unsupervised action localization in videos. Given unlabeled data without bounding box annotations, we propose a novel approach that: 1) Discovers action class labels and 2) Spatio-temporally localizes actions in videos. It begins by computing local video features to apply spectral clustering on a set of unlabeled training videos. For each cluster of...

chapter

Open Set Domain Adaptation

Pau Panareda Busto, Juergen Gall

2017 IEEE International Conference on Computer Vision (ICCV) > 754 - 763

When the training and the test data belong to different domains, the accuracy of an object classifier is significantly reduced. Therefore, several algorithms have been proposed in the last years to diminish the so called domain shift between datasets. However, all available evaluation protocols for domain adaptation describe a closed set recognition task, where both domains, namely source and target,...

chapter

Deep Adaptive Image Clustering

Jianlong Chang, Lingfeng Wang, Gaofeng Meng, Shiming Xiang, et al.

2017 IEEE International Conference on Computer Vision (ICCV) > 5880 - 5888

Image clustering is a crucial but challenging task in machine learning and computer vision. Existing methods often ignore the combination between feature learning and clustering. To tackle this problem, we propose Deep Adaptive Clustering (DAC) that recasts the clustering problem into a binary pairwise-classification framework to judge whether pairs of images belong to the same clusters. In DAC, the...

chapter

Range Loss for Deep Face Recognition with Long-Tailed Training Data

Xiao Zhang, Zhiyuan Fang, Yandong Wen, Zhifeng Li, et al.

2017 IEEE International Conference on Computer Vision (ICCV) > 5419 - 5428

Deep convolutional neural networks have achieved significant improvements on face recognition task due to their ability to learn highly discriminative features from tremendous amounts of face images. Many large scale face datasets exhibit long-tail distribution where a small number of entities (persons) have large number of face images while a large number of persons only have very few face samples...

chapter

The application of deep learning in computer vision

Qing Wu, Yungang Liu, Qiang Li, Shaoli Jin, et al.

2017 Chinese Automation Congress (CAC) > 6522 - 6527

As the deep learning exhibits strong advantages in the feature extraction, it has been widely used in the field of computer vision and among others, and gradually replaced traditional machine learning algorithms. This paper first reviews the main ideas of deep learning, and displays several related frequently-used algorithms for computer vision. Afterwards, the current research status of computer...

chapter

Study of object detection based on Faster R-CNN

Bin Liu, Wencang Zhao, Qiaoqiao Sun

2017 Chinese Automation Congress (CAC) > 6233 - 6236

Faster R-CNN (R corresponds to “Region”) which combined the RPN network and the Fast R-CNN network is one of the best ways to object detection of R-CNN series based on deep learning. The proposal obtained by RPN is directly connected to the ROI Pooling layer, which is a framework for CNN to achieve end-to-end object detection. The feasibility of Faster R-CNN implementation of ResNet101 network and...

chapter

Self-Organized Text Detection with Minimal Post-processing via Border Learning

Yue Wu, Prem Natarajan

2017 IEEE International Conference on Computer Vision (ICCV) > 5010 - 5019

In this paper we propose a new solution to the text detection problem via border learning. Specifically, we make four major contributions: 1) We analyze the insufficiencies of the classic non-text and text settings for text detection. 2) We introduce the border class to the text detection problem for the first time, and validate that the decoding process is largely simplified with the help of text...

chapter

Deep Scene Image Classification with the MFAFVNet

Yunsheng Li, Mandar Dixit, Nuno Vasconcelos

2017 IEEE International Conference on Computer Vision (ICCV) > 5757 - 5765

The problem of transferring a deep convolutional network trained for object recognition to the task of scene image classification is considered. An embedded implementation of the recently proposed mixture of factor analyzers Fisher vector (MFA-FV) is proposed. This enables the design of a network architecture, the MFAFVNet, that can be trained in an end to end manner. The new architecture involves...

chapter

Learning Spread-Out Local Feature Descriptors

Xu Zhang, Felix X. Yu, Sanjiv Kumar, Shih-Fu Chang

2017 IEEE International Conference on Computer Vision (ICCV) > 4605 - 4613

We propose a simple, yet powerful regularization technique that can be used to significantly improve both the pairwise and triplet losses in learning local feature descriptors. The idea is that in order to fully utilize the expressive power of the descriptor space, good local feature descriptors should be sufficiently “spread-out” over the space. In this work, we propose a regularization term to maximize...

chapter

Self-Supervised Learning of Pose Embeddings from Spatiotemporal Relations in Videos

Omer Sumer, Tobias Dencker, Bjorn Ommer

2017 IEEE International Conference on Computer Vision (ICCV) > 4308 - 4317

Human pose analysis is presently dominated by deep convolutional networks trained with extensive manual annotations of joint locations and beyond. To avoid the need for expensive labeling, we exploit spatiotemporal relations in training videos for self-supervised learning of pose embeddings. The key idea is to combine temporal ordering and spatial placement estimation as auxiliary tasks for learning...

chapter

Focal Loss for Dense Object Detection

Tsung-Yi Lin, Priya Goyal, Ross Girshick, Kaiming He, et al.

2017 IEEE International Conference on Computer Vision (ICCV) > 2999 - 3007

The highest accuracy object detectors to date are based on a two-stage approach popularized by R-CNN, where a classifier is applied to a sparse set of candidate object locations. In contrast, one-stage detectors that are applied over a regular, dense sampling of possible object locations have the potential to be faster and simpler, but have trailed the accuracy of two-stage detectors thus far. In...

chapter

Curriculum Dropout

Pietro Morerio, Jacopo Cavazza, Riccardo Volpi, Rene Vidal, et al.

2017 IEEE International Conference on Computer Vision (ICCV) > 3564 - 3572

Dropout is a very effective way of regularizing neural networks. Stochastically “dropping out” units with a certain probability discourages over-specific co-adaptations of feature detectors, preventing overfitting and improving network generalization. Besides, Dropout can be interpreted as an approximate model aggregation technique, where an exponential number of smaller networks are averaged in order...

chapter

DualGAN: Unsupervised Dual Learning for Image-to-Image Translation

Zili Yi, Hao Zhang, Ping Tan, Minglun Gong

2017 IEEE International Conference on Computer Vision (ICCV) > 2868 - 2876

Conditional Generative Adversarial Networks (GANs) for cross-domain image-to-image translation have made much progress recently [7, 8, 21, 12, 4, 18]. Depending on the task complexity, thousands to millions of labeled image pairs are needed to train a conditional GAN. However, human labeling is expensive, even impractical, and large quantities of data may not always be available. Inspired by dual...

chapter

Fine-Grained Recognition in the Wild: A Multi-task Domain Adaptation Approach

Timnit Gebru, Judy Hoffman, Li Fei-Fei

2017 IEEE International Conference on Computer Vision (ICCV) > 1358 - 1367

While fine-grained object recognition is an important problem in computer vision, current models are unlikely to accurately classify objects in the wild. These fully supervised models need additional annotated images to classify objects in every new scenario, a task that is infeasible. However, sources such as e-commerce websites and field guides provide annotated images for many classes. In this...

chapter

No Fuss Distance Metric Learning Using Proxies

Yair Movshovitz-Attias, Alexander Toshev, Thomas K. Leung, Sergey Ioffe, et al.

2017 IEEE International Conference on Computer Vision (ICCV) > 360 - 368

We address the problem of distance metric learning (DML), defined as learning a distance consistent with a notion of semantic similarity. Traditionally, for this problem supervision is expressed in the form of sets of points that follow an ordinal relationship – an anchor point x is similar to a set of positive points Y, and dissimilar to a set of negative points Z, and a loss defined over these...

chapter

RankIQA: Learning from Rankings for No-Reference Image Quality Assessment

Xialei Liu, Joost van de Weijer, Andrew D. Bagdanov

2017 IEEE International Conference on Computer Vision (ICCV) > 1040 - 1049

We propose a no-reference image quality assessment (NR-IQA) approach that learns from rankings (RankIQA). To address the problem of limited IQA dataset size, we train a Siamese Network to rank images in terms of image quality by using synthetically generated distortions for which relative image quality is known. These ranked image sets can be automatically generated without laborious human labeling...

chapter

DualNet: Learn Complementary Features for Image Recognition

Saihui Hou, Xu Liu, Zilei Wang

2017 IEEE International Conference on Computer Vision (ICCV) > 502 - 510

In this work we propose a novel framework named Dual-Net aiming at learning more accurate representation for image recognition. Here two parallel neural networks are coordinated to learn complementary features and thus a wider network is constructed. Specifically, we logically divide an end-to-end deep convolutional neural network into two functional parts, i.e., feature extractor and image classifier...

Keywords:
TRAINING
COMPUTER VISION

Content availability

Available (796)
None (9)

Keywords

FEATURE EXTRACTION (347)
ACCURACY (124)
COMPUTATIONAL MODELING (118)
PATTERN RECOGNITION (117)
OBJECT DETECTION (116)
SUPPORT VECTOR MACHINES (109)
IMAGE CLASSIFICATION (107)
VISUALIZATION (107)
CLASSIFICATION ALGORITHMS (101)
IMAGE SEGMENTATION (101)
IMAGE RECOGNITION (99)
DATABASES (91)
SHAPE (88)
FACE RECOGNITION (85)
LEARNING (ARTIFICIAL INTELLIGENCE) (85)
IMAGE COLOR ANALYSIS (83)
DETECTORS (79)
CAMERAS (77)
CONFERENCES (77)
FACE (76)
PIXEL (75)
TESTING (73)
ARTIFICIAL NEURAL NETWORKS (70)
MACHINE LEARNING (70)
OBJECT RECOGNITION (70)
DATA MINING (69)
HISTOGRAMS (69)
ROBUSTNESS (68)
NEURAL NETWORKS (66)
PRINCIPAL COMPONENT ANALYSIS (65)
ESTIMATION (62)
HUMANS (61)
KERNEL (61)
IMAGE EDGE DETECTION (56)
ALGORITHM DESIGN AND ANALYSIS (55)
IMAGE PROCESSING (53)
IMAGE MOTION ANALYSIS (49)
MATHEMATICAL MODEL (45)
LIGHTING (44)
SIGNAL PROCESSING (44)
HIDDEN MARKOV MODELS (41)
COMPUTER ARCHITECTURE (39)
EQUATIONS (37)
TRAINING DATA (37)
OPTIMIZATION (36)
VECTORS (35)
REAL TIME SYSTEMS (34)
TRANSFORMS (34)
BOOSTING (32)
CONVOLUTION (32)
IMAGE REPRESENTATION (32)
IMAGE SEQUENCES (32)
MACHINE VISION (32)
IMAGE RESOLUTION (31)
CORRELATION (29)
NEURAL NETS (29)
SUPPORT VECTOR MACHINE (29)
VEHICLES (29)
COMPUTERS (28)
IMAGE RECONSTRUCTION (28)
OPTICAL IMAGING (28)
VIDEOS (28)
SEMANTICS (27)
SIGNAL PROCESSING ALGORITHMS (27)
TRACKING (27)
VIDEO SIGNAL PROCESSING (27)
STANDARDS (26)
NEURONS (25)
NOISE (25)
SUPPORT VECTOR MACHINE CLASSIFICATION (25)
THREE DIMENSIONAL DISPLAYS (25)
DATA MODELS (23)
EIGENVALUES AND EIGENFUNCTIONS (23)
IMAGE COLOUR ANALYSIS (23)
POSE ESTIMATION (23)
ARTIFICIAL INTELLIGENCE (22)
DEEP LEARNING (22)
FACE DETECTION (22)
TARGET TRACKING (22)
CLUSTERING ALGORITHMS (21)
IMAGE MATCHING (21)
DICTIONARIES (20)
PEDESTRIAN DETECTION (20)
GESTURE RECOGNITION (19)
IMAGE RETRIEVAL (19)
PATTERN CLASSIFICATION (19)
SOLID MODELING (19)
VIDEO SEQUENCES (19)
ADABOOST (18)
COMPLEXITY THEORY (18)
DECISION TREES (18)
EDUCATIONAL INSTITUTIONS (18)
VISUAL DATABASES (18)
ANALYTICAL MODELS (17)
GEOMETRY (17)
INSPECTION (17)
MEASUREMENT (17)
SVM (17)

INFONA - science communication portal
