Search results

Items from 1 to 20 out of 805 results

chapter

A Self-Paced Category-Aware Approach for Unsupervised Adaptation Networks

Wenzhen Huang, Peipei Yang, Kaiqi Huang

2017 IEEE International Conference on Data Mining (ICDM) > 919 - 924

2017 IEEE International Conference on Data Mining (ICDM)

The success of deep neural networks usually relies on a large number of labeled training samples, which unfortunately are not easy to obtain in practice. Unsupervised domain adaptation focuses on the problem where there is no labeled data in the target domain. In this paper, we propose a novel deep unsupervised domain adaptation method that learns transferable features. Different from most existing...

chapter

Reducing Computational Costs of an Embedded Classifier to Determine Leather Quality

Fausto Sampaio, Lucas Costa da Silva, Pedro Pedrosa Reboucas Filho, Elias Teodoro da Silva

2017 VII Brazilian Symposium on Computing Systems Engineering (SBESC) > 211 - 216

2017 VII Brazilian Symposium on Computing Systems Engineering (SBESC)

Embedded computer vision applications have been incorporated in industrial automation, improving quality and safety of processes. Such systems involve pattern classifiers for specific functions that, many times, demand high memory footprint and processing time. This work suggests a strategy to choose GLCM (Gray Level Co-occurrence Matrix) features for an SVM classifier that can reduce computer resources...

chapter

Binarized Convolutional Landmark Localizers for Human Pose Estimation and Face Alignment with Limited Resources

Adrian Bulat, Georgios Tzimiropoulos

2017 IEEE International Conference on Computer Vision (ICCV) > 3726 - 3734

2017 IEEE International Conference on Computer Vision (ICCV)

Our goal is to design architectures that retain the groundbreaking performance of CNNs for landmark localization and at the same time are lightweight, compact and suitable for applications with limited computational resources. To this end, we make the following contributions: (a) we are the first to study the effect of neural network binarization on localization tasks, namely human pose estimation...

chapter

RMPE: Regional Multi-person Pose Estimation

Hao-Shu Fang, Shuqin Xie, Yu-Wing Tai, Cewu Lu

2017 IEEE International Conference on Computer Vision (ICCV) > 2353 - 2362

2017 IEEE International Conference on Computer Vision (ICCV)

Multi-person pose estimation in the wild is challenging. Although state-of-the-art human detectors have demonstrated good performance, small errors in localization and recognition are inevitable. These errors can cause failures for a single-person pose estimator (SPPE), especially for methods that solely depend on human detection results. In this paper, we propose a novel regional multi-person pose...

chapter

Unmasking the Abnormal Events in Video

Radu Tudor Ionescu, Sorina Smeureanu, Bogdan Alexe, Marius Popescu

2017 IEEE International Conference on Computer Vision (ICCV) > 2914 - 2922

2017 IEEE International Conference on Computer Vision (ICCV)

We propose a novel framework for abnormal event detection in video that requires no training sequences. Our framework is based on unmasking, a technique previously used for authorship verification in text documents, which we adapt to our task. We iteratively train a binary classifier to distinguish between two consecutive video sequences while removing at each step the most discriminant features....

chapter

VegFru: A Domain-Specific Dataset for Fine-Grained Visual Categorization

Saihui Hou, Yushan Feng, Zilei Wang

2017 IEEE International Conference on Computer Vision (ICCV) > 541 - 549

2017 IEEE International Conference on Computer Vision (ICCV)

In this paper, we propose a novel domain-specific dataset named VegFru for fine-grained visual categorization (FGVC). While the existing datasets for FGVC are mainly focused on animal breeds or man-made objects with limited labelled data, VegFru is a larger dataset consisting of vegetables and fruits which are closely associated with the daily life of everyone. Aiming at domestic cooking and food...

chapter

Real-Time Single-Shot Brand Logo Recognition

Leonardo Bombonato, Guillermo Camara-Chavez, Pedro Silva

2017 30th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI) > 134 - 140

2017 30th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI)

The amount of data produced every day on the internet increases every day and with the increasing popularity of the social networks the number of published photos are huge, and those pictures contain several implicit or explicit brand logos. Detecting this logos in natural images can provide information about how widespread is a brand, discover unwanted copyright distribution, analyze marketing campaigns,...

chapter

Call detection of driver based on constrained local models

Huasheng Xu, Junhang Ding, Xiujuan Ren, Qingmei Sui

2017 Chinese Automation Congress (CAC) > 1907 - 1910

2017 Chinese Automation Congress (CAC)

In order to reduce the number of accidents caused by the call when the driver was driving, this paper uses the computer vision technology to dectet the behavior of the driver. Based on the constrained local models (CLM) to detect the characteristic changes of the mouth area, combine the HSV color space and the template matching to detect the hand characteristics to judge whether the driver has the...

chapter

Sampling Matters in Deep Embedding Learning

R. Manmatha, Chao-Yuan Wu, Alexander J. Smola, Philipp Krahenbuhl

2017 IEEE International Conference on Computer Vision (ICCV) > 2859 - 2867

2017 IEEE International Conference on Computer Vision (ICCV)

Deep embeddings answer one simple question: How similar are two images? Learning these embeddings is the bedrock of verification, zero-shot learning, and visual search. The most prominent approaches optimize a deep convolutional network with a suitable loss function, such as contrastive loss or triplet loss. While a rich line of work focuses solely on the loss functions, we show in this paper that...

chapter

How Far are We from Solving the 2D & 3D Face Alignment Problem? (and a Dataset of 230,000 3D Facial Landmarks)

Adrian Bulat, Georgios Tzimiropoulos

2017 IEEE International Conference on Computer Vision (ICCV) > 1021 - 1030

2017 IEEE International Conference on Computer Vision (ICCV)

This paper investigates how far a very deep neural network is from attaining close to saturating performance on existing 2D and 3D face alignment datasets. To this end, we make the following 5 contributions: (a) we construct, for the first time, a very strong baseline by combining a state-of-the-art architecture for landmark localization with a state-of-the-art residual block, train it on a very large...

chapter

Approximate Grassmannian Intersections: Subspace-Valued Subspace Learning

Calvin Murdock, Fernando De la Torre

2017 IEEE International Conference on Computer Vision (ICCV) > 4318 - 4326

2017 IEEE International Conference on Computer Vision (ICCV)

Subspace learning is one of the most foundational tasks in computer vision with applications ranging from dimensionality reduction to data denoising. As geometric objects, subspaces have also been successfully used for efficiently representing certain types of invariant data. However, methods for subspace learning from subspace-valued data have been notably absent due to incompatibilities with standard...

chapter

A Mobile Application for Plant Recognition through Deep Learning

Min Gao, Yang Lin, Richard O. Sinnott

2017 IEEE 13th International Conference on e-Science (e-Science) > 29 - 38

2017 IEEE 13th International Conference on e-Science (e-Science)

It is a simple task for humans to visually identify objects. However, computer-based image recognition remains challenging. In this paper we describe an approach for image recognition with specific focus on automated recognition of plants and flowers. The approach taken utilizes deep learning capabilities and unlike other approaches that focus on static images for feature classification, we utilize...

chapter

A minimal convolutional neural network for handwritten digit recognition

Matthew Y. W. Teow

2017 7th IEEE International Conference on System Engineering and Technology (ICSET) > 171 - 176

2017 7th IEEE International Conference on System Engineering and Technology (ICSET)

The contribution of this paper is to bridge the gap on understanding the mathematical structure and the computational implementation of a convolutional neural network using a minimal model. The proposed minimal convolutional neural network is presented using a layering approach. This approach provides a clear understanding of the main mathematical operations in a convolutional neural network. Hence,...

chapter

Person re-identification from CCTV silhouettes using generic fourier descriptor

Rawabi Alsedais, Richard Guest

2017 International Carnahan Conference on Security Technology (ICCST) > 1 - 6

2017 International Carnahan Conference on Security Technology (ICCST)

Person re-identification in public areas (such as airports, train stations and shopping malls) has recently received increased attention within computer vision research due, in part, to the demand for enhanced levels of security. Re-identifying subjects within non-overlapped camera networks can be considered as a challenging task. Illumination changes in different scenes, variations in camera resolutions,...

chapter

Deep Functional Maps: Structured Prediction for Dense Shape Correspondence

Or Litany, Tal Remez, Emanuele Rodola, Alex Bronstein, more

2017 IEEE International Conference on Computer Vision (ICCV) > 5660 - 5668

2017 IEEE International Conference on Computer Vision (ICCV)

We introduce a new framework for learning dense correspondence between deformable 3D shapes. Existing learning based approaches model shape correspondence as a labelling problem, where each point of a query shape receives a label identifying a point on some reference domain; the correspondence is then constructed a posteriori by composing the label predictions of two input shapes. We propose a paradigm...

chapter

Domain-Adaptive Deep Network Compression

Marc Masana, Joost van de Weijer, Luis Herranz, Andrew D. Bagdanov, more

2017 IEEE International Conference on Computer Vision (ICCV) > 4299 - 4307

2017 IEEE International Conference on Computer Vision (ICCV)

Deep Neural Networks trained on large datasets can be easily transferred to new domains with far fewer labeled examples by a process called fine-tuning. This has the advantage that representations learned in the large source domain can be exploited on smaller target domains. However, networks designed to be optimal for the source task are often prohibitively large for the target task. In this work...

chapter

Factorized Bilinear Models for Image Recognition

Yanghao Li, Naiyan Wang, Jiaying Liu, Xiaodi Hou

2017 IEEE International Conference on Computer Vision (ICCV) > 2098 - 2106

2017 IEEE International Conference on Computer Vision (ICCV)

Although Deep Convolutional Neural Networks (CNNs) have liberated their power in various computer vision tasks, the most important components of CNN, convolutional layers and fully connected layers, are still limited to linear transformations. In this paper, we propose a novel Factorized Bilinear (FB) layer to model the pairwise feature interactions by considering the quadratic terms in the transformations...

chapter

Curriculum Domain Adaptation for Semantic Segmentation of Urban Scenes

Yang Zhang, Philip David, Boqing Gong

2017 IEEE International Conference on Computer Vision (ICCV) > 2039 - 2049

2017 IEEE International Conference on Computer Vision (ICCV)

During the last half decade, convolutional neural networks (CNNs) have triumphed over semantic segmentation, which is a core task of various emerging industrial applications such as autonomous driving and medical imaging. However, to train CNNs requires a huge amount of data, which is difficult to collect and laborious to annotate. Recent advances in computer graphics make it possible to train CNN...

chapter

Learning Feature Pyramids for Human Pose Estimation

Wei Yang, Shuang Li, Wanli Ouyang, Hongsheng Li, more

2017 IEEE International Conference on Computer Vision (ICCV) > 1290 - 1299

2017 IEEE International Conference on Computer Vision (ICCV)

Articulated human pose estimation is a fundamental yet challenging task in computer vision. The difficulty is particularly pronounced in scale variations of human body parts when camera view changes or severe foreshortening happens. Although pyramid methods are widely used to handle scale changes at inference time, learning feature pyramids in deep convolutional neural networks (DCNNs) is still not...

chapter

Generative Adversarial Networks for Parallel Vision

Wang Kunfeng, Li Xuan, Yan Lan, Wang Fei-Yue

2017 Chinese Automation Congress (CAC) > 7670 - 7675

2017 Chinese Automation Congress (CAC)

Video image dataset is playing an essential role in design and evaluation of traffic vision methods. However, there is a longstanding difficulty that manually collecting and annotating large-scale diversified dataset from real scenes is time-consuming and prone to error. In 2016, we proposed the parallel vision methodology to tackle the issues of conventional vision computing approach in data collection,...

Keywords:
TRAINING
COMPUTER VISION

Publication date

Set your own date range

Content availability

Available (796)
None (9)

Keywords

FEATURE EXTRACTION (347)
ACCURACY (124)
COMPUTATIONAL MODELING (118)
PATTERN RECOGNITION (117)
OBJECT DETECTION (116)
SUPPORT VECTOR MACHINES (109)
IMAGE CLASSIFICATION (107)
VISUALIZATION (107)
CLASSIFICATION ALGORITHMS (101)
IMAGE SEGMENTATION (101)
IMAGE RECOGNITION (99)
DATABASES (91)
SHAPE (88)
FACE RECOGNITION (85)
LEARNING (ARTIFICIAL INTELLIGENCE) (85)
IMAGE COLOR ANALYSIS (83)
DETECTORS (79)
CAMERAS (77)
CONFERENCES (77)
FACE (76)
PIXEL (75)
TESTING (73)
ARTIFICIAL NEURAL NETWORKS (70)
MACHINE LEARNING (70)
OBJECT RECOGNITION (70)
DATA MINING (69)
HISTOGRAMS (69)
ROBUSTNESS (68)
NEURAL NETWORKS (66)
PRINCIPAL COMPONENT ANALYSIS (65)
ESTIMATION (62)
HUMANS (61)
KERNEL (61)
IMAGE EDGE DETECTION (56)
ALGORITHM DESIGN AND ANALYSIS (55)
IMAGE PROCESSING (53)
IMAGE MOTION ANALYSIS (49)
MATHEMATICAL MODEL (45)
LIGHTING (44)
SIGNAL PROCESSING (44)
HIDDEN MARKOV MODELS (41)
COMPUTER ARCHITECTURE (39)
EQUATIONS (37)
TRAINING DATA (37)
OPTIMIZATION (36)
VECTORS (35)
REAL TIME SYSTEMS (34)
TRANSFORMS (34)
BOOSTING (32)
CONVOLUTION (32)
IMAGE REPRESENTATION (32)
IMAGE SEQUENCES (32)
MACHINE VISION (32)
IMAGE RESOLUTION (31)
CORRELATION (29)
NEURAL NETS (29)
SUPPORT VECTOR MACHINE (29)
VEHICLES (29)
COMPUTERS (28)
IMAGE RECONSTRUCTION (28)
OPTICAL IMAGING (28)
VIDEOS (28)
SEMANTICS (27)
SIGNAL PROCESSING ALGORITHMS (27)
TRACKING (27)
VIDEO SIGNAL PROCESSING (27)
STANDARDS (26)
NEURONS (25)
NOISE (25)
SUPPORT VECTOR MACHINE CLASSIFICATION (25)
THREE DIMENSIONAL DISPLAYS (25)
DATA MODELS (23)
EIGENVALUES AND EIGENFUNCTIONS (23)
IMAGE COLOUR ANALYSIS (23)
POSE ESTIMATION (23)
ARTIFICIAL INTELLIGENCE (22)
DEEP LEARNING (22)
FACE DETECTION (22)
TARGET TRACKING (22)
CLUSTERING ALGORITHMS (21)
IMAGE MATCHING (21)
DICTIONARIES (20)
PEDESTRIAN DETECTION (20)
GESTURE RECOGNITION (19)
IMAGE RETRIEVAL (19)
PATTERN CLASSIFICATION (19)
SOLID MODELING (19)
VIDEO SEQUENCES (19)
ADABOOST (18)
COMPLEXITY THEORY (18)
DECISION TREES (18)
EDUCATIONAL INSTITUTIONS (18)
VISUAL DATABASES (18)
ANALYTICAL MODELS (17)
GEOMETRY (17)
INSPECTION (17)
MEASUREMENT (17)
SVM (17)
more

INFONA - science communication portal

Search results

A Self-Paced Category-Aware Approach for Unsupervised Adaptation Networks

Reducing Computational Costs of an Embedded Classifier to Determine Leather Quality

Binarized Convolutional Landmark Localizers for Human Pose Estimation and Face Alignment with Limited Resources

RMPE: Regional Multi-person Pose Estimation

Unmasking the Abnormal Events in Video

VegFru: A Domain-Specific Dataset for Fine-Grained Visual Categorization

Real-Time Single-Shot Brand Logo Recognition

Call detection of driver based on constrained local models

Sampling Matters in Deep Embedding Learning

How Far are We from Solving the 2D & 3D Face Alignment Problem? (and a Dataset of 230,000 3D Facial Landmarks)

Approximate Grassmannian Intersections: Subspace-Valued Subspace Learning

A Mobile Application for Plant Recognition through Deep Learning

A minimal convolutional neural network for handwritten digit recognition

Person re-identification from CCTV silhouettes using generic fourier descriptor

Deep Functional Maps: Structured Prediction for Dense Shape Correspondence

Domain-Adaptive Deep Network Compression

Factorized Bilinear Models for Image Recognition

Curriculum Domain Adaptation for Semantic Segmentation of Urban Scenes

Learning Feature Pyramids for Human Pose Estimation

Generative Adversarial Networks for Parallel Vision

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options