Search results

Items from 41 to 60 out of 1,080 results

chapter

Multi-attention Network for One Shot Learning

Peng Wang, Lingqiao Liu, Chunhua Shen, Zi Huang, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6212 - 6220

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

One-shot learning is a challenging problem where the aim is to recognize a class identified by a single training image. Given the practical importance of one-shot learning, it seems surprising that the rich information present in the class tag itself has largely been ignored. Most existing approaches restrict the use of the class tag to finding similar classes and transferring classifiers or metrics...

chapter

Joint Geometrical and Statistical Alignment for Visual Domain Adaptation

Jing Zhang, Wanqing Li, Philip Ogunbona

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5150 - 5158

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper presents a novel unsupervised domain adaptation method for cross-domain visual recognition. We propose a unified framework that reduces the shift between domains both statistically and geometrically, referred to as Joint Geometrical and Statistical Alignment (JGSA). Specifically, we learn two coupled projections that project the source domain and target domain data into low-dimensional...

chapter

Quality Aware Network for Set to Set Recognition

Yu Liu, Junjie Yan, Wanli Ouyang

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4694 - 4703

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper targets the problem of set to set recognition, which learns the metric between two image sets. Images in each set belong to the same identity. Since images in a set can be complementary, they hopefully lead to higher accuracy in practical applications. However, the quality of each sample cannot be guaranteed, and samples with poor quality will hurt the metric. In this paper, the quality...

chapter

Semantically Consistent Regularization for Zero-Shot Recognition

Pedro Morgado, Nuno Vasconcelos

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 2037 - 2046

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

The role of semantics in zero-shot learning is considered. The effectiveness of previous approaches is analyzed according to the form of supervision provided. While some learn semantics independently, others only supervise the semantic subspace explained by training classes. Thus, the former is able to constrain the whole space but lacks the ability to model semantic correlations. The latter addresses...

chapter

Link the Head to the "Beak": Zero Shot Learning from Noisy Text Description at Part Precision

Mohamed Elhoseiny, Yizhe Zhu, Han Zhang, Ahmed Elgammal

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6288 - 6297

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

In this paper, we study learning visual classifiers from unstructured text descriptions at part precision with no training images. We propose a learning framework that is able to connect text terms to their relevant parts and suppress connections to non-visual text terms without any part-text annotations. For instance, this learning process enables terms like beak to be sparsely linked to the visual...

chapter

MIML-FCN+: Multi-Instance Multi-Label Learning via Fully Convolutional Networks with Privileged Information

Hao Yang, Joey Tianyi Zhou, Jianfei Cai, Yew Soon Ong

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 5996 - 6004

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Multi-instance multi-label (MIML) learning has many interesting applications in computer vision, including multi-object recognition and automatic image tagging. In these applications, additional information such as bounding boxes, image captions and descriptions is often available during the training phase; this is referred to as privileged information (PI). However, as existing works on learning using...

chapter

Captioning Images with Diverse Objects

Subhashini Venugopalan, Lisa Anne Hendricks, Marcus Rohrbach, Raymond Mooney, et al.

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1170 - 1178

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Recent captioning models are limited in their ability to scale and describe concepts unseen in paired image-text corpora. We propose the Novel Object Captioner (NOC), a deep visual semantic captioning model that can describe a large number of object categories not present in existing image-caption datasets. Our model takes advantage of external sources – labeled images from object recognition...

chapter

Finding Tiny Faces

Peiyun Hu, Deva Ramanan

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1522 - 1530

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Though tremendous strides have been made in object recognition, one of the remaining open challenges is detecting small objects. We explore three aspects of the problem in the context of finding small faces: the role of scale invariance, image resolution, and contextual reasoning. While most recognition approaches aim to be scale-invariant, the cues for recognizing a 3px tall face are fundamentally...

chapter

Fine-Grained Recognition of Thousands of Object Categories with Single-Example Training

Leonid Karlinsky, Joseph Shtok, Yochay Tzur, Asaf Tzadok

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 965 - 974

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

We approach the problem of fast detection and recognition of a large number (thousands) of object categories while training on a very limited number of examples, usually one per category. Examples of this task include: (i) detection of retail products, where we have only one studio image of each product available for training, (ii) detection of brand logos, and (iii) detection of 3D objects and their...

chapter

Commonly Uncommon: Semantic Sparsity in Situation Recognition

Mark Yatskar, Vicente Ordonez, Luke Zettlemoyer, Ali Farhadi

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 6335 - 6344

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Semantic sparsity is a common challenge in structured visual classification problems: when the output space is complex, the vast majority of the possible predictions are rarely, if ever, seen in the training set. This paper studies semantic sparsity in situation recognition, the task of producing structured summaries of what is happening in images, including activities, objects and the roles objects...

chapter

Unsupervised Part Learning for Visual Recognition

Ronan Sicre, Yannis Avrithis, Ewa Kijak, Frederic Jurie

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 3116 - 3124

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Part-based image classification aims at representing categories by small sets of learned discriminative parts, upon which an image representation is built. Considered as a promising avenue a decade ago, this direction has been neglected since the advent of deep neural networks. In this context, this paper brings two contributions: first, this work proceeds one step further compared to recent part-based...

chapter

Evaluation of target segmentation on SAR target recognition

Baiyuan Ding, Gongjian Wen, Conghui Ma, Xiaoliang Yang

2017 4th International Conference on Information, Cybernetics and Computational Social Systems (ICCSS) > 663 - 667

2017 4th International Conference on Information, Cybernetics and Computational Social Systems (ICCSS)

Target segmentation of synthetic aperture radar (SAR) images, which often serves as a processing step for SAR target recognition, is one of the challenging problems in SAR image interpretation. Target segmentation tries to separate the target from the background, thus eliminating the interference of background noise or clutter. However, the segmentation may also discard a part of the target characteristics...

chapter

Fuzzy-appearance manifold and fuzzy nearest distance for face recognition on various poses and degraded images

Muhammad Adi Nugroho, Benyamin Kusumoputro

2017 15th International Conference on Quality in Research (QiR) : International Symposium on Electrical and Computer Engineering > 342 - 346

2017 15th International Conference on Quality in Research (QiR) : International Symposium on Electrical and Computer Engineering

This paper introduces an approach to recognizing faces from 3D space in 2D images using fuzzy vector manifolds and nearest distance. We employ fuzzy vectors to help the system minimize the negative effects of noise and image degradation. On the training set, the crisp vector representation of each image is transformed to its fuzzy vector representation using a specific triangle fuzzification method. Then,...

chapter

A deep convolutional neural network, with pre-training, for solar photovoltaic array detection in aerial imagery

Jordan M. Malof, Leslie M. Collins, Kyle Bradbury

2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS) > 874 - 877

IGARSS 2017 - 2017 IEEE International Geoscience and Remote Sensing Symposium

In this work we consider the problem of developing algorithms that automatically identify small-scale solar photovoltaic arrays in high resolution aerial imagery. Such algorithms potentially offer a faster and cheaper solution to collecting small-scale photovoltaic (PV) information, such as their location, capacity, and the energy they produce. Here we build on previous algorithmic work by employing...

chapter

Coin Recognition Method Based on SIFT Algorithm

Jing Xu, Gongliu Yang, Yuanyuan Liu, Jingjia Zhong

2017 4th International Conference on Information Science and Control Engineering (ICISCE) > 229 - 233

2017 4th International Conference on Information Science and Control Engineering (ICISCE)

Coin recognition is one of the most important activities for modern banking and currency processing systems in which machine vision is widely used. The technique at the heart of such systems is object recognition in a digital image. Although it has high recognition speed, the traditional method of coin recognition cannot distinguish coins of similar sizes. This paper presents a method based...

chapter

Headgear recognition by decomposing human images in the thermal infrared spectrum

Brahmastro Kresnaraman, Yasutomo Kawanishi, Daisuke Deguchi, Tomokazu Takahashi, et al.

2017 15th International Conference on Quality in Research (QiR) : International Symposium on Electrical and Computer Engineering > 164 - 168

2017 15th International Conference on Quality in Research (QiR) : International Symposium on Electrical and Computer Engineering

Surveillance systems play a critical role in security and surveillance. A surveillance system with cameras that work in the visible spectrum is sufficient for most cases. However, problems may arise during the night, or in areas with less than ideal illumination conditions. Cameras with thermal infrared technology can be a better option in these situations since they do not rely on illumination to...

chapter

6-DOF object localization by combining monocular vision and robot arm kinematics

Kun Liu, Weiwei Shang, Shuang Du, Shuang Cong

2017 36th Chinese Control Conference (CCC) > 6575 - 6580

2017 36th Chinese Control Conference (CCC)

A robot needs to localize an unknown object before grasping it. When the robot only has a monocular sensor, how can it get the object pose? In this work, we present a method of localizing the 6-DOF pose of a target object using a robotic arm and a hand-mounted monocular camera. The method includes an object recognition and a localization process. The recognition process uses point features on a surface...

chapter

Generalization error of deep neural networks: Role of classification margin and data structure

Jure Sokolic, Raja Giryes, Guillermo Sapiro, Miguel R. D. Rodrigues

2017 International Conference on Sampling Theory and Applications (SampTA) > 147 - 151

2017 International Conference on Sampling Theory and Applications (SampTA)

Understanding the generalization properties of deep learning models is critical for their successful usage in many applications, especially in the regimes where the number of training samples is limited. We study the generalization properties of deep neural networks (DNNs) via the Jacobian matrix of the network. Our analysis is general to arbitrary network structures, types of non-linearities and...

chapter

An efficient deep residual-inception network for multimedia classification

Samira Pouyanfar, Shu-Ching Chen, Mei-Ling Shyu

2017 IEEE International Conference on Multimedia and Expo (ICME) > 373 - 378

2017 IEEE International Conference on Multimedia and Expo (ICME)

Deep learning has led to many breakthroughs in machine perception and data mining. Although deep learning has brought substantial advances in image recognition and natural language processing, very little work has been done in video analysis and semantic event detection. Very deep inception and residual networks have yielded promising results in the 2014 and 2015 ILSVRC challenges,...

chapter

Fine-grained image recognition via weakly supervised click data guided bilinear CNN model

Guangjian Zheng, Min Tan, Jun Yu, Qing Wu, et al.

2017 IEEE International Conference on Multimedia and Expo (ICME) > 661 - 666

2017 IEEE International Conference on Multimedia and Expo (ICME)

The bilinear convolutional neural network (BCNN) model, the state of the art in fine-grained image recognition, fails to distinguish categories with subtle visual differences. We design a novel BCNN model guided by user click data (C-BCNN) to improve performance by capturing both the visual and semantic content of images. Specifically, to deal with the heavy noise in large-scale click data,...

Keywords:
TRAINING
IMAGE RECOGNITION


INFONA - science communication portal
