Search results

Items from 1 to 20 out of 844 results

chapter

Visual positioning system for automated indoor/outdoor navigation

S Anup, Abhinav Goel, Suresh Padmanabhan

TENCON 2017 - 2017 IEEE Region 10 Conference > 1027 - 1031

TENCON 2017 - 2017 IEEE Region 10 Conference

This paper proposes a proof-of-concept for a novel automated indoor/outdoor navigation system. Our proposed method shall enable an object/user equipped to be able to navigate through closed environments using an automatically generated Spatial Map Graph (SMG) with the aid of pre-placed visual markers. The system is robust to dynamically changing complex environments, through adaptive reconfigurations...

chapter

Dual-Glance Model for Deciphering Social Relationships

Junnan Li, Yongkang Wong, Qi Zhao, Mohan S. Kankanhalli

2017 IEEE International Conference on Computer Vision (ICCV) > 2669 - 2678

2017 IEEE International Conference on Computer Vision (ICCV)

Since the beginning of early civilizations, social relationships derived from each individual fundamentally form the basis of social structure in our daily life. In the computer vision literature, much progress has been made in scene understanding, such as object detection and scene parsing. Recent research focuses on the relationship between objects based on its functionality and geometrical relations...

chapter

Video Fill In the Blank Using LR/RL LSTMs with Spatial-Temporal Attentions

Amir Mazaheri, Dong Zhang, Mubarak Shah

2017 IEEE International Conference on Computer Vision (ICCV) > 1416 - 1425

2017 IEEE International Conference on Computer Vision (ICCV)

Given a video and a description sentence with one missing word, “source sentence”, Video-Fill-In-the-Blank (VFIB) problem is to find the missing word automatically. The contextual information of the sentence, as well as visual cues from the video, are important to infer the missing word accurately. Since the source sentence is broken into two fragments: the sentence’s left fragment (before the blank)...

chapter

VegFru: A Domain-Specific Dataset for Fine-Grained Visual Categorization

Saihui Hou, Yushan Feng, Zilei Wang

2017 IEEE International Conference on Computer Vision (ICCV) > 541 - 549

2017 IEEE International Conference on Computer Vision (ICCV)

In this paper, we propose a novel domain-specific dataset named VegFru for fine-grained visual categorization (FGVC). While the existing datasets for FGVC are mainly focused on animal breeds or man-made objects with limited labelled data, VegFru is a larger dataset consisting of vegetables and fruits which are closely associated with the daily life of everyone. Aiming at domestic cooking and food...

chapter

Development of a versatile assistive system for the visually impaired based on sensor fusion

Nicolae Botezatu, Simona Caraiman, Dariusz Rzeszotarski, Pawel Strumillo

2017 21st International Conference on System Theory, Control and Computing (ICSTCC) > 540 - 547

2017 21st International Conference on System Theory, Control and Computing (ICSTCC)

In this paper we describe the 3D acquisition component integrated in the Sound of Vision (SoV) system. SoV is a computer vision based sensory substitution device (SSD) for the visually impaired. Its main objective is to provide the users with a 3D representation of the environment around them, conveyed by means of the hearing and tactile senses. One of the biggest challenges for the SoV system is...

chapter

[POSTER] Depth Map Interpolation Using Perceptual Loss

Ilya Makarov, Vladimir Aliev, Olga Gerasimova, Pavel Polyakov

2017 IEEE International Symposium on Mixed and Augmented Reality (ISMAR-Adjunct) > 93 - 94

2017 IEEE International Symposium on Mixed and Augmented Reality (ISMAR-Adjunct)

In this paper, we discuss a semi-dense depth map interpolation method based on convolutional neural network. We propose a compact neural network architecture with loss function defined as Euclidean distance in the feature space of VGG-16 neural network used for deep visual recognition. The suggested solution shows state-of-art performance on synthetic and real datasets. Together with LSD-SLAM, the...

chapter

Patched-based deep Boltzmann shape priors for visual tracking

Sanghoon Lee, Ilhong Shin, Eunjun Rhee, Sunghee Lee, more

2017 International Conference on Information and Communication Technology Convergence (ICTC) > 1109 - 1111

2017 International Conference on Information and Communication Technology Convergence (ICTC)

In this paper, we propose a patched-based deep Boltzmann shape priors for visual tracking. The shape priors are generated from deep Boltzmann machine network. The network consists of three layers of hidden and visible units. The generated shapes not only maintain general shapes from a variety of poses, but also entail local modifications with high probability.

chapter

The Mapillary Vistas Dataset for Semantic Understanding of Street Scenes

Gerhard Neuhold, Tobias Ollmann, Samuel Rota Bulo, Peter Kontschieder

2017 IEEE International Conference on Computer Vision (ICCV) > 5000 - 5009

2017 IEEE International Conference on Computer Vision (ICCV)

The Mapillary Vistas Dataset is a novel, large-scale street-level image dataset containing 25000 high-resolution images annotated into 66 object categories with additional, instance-specific labels for 37 classes. Annotation is performed in a dense and fine-grained style by using polygons for delineating individual objects. Our dataset is 5× larger than the total amount of fine annotations for Cityscapes...

chapter

Learning the Latent “Look”: Unsupervised Discovery of a Style-Coherent Embedding from Fashion Images

Wei-Lin Hsiao, Kristen Grauman

2017 IEEE International Conference on Computer Vision (ICCV) > 4213 - 4222

2017 IEEE International Conference on Computer Vision (ICCV)

What defines a visual style? Fashion styles emerge organically from how people assemble outfits of clothing, making them difficult to pin down with a computational model. Low-level visual similarity can be too specific to detect stylistically similar images, while manually crafted style categories can be too abstract to capture subtle style differences. We propose an unsupervised approach to learn...

chapter

When Unsupervised Domain Adaptation Meets Tensor Representations

Hao Lu, Lei Zhang, Zhiguo Cao, Wei Wei, more

2017 IEEE International Conference on Computer Vision (ICCV) > 599 - 608

2017 IEEE International Conference on Computer Vision (ICCV)

Domain adaption (DA) allows machine learning methods trained on data sampled from one distribution to be applied to data sampled from another. It is thus of great practical importance to the application of such methods. Despite the fact that tensor representations are widely used in Computer Vision to capture multi-linear relationships that affect the data, most existing DA methods are applicable...

chapter

Combined approach using artificial vision and neural networks for facial recognition

William Gutierrez Pezoa, Marcela Jamett Dominguez

2017 CHILEAN Conference on Electrical, Electronics Engineering, Information and Communication Technologies (CHILECON) > 1 - 5

2017 CHILEAN Conference on Electrical, Electronics Engineering, Information and Communication Technologies (CHILECON)

An application of artificial vision and artificial neural networks techniques in face recognition, is presented. In order to do that, a set of images (frontal face photos) with different lighting conditions, gestures, accessories and distances is used. A stepwise algorithm allows to achieve a satisfactory results, obtaining the correct identification of images inside and outside the data set.

chapter

A minimal convolutional neural network for handwritten digit recognition

Matthew Y. W. Teow

2017 7th IEEE International Conference on System Engineering and Technology (ICSET) > 171 - 176

2017 7th IEEE International Conference on System Engineering and Technology (ICSET)

The contribution of this paper is to bridge the gap on understanding the mathematical structure and the computational implementation of a convolutional neural network using a minimal model. The proposed minimal convolutional neural network is presented using a layering approach. This approach provides a clear understanding of the main mathematical operations in a convolutional neural network. Hence,...

chapter

HHA-based CNN image features for indoor loop closure detection

Wei Zhang, Guoliang Liu, Guohui Tian

2017 Chinese Automation Congress (CAC) > 4634 - 4639

2017 Chinese Automation Congress (CAC)

Loop closure detection is an important part of visual simultaneous location and mapping (SLAM) system. Most of traditional loop closure detection approaches using hand-crafted features often lack robustness with respect to object occlusions and illumination changes, especially for the complicated indoor environment. Recently, convolutional neural network (CNN) makes a huge impact on many computer...

chapter

Learning Feature Pyramids for Human Pose Estimation

Wei Yang, Shuang Li, Wanli Ouyang, Hongsheng Li, more

2017 IEEE International Conference on Computer Vision (ICCV) > 1290 - 1299

2017 IEEE International Conference on Computer Vision (ICCV)

Articulated human pose estimation is a fundamental yet challenging task in computer vision. The difficulty is particularly pronounced in scale variations of human body parts when camera view changes or severe foreshortening happens. Although pyramid methods are widely used to handle scale changes at inference time, learning feature pyramids in deep convolutional neural networks (DCNNs) is still not...

chapter

Deep Determinantal Point Process for Large-Scale Multi-label Classification

Pengtao Xie, Ruslan Salakhutdinov, Luntian Mou, Eric P. Xing

2017 IEEE International Conference on Computer Vision (ICCV) > 473 - 482

2017 IEEE International Conference on Computer Vision (ICCV)

We study large-scale multi-label classification (MLC) on two recently released datasets: Youtube-8M and Open Images that contain millions of data instances and thousands of classes. The unprecedented problem scale poses great challenges for MLC. First, finding out the correct label subset out of exponentially many choices incurs substantial ambiguity and uncertainty. Second, the large data-size and...

chapter

Computer aided motile sperm counting

Hamza Osman Ilhan, Nizamettin Aydin

2017 Medical Technologies National Congress (TIPTEKNO) > 1 - 5

2017 Medical Technologies National Congress (TIPTEKNO)

The rapid and irregular motion of semen cells makes the counting process of semen difficult in the visual assessment. Therefore, computer based techniques are necessary to evaluate the tests with more accurately. In this paper, an alternative way to the visual assessment technique in spermiogram tests is presented. Analyses are performed on the recorded microscope video images by computer, automatically...

chapter

GPLAC: Generalizing Vision-Based Robotic Skills Using Weakly Labeled Images

Avi Singh, Larry Yang, Sergey Levine

2017 IEEE International Conference on Computer Vision (ICCV) > 5852 - 5861

2017 IEEE International Conference on Computer Vision (ICCV)

We tackle the problem of learning robotic sensorimotor control policies that can generalize to visually diverse and unseen environments. Achieving broad generalization typically requires large datasets, which are difficult to obtain for task-specific interactive processes such as reinforcement learning or learning from demonstration. However, much of the visual diversity in the world can be captured...

chapter

Aesthetic quality assessment of images via Supervised Locality Preserving CCA

Misaki Kanai, Ren Togo, Takahiro Ogawa, Miki Haseyama

2017 IEEE 6th Global Conference on Consumer Electronics (GCCE) > 1 - 2

2017 IEEE 6th Global Conference on Consumer Electronics (GCCE)

Aesthetic quality assessment plays an important role in how people organize large image collections. Many studies on aesthetic quality assessment are based on design of hand-crafted features without considering whether attributes conveyed by images can actually affect image aesthetics. This paper presents an aesthetic quality assessment method which uses new visual features. The proposed method utilizes...

chapter

Generative Adversarial Networks for Parallel Vision

Wang Kunfeng, Li Xuan, Yan Lan, Wang Fei-Yue

2017 Chinese Automation Congress (CAC) > 7670 - 7675

2017 Chinese Automation Congress (CAC)

Video image dataset is playing an essential role in design and evaluation of traffic vision methods. However, there is a longstanding difficulty that manually collecting and annotating large-scale diversified dataset from real scenes is time-consuming and prone to error. In 2016, we proposed the parallel vision methodology to tackle the issues of conventional vision computing approach in data collection,...

chapter

Understanding convolutional neural networks using a minimal model for handwritten digit recognition

Matthew Y. W. Teow

2017 IEEE 2nd International Conference on Automatic Control and Intelligent Systems (I2CACIS) > 167 - 172

2017 IEEE 2nd International Conference on Automatic Control and Intelligent Systems (I2CACIS)

The contribution of this paper is to bridge the gap on understanding the mathematical structure and the computational implementation of a convolutional neural network (CNN) using a minimal model (Minimal CNN). The proposed minimal CNN is presented using a layering approach. This approach provides a concise and accessible understanding of the main mathematical operations of a CNN. Hence, it benefits...

Keywords:
VISUALIZATION
COMPUTER VISION

Publication date

Set your own date range

Content availability

Available (839)
None (5)

Keywords

FEATURE EXTRACTION (276)
CAMERAS (151)
COMPUTATIONAL MODELING (140)
IMAGE COLOR ANALYSIS (125)
TRAINING (107)
OBJECT DETECTION (96)
HUMANS (85)
IMAGE SEGMENTATION (84)
OBJECT RECOGNITION (77)
HISTOGRAMS (76)
PIXEL (73)
CONFERENCES (71)
PATTERN RECOGNITION (68)
DATA MINING (63)
SHAPE (62)
ROBUSTNESS (61)
IMAGE MOTION ANALYSIS (59)
TARGET TRACKING (53)
IMAGE PROCESSING (52)
IMAGE CLASSIFICATION (51)
SEMANTICS (49)
TRACKING (49)
IMAGE EDGE DETECTION (46)
IMAGE RETRIEVAL (45)
IMAGE RECOGNITION (44)
ACCURACY (41)
ESTIMATION (40)
MACHINE VISION (40)
THREE DIMENSIONAL DISPLAYS (39)
DETECTORS (38)
VIDEO SIGNAL PROCESSING (38)
DATABASES (37)
SUPPORT VECTOR MACHINES (37)
MATHEMATICAL MODEL (35)
KERNEL (34)
IMAGE REPRESENTATION (33)
VOCABULARY (33)
IMAGE RESOLUTION (31)
LIGHTING (31)
COMPUTERS (30)
IMAGE COLOUR ANALYSIS (30)
FACE (29)
LEARNING (ARTIFICIAL INTELLIGENCE) (29)
DISTANCE MEASUREMENT (28)
VISUAL TRACKING (28)
ALGORITHM DESIGN AND ANALYSIS (27)
CLASSIFICATION ALGORITHMS (27)
IMAGE MATCHING (27)
CORRELATION (26)
IMAGE SEQUENCES (25)
VISUAL ATTENTION (25)
NAVIGATION (24)
COMPUTER ARCHITECTURE (23)
IMAGE RECONSTRUCTION (23)
MACHINE LEARNING (23)
OPTICAL IMAGING (23)
ROBOTS (23)
THREE-DIMENSIONAL DISPLAYS (23)
VIDEOS (23)
VISUAL PERCEPTION (23)
VEHICLES (22)
NEURAL NETWORKS (21)
VECTORS (21)
COMPUTER GRAPHICS (20)
INSPECTION (20)
STEREO IMAGE PROCESSING (20)
SURVEILLANCE (20)
TRAJECTORY (20)
CLUSTERING ALGORITHMS (19)
EQUATIONS (19)
OBJECT TRACKING (19)
STEREO VISION (19)
BIOLOGICAL SYSTEM MODELING (18)
DATA VISUALISATION (18)
HIDDEN MARKOV MODELS (18)
REAL-TIME SYSTEMS (18)
SENSORS (18)
SOLID MODELING (18)
TRANSFORMS (18)
VIDEO SURVEILLANCE (18)
BRAIN MODELING (17)
CONTEXT (17)
DICTIONARIES (17)
MOBILE ROBOTS (17)
OPTIMIZATION (17)
PARTICLE FILTER (17)
ROBOT VISION (17)
STREAMING MEDIA (17)
COLOR (16)
ELECTRONIC MAIL (16)
ENCODING (16)
ENTROPY (16)
HUMAN VISUAL SYSTEM (16)
NEURONS (16)
OBSERVERS (16)
PARTICLE FILTERING (NUMERICAL METHODS) (16)
REAL TIME SYSTEMS (16)
SALIENCY MAP (16)
more

INFONA - science communication portal

Search results

Visual positioning system for automated indoor/outdoor navigation

Dual-Glance Model for Deciphering Social Relationships

Video Fill In the Blank Using LR/RL LSTMs with Spatial-Temporal Attentions

VegFru: A Domain-Specific Dataset for Fine-Grained Visual Categorization

Development of a versatile assistive system for the visually impaired based on sensor fusion

[POSTER] Depth Map Interpolation Using Perceptual Loss

Patched-based deep Boltzmann shape priors for visual tracking

The Mapillary Vistas Dataset for Semantic Understanding of Street Scenes

Learning the Latent “Look”: Unsupervised Discovery of a Style-Coherent Embedding from Fashion Images

When Unsupervised Domain Adaptation Meets Tensor Representations

Combined approach using artificial vision and neural networks for facial recognition

A minimal convolutional neural network for handwritten digit recognition

HHA-based CNN image features for indoor loop closure detection

Learning Feature Pyramids for Human Pose Estimation

Deep Determinantal Point Process for Large-Scale Multi-label Classification

Computer aided motile sperm counting

GPLAC: Generalizing Vision-Based Robotic Skills Using Weakly Labeled Images

Aesthetic quality assessment of images via Supervised Locality Preserving CCA

Generative Adversarial Networks for Parallel Vision

Understanding convolutional neural networks using a minimal model for handwritten digit recognition

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options