Search results

Items from 1 to 20 out of 2,083 results

chapter

Survey of Visual Feature Extraction Algorithms in a Mars-like Environment

Martin Oelsch, Dominik Van Opdenbosch, Eckehard Steinbach

2017 IEEE International Symposium on Multimedia (ISM) > 322 - 325

2017 IEEE International Symposium on Multimedia (ISM)

This paper presents a performance comparison of several state-of-the-art visual feature extraction algorithms when applied in a poorly-structured environment as found on the planet Mars. So far, no systematic evaluation of feature extraction algorithms in extraterrestrial environments is available. The algorithms in this paper are evaluated using the Devon Island dataset which is said to have one...

chapter

Performance Evaluation of Walking Imagery Training Based on Virtual Environment in Brain-Computer Interfaces

Xiaolu Liu, Shuang Liang, Wenlong Hang, Baiying Lei, more

2017 IEEE International Symposium on Multimedia (ISM) > 25 - 30

2017 IEEE International Symposium on Multimedia (ISM)

Brain-computer interfaces (BCIs) based on motor imagery (MI) have been widely applied for upper limb motor rehabilitation. Because a large number of disabled people need to restore or improve walking ability, it is also important to investigate the use of MI-based BCIs for lower limb motor rehabilitation. The brain activity of lower limb MI is more difficult to detect because of low reliability...

chapter

Deep Image Retrieval Applied on Kotenseki Ancient Japanese Literature

Chairath Sirirattanapol, Yusuke Matsui, Shin'ichi Satoh, Kuninori Matsuda, more

2017 IEEE International Symposium on Multimedia (ISM) > 495 - 499

2017 IEEE International Symposium on Multimedia (ISM)

Kotenseki is a collection of classical and ancient Japanese literature. It is comprised of image books that express Japanese stories by using comic drawings of different characters, such as humans, nature, and animals. To effectively store them for posterity, a search system is important. We propose an efficient CBIR system to assist the users in easily accessing the information and have an enjoyable...

chapter

Blog Article Summarization with Image-Text Alignment Techniques

Wei-Ta Chu, Ming-Chih Kao

2017 IEEE International Symposium on Multimedia (ISM) > 244 - 247

2017 IEEE International Symposium on Multimedia (ISM)

We propose an image-text alignment framework to match images with text, and take blog article summarization as the main application. Objects in an image are first detected; deep features are then extracted from them and transformed into a space shared with the text. On the other hand, sentences of a blog article are represented as vectors, and are also embedded into the common space. With these...

chapter

Hyper-Feature Based Tracking with the Fully-Convolutional Siamese Network

Yangliu Kuai, Gongjian Wen, Dongdong Li

2017 International Conference on Digital Image Computing: Techniques and Applications (DICTA) > 1 - 7

2017 International Conference on Digital Image Computing: Techniques and Applications (DICTA)

Convolutional neural networks (CNNs) have drawn increasing interest in visual tracking, among which the fully-convolutional Siamese network based method (SiamFC) is quite popular due to its competitive performance in both precision and efficiency. Generally, SiamFC captures robust semantics from high-level features in the last layer but ignores detailed spatial features in earlier layers, thus tending to...

chapter

Deformable and Occluded Object Tracking via Graph Learning

Wei Han, Guang-Bin Huang, Dongshun Cui

2017 International Conference on Digital Image Computing: Techniques and Applications (DICTA) > 1 - 8

2017 International Conference on Digital Image Computing: Techniques and Applications (DICTA)

Object deformation and occlusion are ubiquitous problems for visual tracking. Though many efforts have been made to handle object deformation and occlusion, most existing tracking algorithms fail in case of large deformation and severe occlusion. In this paper, we propose a graph learning-based tracking framework to handle both challenges. For each consecutive frame pair, we construct a weighted graph,...

chapter

Seam tracking and welding bead geometry analysis for autonomous welding robot

Luciane B. Soares, Atila A. Weis, Ricardo N. Rodrigues, Paulo L. J. Drews, more

2017 Latin American Robotics Symposium (LARS) and 2017 Brazilian Symposium on Robotics (SBR) > 1 - 6

2017 Latin American Robotics Symposium (LARS) and 2017 Brazilian Symposium on Robotics (SBR)

Welding is known for its laborious work and the hazardous environment in which it takes place, but it is an important process in different industrial scenarios, like the shipbuilding industry. The use of robots has been increasing in recent years, reducing the human interference necessary for the process. This paper proposes a system for automated seam tracking and a geometric welding bead analysis...

chapter

Automatic detection of fruits in coffee crops from aerial images

Gabriel L. A. Carrijo, Danilo E. Oliveira, Gleice A. de Assis, Murillo G. Carneiro, more

2017 Latin American Robotics Symposium (LARS) and 2017 Brazilian Symposium on Robotics (SBR) > 1 - 6

2017 Latin American Robotics Symposium (LARS) and 2017 Brazilian Symposium on Robotics (SBR)

A major challenge in precision agriculture is the detection of fruits in coffee crops in agricultural environments. This paper presents a comparison of four feature sets for detecting the red (mature) fruits on coffee plants. An Unmanned Aerial Vehicle (UAV) is used to obtain high-resolution RGB images of a coffee hall. The proposed methodology enables the extraction of visual features from image regions...

chapter

Development of a 3D printed stethoscope for virtual cardiac auscultation examination training

Tatiana Ortegon, Mario Vargas, Alvaro Uribe-Quevedo, Byron Perez-Gutierrez, more

2017 IEEE Healthcare Innovations and Point of Care Technologies (HI-POCT) > 125 - 128

2017 IEEE Healthcare Innovation Point-of-Care Technologies (HI-POCT)

Cardiac auscultation allows diagnosing the heart by listening to its sounds. Current cardiac auscultation training is seeing a preference towards diagnostics equipment such as the echocardiograph that allows visualizing and listening to the heart to determine how the heart is working, rather than the use of the stethoscope which only provides auditory feedback, resulting in a loss of stethoscope-based...

chapter

Deep affordance learning for single- and multiple-instance object detection

Jian-Gang Wang, Prabhu Shankar Mahendran, Eam-Khwang Teoh

TENCON 2017 - 2017 IEEE Region 10 Conference > 321 - 326

TENCON 2017 - 2017 IEEE Region 10 Conference

Affordance learning, in general, aims to identify the purpose, use, and ways to interact with an object, based on information gained from observing the object. Most existing affordance learning approaches assume the target object has been cropped individually from images. However, the object often cannot be easily separated from others due to occlusion or noise. Actually, two or more neighboring...

chapter

Dataset Selection for Controlling Swarms by Visual Demonstration

Karan Kumar Budhraja, Tim Oates

2017 IEEE International Conference on Data Mining Workshops (ICDMW) > 932 - 941

2017 IEEE International Conference on Data Mining Workshops (ICDMW)

Agent-based modeling is a paradigm of modeling dynamic systems of interacting agents that are individually governed by specified behavioral rules. Training a model of such agents to produce an emergent behavior by specification of the emergent (as opposed to agent) behavior is easier from a demonstration perspective. Without the involvement of manual behavior specification via code or reliance on...

chapter

Convolutional Drift Networks for Video Classification

Dillon Graham, Seyed Hamed Fatemi Langroudi, Christopher Kanan, Dhireesha Kudithipudi

2017 IEEE International Conference on Rebooting Computing (ICRC) > 1 - 8

2017 IEEE International Conference on Rebooting Computing (ICRC)

Analyzing spatio-temporal data like video is a challenging task that requires processing visual and temporal information effectively. Convolutional Neural Networks have shown promise as baseline fixed feature extractors through transfer learning, a technique that helps minimize the training cost on visual information. Temporal information is often handled using hand-crafted features or Recurrent Neural...

chapter

Visually-Aware Fashion Recommendation and Design with Generative Image Models

Wang-Cheng Kang, Chen Fang, Zhaowen Wang, Julian McAuley

2017 IEEE International Conference on Data Mining (ICDM) > 207 - 216

2017 IEEE International Conference on Data Mining (ICDM)

Building effective recommender systems for domains like fashion is challenging due to the high level of subjectivity and the semantic complexity of the features involved (i.e., fashion styles). Recent work has shown that approaches to 'visual' recommendation (e.g. clothing, art, etc.) can be made more accurate by incorporating visual signals directly into the recommendation objective, using 'off-the-shelf'...

chapter

A Lightweight Discriminative Tracker Based on Classification and Similarity

Weinong Wang, Fei Wang, Yu Guo

2017 International Conference on Digital Image Computing: Techniques and Applications (DICTA) > 1 - 8

2017 International Conference on Digital Image Computing: Techniques and Applications (DICTA)

Convolutional neural network (CNN) based trackers have achieved significant performances in tracking recently. Most existing CNN-based trackers regard tracking as a classification or similarity searching problem. The two methods have their respective superiorities and limitations because of different supervised objectives. In this paper, we propose a multi-task CNN for visual tracking, not only fully...

chapter

Learning Robust Visual-Semantic Embeddings

Yao-Hung Hubert Tsai, Liang-Kang Huang, Ruslan Salakhutdinov

2017 IEEE International Conference on Computer Vision (ICCV) > 3591 - 3600

2017 IEEE International Conference on Computer Vision (ICCV)

Many of the existing methods for learning joint embeddings of images and text use only supervised information from paired images and their textual attributes. Taking advantage of the recent success of unsupervised learning in deep neural networks, we propose an end-to-end learning framework that is able to extract more robust multi-modal representations across domains. The proposed method combines representation...

chapter

Learning Multi-attention Convolutional Neural Network for Fine-Grained Image Recognition

Heliang Zheng, Jianlong Fu, Tao Mei, Jiebo Luo

2017 IEEE International Conference on Computer Vision (ICCV) > 5219 - 5227

2017 IEEE International Conference on Computer Vision (ICCV)

Recognizing fine-grained categories (e.g., bird species) highly relies on discriminative part localization and part-based fine-grained feature learning. Existing approaches predominantly solve these challenges independently, while neglecting the fact that part localization (e.g., head of a bird) and fine-grained feature learning (e.g., head shape) are mutually correlated. In this paper, we propose...

chapter

Weakly-Supervised Learning of Visual Relations

Julia Peyre, Ivan Laptev, Cordelia Schmid, Josef Sivic

2017 IEEE International Conference on Computer Vision (ICCV) > 5189 - 5198

2017 IEEE International Conference on Computer Vision (ICCV)

This paper introduces a novel approach for modeling visual relations between pairs of objects. We define a relation as a triplet of the form (subject; predicate; object) where the predicate is typically a preposition (e.g., 'under', 'in front of') or a verb ('hold', 'ride') that links a pair of objects (subject; object). Learning such relations is challenging as the objects have different spatial configurations...

chapter

Sketching with Style: Visual Search with Sketches and Aesthetic Context

John Collomosse, Tu Bui, Michael Wilber, Chen Fang, more

2017 IEEE International Conference on Computer Vision (ICCV) > 2679 - 2687

2017 IEEE International Conference on Computer Vision (ICCV)

We propose a novel measure of visual similarity for image retrieval that incorporates both structural and aesthetic (style) constraints. Our algorithm accepts a query as sketched shape, and a set of one or more contextual images specifying the desired visual aesthetic. A triplet network is used to learn a feature embedding capable of measuring style similarity independent of structure, delivering...

chapter

Stepwise Metric Promotion for Unsupervised Video Person Re-identification

Zimo Liu, Dong Wang, Huchuan Lu

2017 IEEE International Conference on Computer Vision (ICCV) > 2448 - 2457

2017 IEEE International Conference on Computer Vision (ICCV)

The intensive annotation cost and the rich but unlabeled data contained in videos motivate us to propose an unsupervised video-based person re-identification (re-ID) method. We start from two assumptions: 1) different video tracklets typically contain different persons, given that the tracklets are taken at distinct places or with long intervals; 2) within each tracklet, the frames are mostly of the...

chapter

Pixel-Level Matching for Video Object Segmentation Using Convolutional Neural Networks

Jae Shin Yoon, Francois Rameau, Junsik Kim, Seokju Lee, more

2017 IEEE International Conference on Computer Vision (ICCV) > 2186 - 2195

2017 IEEE International Conference on Computer Vision (ICCV)

We propose a novel video object segmentation algorithm based on pixel-level matching using Convolutional Neural Networks (CNN). Our network aims to distinguish the target area from the background on the basis of the pixel-level similarity between two object units. The proposed network represents a target object using features from different depth layers in order to take advantage of both the spatial...

Keywords:
VISUALIZATION
Publication type:
book

Content availability

Available (2,078)
None (5)

Keywords

FEATURE EXTRACTION (665)
SUPPORT VECTOR MACHINES (254)
COMPUTATIONAL MODELING (189)
SEMANTICS (185)
IMAGE COLOR ANALYSIS (175)
ACCURACY (157)
IMAGE CLASSIFICATION (147)
DATA MINING (134)
IMAGE SEGMENTATION (132)
HISTOGRAMS (121)
KERNEL (114)
OBJECT RECOGNITION (114)
LEARNING (ARTIFICIAL INTELLIGENCE) (113)
NEURAL NETWORKS (108)
COMPUTER VISION (107)
TESTING (104)
OBJECT DETECTION (103)
VECTORS (101)
IMAGE RECOGNITION (98)
DATABASES (97)
CAMERAS (96)
IMAGE RETRIEVAL (95)
CORRELATION (92)
DETECTORS (91)
SHAPE (88)
ROBOTS (87)
VOCABULARY (87)
ROBUSTNESS (86)
GAMES (85)
MACHINE LEARNING (84)
ELECTROENCEPHALOGRAPHY (82)
DICTIONARIES (80)
TRAINING DATA (77)
CONTEXT (73)
HAPTIC INTERFACES (73)
VIRTUAL REALITY (73)
HIDDEN MARKOV MODELS (71)
TARGET TRACKING (71)
FACE (69)
CLASSIFICATION ALGORITHMS (68)
THREE-DIMENSIONAL DISPLAYS (68)
HUMANS (64)
DATA MODELS (62)
SOLID MODELING (60)
MEASUREMENT (59)
NEURONS (58)
OPTIMIZATION (58)
TRAJECTORY (57)
ARTIFICIAL NEURAL NETWORKS (55)
IMAGE REPRESENTATION (55)
SPEECH (55)
ENCODING (54)
CONFERENCES (52)
DEEP LEARNING (51)
IMAGE EDGE DETECTION (51)
PREDICTIVE MODELS (50)
STANDARDS (50)
ESTIMATION (49)
FACE RECOGNITION (49)
VIDEOS (49)
EDUCATIONAL INSTITUTIONS (48)
PIXEL (46)
PRINCIPAL COMPONENT ANALYSIS (46)
FORCE (45)
MATHEMATICAL MODEL (45)
ADAPTATION MODELS (44)
COMPUTER ARCHITECTURE (43)
DATA VISUALIZATION (42)
CLUSTERING ALGORITHMS (41)
IMAGE RECONSTRUCTION (40)
JOINTS (39)
NAVIGATION (39)
COMPUTERS (38)
DATA VISUALISATION (38)
MULTIMEDIA COMMUNICATION (38)
CONVOLUTION (37)
PROTOTYPES (36)
ROBOT SENSING SYSTEMS (36)
INTERNET (35)
PATTERN RECOGNITION (35)
SOFTWARE (35)
VIDEO SIGNAL PROCESSING (35)
SPEECH RECOGNITION (34)
PATTERN CLASSIFICATION (32)
PSYCHOLOGY (32)
BUILDINGS (31)
CLASSIFICATION (31)
IMAGE CODING (31)
LABELING (31)
NEURAL NETS (31)
THREE DIMENSIONAL DISPLAYS (31)
VEHICLES (31)
ALGORITHM DESIGN AND ANALYSIS (29)
CONTENT-BASED RETRIEVAL (29)
ELECTRODES (29)
IMAGE RESOLUTION (29)
LEGGED LOCOMOTION (29)
NOISE MEASUREMENT (29)

Data set

ieee (2,082)
Springer (1)

INFONA - science communication portal
