The “semantic gap” problem has greatly limited the development of image classification. The key to this problem is to obtain semantic information from the images. This paper proposes a semantic image feature extraction method that integrates eye-movement information. First, the low-level visual features of the images are extracted. Second, weighted feature vectors of the images are constructed...
Hand-engineered local image features have proven to be effective representations for a variety of high-level visual recognition tasks. But as visual recognition tasks such as scene classification and object detection become more challenging, the semantic gap between low-level features and the concept descriptors of scene images increases. In this paper, we present novel semantic multinomial...
Machine-learning algorithms have shown outstanding image recognition performance for computer vision applications. While these algorithms are modeled to mimic brain-like cognitive abilities, they lack the remarkable energy-efficient processing capability of the brain. Recent studies in neuroscience reveal that the brain resolves the competition among multiple visual stimuli presented simultaneously...
Zero-shot Learning (ZSL) can leverage attributes to recognise unseen instances. However, the training data is limited and cannot adequately discriminate fine-grained classes with similar attributes. In this paper, we propose a complementary procedure that inversely makes use of attributes to infer discriminative visual features for unseen classes. In this way, ZSL is fully converted into conventional...
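For reference, the conventional attribute-based pipeline this abstract builds on can be sketched as a minimal nearest-signature ZSL baseline. This is not the paper's proposed method, and the attribute vectors below are made-up toy values: each sample's predicted attribute vector is matched to the closest unseen-class attribute signature.

```python
import numpy as np

def zsl_nearest_attribute(x_attr_pred, class_signatures):
    """Assign each sample to the unseen class whose attribute signature
    is closest (cosine similarity) to the predicted attribute vector --
    a standard attribute-based ZSL baseline."""
    # Normalise rows so dot products become cosine similarities.
    def norm(m):
        return m / np.linalg.norm(m, axis=1, keepdims=True)
    sims = norm(x_attr_pred) @ norm(class_signatures).T
    return sims.argmax(axis=1)

# Toy example: 2 unseen classes described by 3 attributes.
signatures = np.array([[1.0, 0.0, 1.0],   # class 0
                       [0.0, 1.0, 1.0]])  # class 1
predicted = np.array([[0.9, 0.1, 0.8],    # resembles class 0
                      [0.2, 0.7, 0.9]])   # resembles class 1
print(zsl_nearest_attribute(predicted, signatures))  # → [0 1]
```

The abstract's contribution runs this idea in reverse (attributes inferring discriminative visual features), for which no details are reproduced here.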
Scattered multimedia data such as text, images, audio, and video posted regularly on social media may contain information useful to organizations. However, this information must be derived through some form of analysis, known as Multimodal Sentiment Analysis (MSA), and proper analytic tools for such analysis are lacking. This paper presents a thorough overview of more...
In this paper, a novel semantic segmentation model based on aggregated features and contextual information is proposed. Given an RGB-D image, we train a support vector machine (SVM) to predict initial labels using aggregated features, and then optimize the predicted results using contextual information. For aggregated features, the local features on regions are extracted to capture visual appearance...
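A minimal sketch of the first stage described above, under stated assumptions: the per-region features and labels below are synthetic placeholders, and a hand-rolled linear SVM (Pegasos-style sub-gradient descent on the hinge loss) stands in for whatever SVM implementation and kernel the paper actually uses.

```python
import numpy as np

def train_linear_svm(X, y, lam=0.01, epochs=200):
    """Tiny linear SVM trained by sub-gradient descent on the hinge
    loss (Pegasos-style). Labels y must be in {-1, +1}."""
    w = np.zeros(X.shape[1])
    t = 0
    for _ in range(epochs):
        # Same shuffled visiting order each epoch (fixed seed).
        for i in np.random.default_rng(0).permutation(len(X)):
            t += 1
            eta = 1.0 / (lam * t)
            margin = y[i] * (X[i] @ w)
            w *= 1.0 - eta * lam          # shrink (regularisation step)
            if margin < 1:                # hinge is active: take a step
                w += eta * y[i] * X[i]
    return w

# Toy "regions": 2-D aggregated features with linearly separable labels.
rng = np.random.default_rng(1)
X = rng.normal(size=(100, 2))
y = np.where(X[:, 0] + X[:, 1] > 0, 1, -1)
w = train_linear_svm(X, y)
initial_labels = np.sign(X @ w)           # initial per-region predictions
print(round(float((initial_labels == y).mean()), 2))
```

A later contextual step, as the abstract describes, would then re-optimize regions whose initial labels disagree with their neighbours.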
Deep learning-based models have recently been widely successful at outperforming traditional approaches in several computer vision applications such as image classification, object recognition and action recognition. However, those models are not naturally designed to learn structural information that can be important to tasks such as human pose estimation and structured semantic interpretation of...
Attributes are defined as mid-level image characteristics shared among different categories. These characteristics are well suited to handling classification problems, especially when training data are scarce. In this paper, we design discriminative real-valued attributes by learning nonlinear inductive maps. Our method is based on solving a constrained optimization problem that mixes three criteria:...
We present a novel algorithm for the semantic labeling of photographs shared via social media. Such imagery is diverse, exhibiting high intra-class variation that demands large training-data volumes to learn representative classifiers. Unfortunately, image annotation at scale is noisy, resulting in errors in the training corpus that confound classifier accuracy. We show how evolutionary algorithms may...
Image modality classification categorizes images according to their type. It is an important module in the Open-iSM multimodal (text+image) search engine that retrieves figures from biomedical articles. It is a hierarchical classification in which, at the top level, the input figures are classified into two general categories: regular images (X-ray, CT, MRI, photographs, etc.) vs. illustration images (cartoon...
Multimodal recognition has recently become a more attractive and common method in multimedia information retrieval. In many cases it yields better recognition results than unimodal methods alone. Most current multimodal recognition methods still depend on unimodal recognition results. Therefore, to obtain better recognition performance, it is important to choose suitable features and classification...
Attributes are semantic visual properties shared by objects. They have been shown to improve object recognition and to enhance content-based image search. While attributes are expected to cover multiple categories, e.g. a dalmatian and a whale can both have "smooth skin", we find that the appearance of a single attribute varies quite a bit across categories. Thus, an attribute model learned...
Image search techniques were generally based not on visual features but on the textual annotation of images. Images were first annotated with text and then searched using a text-based approach from traditional database management systems, which is time-consuming and difficult to manage. To overcome this problem, CBIR (Content-Based Image Retrieval) was introduced, which is becoming the hottest research...
Identifying different types of damage is essential in times of natural disasters, when first responders flood the internet with images and texts, often annotated, and rescue teams are overwhelmed trying to prioritize often-scarce resources. While most efforts in such humanitarian situations rely heavily on human labor and input, we propose in this paper a novel hybrid approach to help...
Movie summarization aims at condensing a full-length movie to a significantly shortened version that still preserves the movie's major semantic content. In this paper, we propose a learning-based movie summarization framework via role-community social network analysis and feature fusion. In our framework, scene-based movie summarization is formulated as a 0–1 knapsack problem, where the scene attention...
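The 0–1 knapsack formulation mentioned above can be sketched with the classic dynamic program: maximize the total attention score of the selected scenes subject to a summary-length budget. The durations and scores below are illustrative numbers, not the paper's data.

```python
def knapsack_scenes(durations, scores, budget):
    """Classic 0-1 knapsack DP; returns (best_score, chosen_indices)."""
    n = len(durations)
    # dp[i][j] = best score using the first i scenes within duration j.
    dp = [[0] * (budget + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        for j in range(budget + 1):
            dp[i][j] = dp[i - 1][j]                     # skip scene i-1
            if durations[i - 1] <= j:                   # or keep it
                cand = dp[i - 1][j - durations[i - 1]] + scores[i - 1]
                if cand > dp[i][j]:
                    dp[i][j] = cand
    # Backtrack to recover which scenes were kept.
    chosen, j = [], budget
    for i in range(n, 0, -1):
        if dp[i][j] != dp[i - 1][j]:
            chosen.append(i - 1)
            j -= durations[i - 1]
    return dp[n][budget], sorted(chosen)

durations = [3, 5, 2, 4]   # scene lengths (e.g. minutes)
scores    = [4, 6, 3, 5]   # scene attention scores
print(knapsack_scenes(durations, scores, 7))  # → (9, [1, 2])
```

The paper's framework would supply the attention scores from its role-community social network analysis and feature fusion; here they are fixed by hand.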
In this paper, we propose a component-based object detection method extended with the fuzzy inference technique. The proposed method detects constituent components of a complex object instead of a whole object in images. For component detection, multiple multi-class support vector machines (SVM) are used in parallel. Each SVM classifies the candidate component using a different low-level image feature...
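A hypothetical sketch of the fuzzy combination step, assuming a Mamdani-style min-AND over component confidences; the paper's actual rule base, membership functions, and component detectors are not given here, so the rule and scores below are invented for illustration.

```python
def fuzzy_object_confidence(component_scores, required):
    """min-AND over the required components' membership degrees:
    each component detector yields a confidence in [0, 1], and the
    object-level confidence is the weakest required component."""
    return min(component_scores[name] for name in required)

scores = {"head": 0.9, "torso": 0.7, "wheels": 0.2}
# Hypothetical rule: pedestrian IF head AND torso
print(fuzzy_object_confidence(scores, ["head", "torso"]))  # → 0.7
```

Detecting components rather than whole objects, as the abstract describes, lets partial occlusion lower one membership degree without zeroing the overall confidence.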
We propose the problem of automated photo album creation from an unordered image collection. The problem is difficult, as it involves a number of complex perceptual tasks that facilitate the selection and ordering of photos to create a compelling visual narrative. To help solve this problem, we collect (and will make available) a new benchmark dataset based on Flickr images, the Flickr Album Dataset, which provides...
Food-related photos have become increasingly popular, due to social networks, food recommendation, and dietary assessment systems. Reliable annotation is essential in those systems, but user-contributed tags are often uninformative and inconsistent, and unconstrained automatic food recognition still has relatively low accuracy. Most works focus on exploiting only the visual content while ignoring...
Multi-concept image query is a multi-label classification challenge. Traditional query methods focus on single-concept queries and use only visual image data without considering the associated textual tag data. In this work, we address the problem of bimodal multi-concept image query, namely retrieving bimodal images with multiple target concepts from the image set. We propose a novel Bimodal Learning...
A comprehensive survey of the scene classification literature based on the pLSA formulation is presented. With the growth of robotics, interest in adopting visual technology has been increasing over the past years. Vision creates the premises for brain processing: our brain receives, and unconsciously processes, a stupendous amount of visual...