This paper presents a context-aware object proposal generation method for stereo images. Unlike existing methods, which mostly rely on image-based or depth features to generate object candidates, we propose to incorporate additional geometric and high-level semantic context information into the proposal generation. Our method starts from an initial object proposal set and encodes objectness for each...
In this paper we propose an online multi-task learning algorithm for video concept detection. In particular, we extend the Efficient Lifelong Learning Algorithm (ELLA) in the following ways: a) we solve the objective function of ELLA using quadratic programming instead of solving the Lasso problem, b) we add a new label-based constraint that considers concept correlations, c) we use linear SVMs as...
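Point (a) above replaces ELLA's Lasso solve with quadratic programming. As a minimal hedged sketch of that reformulation (the solver, step sizes, and toy data below are illustrative assumptions, not the paper's implementation): splitting the code vector s = u − v with u, v ≥ 0 turns the L1 penalty into a linear term, yielding a bound-constrained QP solvable by, e.g., projected gradient descent.

```python
import numpy as np

def lasso_qp(A, b, lam=0.1, lr=0.01, steps=2000):
    """Solve min ||A s - b||^2 + lam * ||s||_1 as a bound-constrained QP.

    Split s = u - v with u, v >= 0, which makes the L1 term linear, then
    run projected gradient descent on the resulting quadratic program.
    """
    d = A.shape[1]
    u = np.zeros(d)
    v = np.zeros(d)
    for _ in range(steps):
        r = A @ (u - v) - b      # residual of the quadratic part
        g = 2 * A.T @ r          # its gradient w.r.t. (u - v)
        u = np.maximum(0.0, u - lr * (g + lam))   # project onto u >= 0
        v = np.maximum(0.0, v - lr * (-g + lam))  # project onto v >= 0
    return u - v

# Toy problem: b is generated by a sparse code over the columns of A.
rng = np.random.default_rng(0)
A = rng.standard_normal((20, 5))
s_true = np.array([1.0, 0.0, 0.0, -0.5, 0.0])
b = A @ s_true
s_hat = lasso_qp(A, b, lam=0.05)
```

With a small penalty the recovered code stays close to the sparse ground truth; the same QP structure carries over when the residual term comes from ELLA's shared basis.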
A large number of images are available on online photo-sharing services along with rich meta-data, including tags, groups, and locations. For associating two domains of different modalities, e.g. images and tags, Canonical Correlation Analysis (CCA) and its extended methods are widely used. We employ a more flexible graph embedding method called Cross-Domain Matching Correlation Analysis (CDMCA),...
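As a hedged sketch of the classical CCA baseline mentioned above (not CDMCA itself; the toy "image" and "tag" features below are fabricated for illustration), the canonical correlations can be computed by whitening each view and taking the SVD of the cross-covariance:

```python
import numpy as np

def cca(X, Y, reg=1e-6):
    """Classical CCA: whiten each view, then SVD the cross-covariance.

    Returns the canonical correlations (singular values in [0, 1]).
    """
    X = X - X.mean(axis=0)
    Y = Y - Y.mean(axis=0)
    n = X.shape[0]
    Cxx = X.T @ X / n + reg * np.eye(X.shape[1])
    Cyy = Y.T @ Y / n + reg * np.eye(Y.shape[1])
    Cxy = X.T @ Y / n

    def inv_sqrt(C):
        # Inverse square root of a symmetric PSD matrix via eigendecomposition.
        w, V = np.linalg.eigh(C)
        return V @ np.diag(1.0 / np.sqrt(w)) @ V.T

    M = inv_sqrt(Cxx) @ Cxy @ inv_sqrt(Cyy)
    return np.linalg.svd(M, compute_uv=False)

# Toy "image" and "tag" features sharing one latent dimension z.
rng = np.random.default_rng(0)
z = rng.standard_normal(500)
X = np.column_stack([z, rng.standard_normal(500)])
Y = np.column_stack([z + 0.1 * rng.standard_normal(500), rng.standard_normal(500)])
corrs = cca(X, Y)
```

The shared latent dimension yields one canonical correlation near 1, while the independent noise dimensions correlate only weakly.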
Image classification is a general visual analysis task based on the image content coded by its representation. In this research, we propose an image representation method based on perceptual shape features and their spatial distributions. A natural language processing concept, the N-gram, is adopted to generate a set of perceptual shape visual words for encoding image features. By combining...
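The N-gram step can be sketched minimally as sliding a window over a sequence of shape tokens and histogramming the resulting tuples (the token names here are hypothetical; the paper's actual perceptual shape words are not specified in this excerpt):

```python
from collections import Counter

def shape_ngrams(tokens, n=2):
    """Slide a window of size n over a token sequence to form N-grams."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

# Hypothetical perceptual shape tokens extracted along an image contour.
tokens = ["arc", "line", "corner", "arc", "line"]

bigrams = shape_ngrams(tokens, n=2)
histogram = Counter(bigrams)  # bag-of-N-grams image representation
```

The histogram over such N-grams then plays the role of a visual-word representation for the classifier.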
Which parts of an image evoke emotions in an observer? To answer this question, we introduce a novel problem in computer vision — predicting an Emotion Stimuli Map (ESM), which describes pixel-wise contribution to evoked emotions. Building a new image database, EmotionROI, as a benchmark for predicting the ESM, we find that the regions selected by saliency and objectness detection do not correctly...
Emotional factors usually affect users' preferences for and evaluations of images. Although affective image analysis has attracted increasing attention, three major challenges remain: 1) it is difficult to classify an image into a single emotion type, since different regions within an image can represent different emotions; 2) there is a gap between low-level features and high-level emotions...
Following the exponential deployment of surveillance systems across a wide range of geographic locations, the detection and representation of events have become critical elements in automated surveillance systems. In this paper, we present an extensive ontology framework for representing complex semantic events. The proposed ontology builds on the DOLCE ontology and relies on the linguistic and cognitive...
Convolutional Neural Networks (CNNs), which have nowadays dominated image analysis tasks, constitute feed-forward methods that model increasingly complex data structures and patterns along the subsequent hidden layers of the network. However, the common practice of using the activation features from the last network layer inevitably leads to a visual recognition bottleneck. This is due to the fact...
Recent advances in salient object detection have exploited deep Convolutional Neural Networks (CNNs) to represent high-level semantics; however, due to the presence of convolutional and pooling layers, it is difficult for a CNN to generate saliency maps with sharp boundaries. In this paper, we propose a multi-scale mask-based Fast R-CNN framework which generates a saliency score for each region. Since the...
Crowd video retrieval is an important problem in surveillance video management in the era of big data, e.g., video indexing and browsing. In this paper, we address this issue from the motion-level perspective by using hand-drawn sketches as queries. Motion sketch based crowd video retrieval naturally suffers from challenges in motion-level video indexing and sketch representation. We tackle them by...
This paper presents a novel approach to detecting crowd groups and learning semantic regions with a Gestalt-laws-based similarity. Unlike existing approaches based on optical flow or complete trajectories, our model adopts tracklets as the original input, because they carry more detailed information. Although tracklets do not span the same duration, they are more robust to noise...
In this paper, we propose to use the contexts of superpixels as a prior to improve semantic segmentation within the CRF framework. A graphical model is constructed on over-segmented images. Our main contribution is to take the concept of “superpixel embedding” into consideration, which is formalized as a potential term for optimizing the energy of the whole graph. We also introduce two ways of calculating...
Deep convolutional neural networks (DCNNs) have been employed in many computer vision tasks with great success due to their robustness in feature learning. One of the advantages of DCNNs is their representation robustness to object locations, which is useful for object recognition tasks. However, this also discards spatial information, which is useful when dealing with topological information of the...
In this paper we introduce a novel method for general semantic segmentation that can benefit from the general semantics of a Convolutional Neural Network (CNN). Our segmentation proposes visually and semantically coherent image segments. We use binary encoding of CNN features to overcome the difficulty of clustering in the high-dimensional CNN feature space. These binary codes are very robust against...
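As an illustrative sketch of binarizing high-dimensional features before clustering (the per-dimension median threshold here is an assumption for illustration, not necessarily the paper's encoding), binary codes can be compared cheaply via Hamming distance:

```python
import numpy as np

def binarize(features):
    """Binary-encode features by thresholding each dimension at its median."""
    return (features > np.median(features, axis=0)).astype(np.uint8)

def hamming(a, b):
    """Hamming distance between two binary codes."""
    return int(np.sum(a != b))

# Hypothetical CNN activations: 6 image segments, 8 dimensions each.
rng = np.random.default_rng(0)
feats = rng.standard_normal((6, 8))
codes = binarize(feats)
d = hamming(codes[0], codes[1])
```

Clustering then operates on the compact codes instead of the raw high-dimensional activations, which is where the robustness claimed above comes in.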
Image annotation, or prediction of multiple tags for an image, is a challenging task. Most current algorithms are based on large sets of handcrafted features. Deep convolutional neural networks have recently outperformed humans in image classification, and these networks can be used to extract features highly predictive of an image's tags. In this study, we analyze semantic information in features...
We present an application of the Layer-wise Relevance Propagation (LRP) algorithm to state-of-the-art deep convolutional neural networks and Fisher Vector classifiers to compare the image perception and prediction strategies of both classifiers using visualized heatmaps. LRP is a method to compute scores for individual components of an input image, denoting...
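LRP's core step, redistributing relevance from a layer's output onto its inputs in proportion to each input's contribution, can be sketched for a single linear layer with the epsilon rule (the weights and relevances below are toy values; real LRP applies this layer by layer through the whole network):

```python
import numpy as np

def lrp_linear(x, W, b, R_out, eps=1e-6):
    """Epsilon-rule LRP for one linear layer: redistribute output relevance
    R_out onto the inputs in proportion to each contribution x_i * w_ij
    to the pre-activation z_j."""
    z = x @ W + b                # pre-activations, shape (out,)
    z = z + eps * np.sign(z)     # stabilizer against division by small z
    s = R_out / z                # per-output scaling factors
    return x * (W @ s)           # input relevances, shape (in,)

x = np.array([1.0, 2.0, 0.0])
W = np.array([[1.0, 0.0],
              [0.0, 1.0],
              [1.0, 1.0]])
b = np.zeros(2)
R_out = np.array([3.0, 4.0])     # relevance arriving at the layer output
R_in = lrp_linear(x, W, b, R_out)
```

Note the conservation property: with zero bias, the input relevances sum to the same total as the output relevances, which is what lets the heatmap be read as a decomposition of the classifier's score.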
Visual question answering (VQA) has emerged from great developments in computer vision and natural language processing, and requires deep understanding of images and questions as well as their effective integration. Current works on VQA simply concatenate visual and textual features or compare them via a dot product, which is unable to eliminate the semantic difference between them. We argue to...
The past decade has witnessed remarkable developments in SIFT-based approaches for image retrieval. However, such approaches are inherently insufficient for handling the semantic gap and large viewpoint changes, leading to inferior performance. To address these limitations, this paper extends SIFT-based match kernels by integrating the match functions for SIFT and CNN features. Specifically, a thresholded...
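The thresholded match function is truncated in this excerpt, so as a generic hedged sketch only: a thresholded match kernel typically sums pairwise descriptor similarities that exceed a cutoff, suppressing weak, noisy matches (the cosine similarity and the value of tau here are assumptions, not the paper's definition):

```python
import numpy as np

def thresholded_match_kernel(X, Y, tau=0.8):
    """Sum cosine similarities over descriptor pairs exceeding tau.

    X, Y: (n, d) and (m, d) arrays of local descriptors (e.g. SIFT or
    CNN features), one row per descriptor.
    """
    Xn = X / np.linalg.norm(X, axis=1, keepdims=True)
    Yn = Y / np.linalg.norm(Y, axis=1, keepdims=True)
    S = Xn @ Yn.T                     # all pairwise cosine similarities
    return float(np.sum(S[S > tau]))  # keep only confident matches

# Two tiny descriptor sets: one near-duplicate pair, one unrelated pair.
X = np.array([[1.0, 0.0], [0.0, 1.0]])
Y = np.array([[1.0, 0.1]])
k = thresholded_match_kernel(X, Y)
```

Only the near-duplicate pair survives the threshold, so the kernel value reflects confident correspondences rather than the full similarity mass.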
We propose a salient object detection algorithm via sparse reconstruction determined by multilevel feature learning. There are three stages in our method. First, the test image is successively processed by segmentation and semantic information generation procedures. Second, three kinds of features are extracted at the semantic, global, and local levels for each superpixel to train a random forest regressor,...
In still images, multi-scale regions contain rich information of different granularity. However, only semantically meaningful regions provide auxiliary cues for action recognition. Moreover, regions at different scales contribute differently. Motivated by the two observations, we propose an approach that is composed of three components: 1) detecting semantic region candidates at multiple scales, 2)...