We present a method to reconstruct the three-dimensional shape of a moving instance of a known object category in video data. We exploit state-of-the-art semantic segmentation techniques to extract the object's two-dimensional shape in each frame. As a result, our method is robust to occlusion, handles stationary objects, and extends naturally to multiple video sequences. We apply Structure from Motion...
Modern young people (“digital natives”) have grown up in an era dominated by new technologies, where communication happens in near real time and poses no limits on establishing relationships with other people or communities. However, the speed of this evolution does not allow young people to consciously distinguish acceptable behaviors from potentially harmful ones, and a new phenomenon known as cyber...
Reliable object discovery in realistic indoor scenes is a necessity for many computer vision and service robot applications. For such scenes, semantic segmentation methods have made huge advances in recent years. These methods can provide useful prior information for object discovery by removing false positives and by delineating object boundaries. We propose a novel method that combines bottom-up...
With the success of deep learning in the last few years, the object detection community has shifted from exhaustive sliding-window processing to a smaller set of object proposals built on more powerful deep visual representations. Object proposals increase accuracy and speed up the detection process by reducing the search space. In this paper we propose a novel idea of filtering irrelevant edges using...
We propose to learn semantic spatio-temporal embeddings for videos to support high-level video analysis. The first step of the proposed embedding employs a deep architecture consisting of two channels of convolutional neural networks (capturing appearance and local motion) followed by their corresponding Gated Recurrent Unit encoders for capturing longer-term temporal structure of the CNN features...
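The appearance/motion CNN channels followed by GRU encoders described in this abstract can be illustrated with a minimal recurrent-encoding sketch (pure NumPy; the function name `gru_encode`, the single-sequence setting, and the omitted bias terms are simplifications of mine, not the authors' implementation — the per-frame vectors stand in for CNN features):

```python
import numpy as np

def gru_encode(frames, Wz, Uz, Wr, Ur, Wh, Uh):
    """Encode a sequence of per-frame feature vectors with a minimal GRU
    (biases omitted for brevity). Returns the final hidden state, which
    summarizes the longer-term temporal structure of the sequence."""
    d = Uz.shape[0]
    h = np.zeros(d)
    sig = lambda a: 1.0 / (1.0 + np.exp(-a))
    for x in frames:
        z = sig(Wz @ x + Uz @ h)             # update gate
        r = sig(Wr @ x + Ur @ h)             # reset gate
        n = np.tanh(Wh @ x + Uh @ (r * h))   # candidate state
        h = (1 - z) * h + z * n              # gated interpolation
    return h
```

In the full model one such encoder would run over appearance features and another over local-motion features, with their final states combined into the joint embedding.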
We propose mutually incoherent pose bases for action recognition in static images, each of which implicitly represents the co-occurrence of poselets. First, action-specific poselets are trained. To suppress detection ambiguity, we cluster poselet activations by the overlap of each poselet's predicted torso bounds. Then the pose feature of a person performing an action can be extracted as a vector composed...
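The activation-clustering step above relies on the overlap of predicted torso boxes. A minimal sketch of that idea (the IoU overlap measure and a greedy first-fit grouping; the function names and the greedy strategy are illustrative assumptions, not the paper's exact procedure):

```python
def iou(a, b):
    """Intersection-over-union of two boxes given as (x1, y1, x2, y2)."""
    ix = max(0, min(a[2], b[2]) - max(a[0], b[0]))
    iy = max(0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = ix * iy
    union = (a[2]-a[0])*(a[3]-a[1]) + (b[2]-b[0])*(b[3]-b[1]) - inter
    return inter / union if union else 0.0

def cluster_by_torso_overlap(torsos, thresh=0.5):
    """Greedy clustering: each torso box joins the first cluster whose
    representative box overlaps it by at least `thresh` IoU; otherwise
    it starts a new cluster. Returns (representative, member_indices) pairs."""
    clusters = []
    for i, t in enumerate(torsos):
        for rep, members in clusters:
            if iou(rep, t) >= thresh:
                members.append(i)
                break
        else:
            clusters.append((t, [i]))
    return clusters
```

Poselet activations whose torso predictions land in the same cluster are then treated as evidence for the same person.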
Content-based indexing is critical for effective access to multimedia data. To this end, visual data is often annotated with textual content to bridge the semantic gap. In this paper, we present a method to generate frame-level fine-grained annotations for a given video clip. Access to frame-level fine-grained annotations leads to rich, dense, and meaningful semantic associations between...
Wireless capsule endoscopy video summarization (WCE-VS) is in high demand for eliminating redundant frames with high similarity. Conventional WCE-VS methods extract various hand-crafted features as image representations. Research shows that such features only reflect the low-level characteristics of single frames and are essentially ineffective at capturing the semantic similarity between WCE frames...
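The core summarization operation — dropping frames that are too similar to what has already been kept — can be sketched with a simple cosine-similarity filter (the function name `summarize`, the last-kept-frame comparison, and the threshold value are illustrative assumptions; the paper is about replacing the hand-crafted features fed into such a step with semantic ones):

```python
import numpy as np

def summarize(features, thresh=0.9):
    """Keep a frame only if its cosine similarity to the most recently
    kept frame falls below `thresh`, i.e. it is sufficiently novel.
    `features` is an (n_frames, dim) array of per-frame descriptors."""
    kept = [0]
    for i in range(1, len(features)):
        a, b = features[kept[-1]], features[i]
        cos = a @ b / (np.linalg.norm(a) * np.linalg.norm(b))
        if cos < thresh:
            kept.append(i)
    return kept
```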
Realistic scene object recognition in computer vision still faces great challenges due to the large intra-class variation of object images caused by factors like object appearance variation and viewpoint change. To address this challenge, we propose to exploit the semantic relationships embedded in object taxonomy for improved object recognition. Specifically, we exploit the relationships in the object...
Attributes are defined as mid-level image characteristics shared among different categories. These characteristics are well suited to classification problems, especially when training data are scarce. In this paper, we design discriminative real-valued attributes by learning nonlinear inductive maps. Our method is based on solving a constrained optimization problem that mixes three criteria;...
Multi-label classification has attracted much attention in various fields, such as text categorization and semantic image annotation. Aiming to classify an instance into multiple labels, various multi-label classification methods have been proposed. However, the existing methods typically build models in the identical feature (sub)space for all labels, which may be inconsistent with real-world problems...
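The shared-feature-space baseline this abstract argues against can be sketched as binary relevance: one independent classifier per label, all trained on the identical feature space (a minimal NumPy logistic-regression sketch of my own for illustration; it is the conventional setup being criticized, not the paper's proposed method):

```python
import numpy as np

def train_binary_relevance(X, Y, lr=0.5, epochs=500):
    """Binary relevance: fit one logistic-regression classifier per label,
    each using the same feature space X. Y is an (n, L) 0/1 label matrix."""
    n, d = X.shape
    L = Y.shape[1]
    W, b = np.zeros((L, d)), np.zeros(L)
    for _ in range(epochs):
        for j in range(L):
            p = 1.0 / (1.0 + np.exp(-(X @ W[j] + b[j])))  # per-label sigmoid
            g = p - Y[:, j]                               # gradient of log-loss
            W[j] -= lr * (X.T @ g) / n
            b[j] -= lr * g.mean()
    return W, b

def predict(X, W, b, thresh=0.5):
    P = 1.0 / (1.0 + np.exp(-(X @ W.T + b)))
    return (P >= thresh).astype(int)
```

Methods like the one proposed here would instead give each label its own feature (sub)space rather than reusing X unchanged for every classifier.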
Correlation trackers have achieved huge success in visual object tracking. However, mainly because such trackers cannot detect the occurrence of appearance changes, tracking based on correlation filters often drifts under unexpected appearance changes caused by occlusion, deformation, and background clutter. In this paper, we propose a new method to detect the case when the tracker encounters...
We consider the problem of joint modeling of videos and their corresponding textual descriptions (e.g. sentences or phrases). Our approach consists of three components: the video representation, the textual representation, and a joint model that links videos and text. Our video representation uses the state-of-the-art deep 3D ConvNet to capture the semantic information in the video. Our textual representation...
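Once videos and text are embedded in the joint space described above, retrieval reduces to nearest-neighbor search under a similarity measure. A minimal sketch of that final step (cosine similarity and the function name `rank_videos` are my assumptions; the abstract does not specify the similarity used):

```python
import numpy as np

def rank_videos(text_vec, video_vecs):
    """Rank video embeddings by cosine similarity to a text embedding,
    assuming both already live in the shared joint space.
    Returns video indices, best match first."""
    t = text_vec / np.linalg.norm(text_vec)
    V = video_vecs / np.linalg.norm(video_vecs, axis=1, keepdims=True)
    sims = V @ t                 # cosine similarity per video
    return np.argsort(-sims)     # descending similarity
```

The same function works in the other direction (ranking sentences for a query video) by swapping the arguments.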
Region-based Image Retrieval (RBIR), which relies on image segmentation rather than global features or key-point-based local features, is a branch of Content-based Image Retrieval. This paper proposes a novel RBIR-oriented image segmentation algorithm named Edge Integrated Minimum Spanning Tree (EI-MST). The difference between EI-MST and traditional MST-based methods is that EI-MST generates...
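As background for EI-MST, a traditional MST-based segmentation can be sketched as Kruskal-style component merging on a 4-connected pixel grid: sort edges by intensity difference and merge only across cheap edges (this is my simplified illustration of the baseline family; EI-MST itself differs precisely in how it generates and weights the tree):

```python
def segment(img, thresh):
    """Count segments produced by merging 4-connected pixels whose
    intensity difference is below `thresh`, processing edges in
    ascending weight order (union-find with path halving)."""
    h, w = len(img), len(img[0])
    parent = list(range(h * w))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    edges = []
    for y in range(h):
        for x in range(w):
            if x + 1 < w:   # horizontal neighbor
                edges.append((abs(img[y][x] - img[y][x+1]), y*w + x, y*w + x + 1))
            if y + 1 < h:   # vertical neighbor
                edges.append((abs(img[y][x] - img[y+1][x]), y*w + x, (y+1)*w + x))
    for wgt, a, b in sorted(edges):
        if wgt < thresh:
            ra, rb = find(a), find(b)
            if ra != rb:
                parent[ra] = rb
    return len({find(i) for i in range(h * w)})
```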
CNN-based semantic segmentation methods generally assume that pixel-wise annotations are available, which are costly to obtain. On the other hand, image-level annotations are much easier to obtain than pixel-level annotations. In this work, we therefore focus on weakly-supervised semantic segmentation, the task of training with only image-level annotations. In this paper, we...
Deep learning-based models have recently proven widely successful, outperforming traditional approaches in several computer vision applications such as image classification, object recognition, and action recognition. However, these models are not naturally designed to learn structural information that can be important for tasks such as human pose estimation and structured semantic interpretation of...
Although good results for automatic text classification can be achieved with the bag-of-words representation, this model is not suitable for all classification problems, and richer text representations may be required. In this paper, we propose two text representation models based on semantic role labels and analyze them in text classification scenarios. We also evaluate the combination of...
Minimization of discrete energy functions with higher-order potentials is a challenging yet important problem. In this work, a three-step procedure is presented and exemplified on a general problem related to dense depth map computation from multi-view configurations: achieving a joint reconstruction of structure and semantics with piecewise planarity constraints. The three steps...
Topic models (e.g., pLSA, LDA, SLDA) have been widely used for segmenting imagery. These models are confined to crisp segmentation. Yet, there are many images in which some regions cannot be assigned a crisp label (e.g., transition regions between a foggy sky and the ground or between sand and water at a beach). In these cases, a visual word is best represented with partial memberships across multiple...
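The partial memberships this abstract motivates are exactly the topic responsibilities that pLSA/LDA-style models compute before any crisp argmax is taken. A minimal sketch of that soft assignment (the function name `memberships` and the matrix shapes are my conventions):

```python
import numpy as np

def memberships(phi, theta):
    """Partial topic memberships p(topic | word) ∝ theta[k] * phi[k, word],
    kept as a distribution instead of collapsed to a crisp label.
    phi:   (K, V) topic-word probabilities
    theta: (K,)   topic proportions for the image/document
    Returns a (V, K) matrix whose rows sum to 1."""
    joint = theta[:, None] * phi           # (K, V) unnormalized posteriors
    return (joint / joint.sum(axis=0)).T   # normalize per word, transpose
```

A pixel in a foggy sky/ground transition region would then carry, say, 0.6 "sky" and 0.4 "ground" membership rather than a single hard label.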
Word embedding models are capable of capturing the semantic content of textual words. The process of extracting a set of word embedding vectors from a text document is similar to the feature extraction step of the Bag-of-Features pipeline, which is often used in computer vision tasks. That gives rise to the Bag-of-Embedded-Words (BoEW) model. In this paper a novel learning technique that...
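The baseline BoEW pipeline the abstract builds on can be sketched end to end: learn a codebook over word embeddings (plain Lloyd's k-means here) and represent each document by its normalized codeword histogram. This is my illustration of the standard Bag-of-Features analogy, not the novel learning technique the paper proposes:

```python
import numpy as np

def kmeans(X, k, iters=20, seed=0):
    """Plain Lloyd's k-means: returns a (k, dim) codebook for embeddings X."""
    rng = np.random.default_rng(seed)
    C = X[rng.choice(len(X), k, replace=False)]
    for _ in range(iters):
        labels = np.argmin(((X[:, None] - C[None]) ** 2).sum(-1), axis=1)
        for j in range(k):
            if np.any(labels == j):        # skip empty clusters
                C[j] = X[labels == j].mean(axis=0)
    return C

def boew_histogram(word_vecs, codebook):
    """Quantize each word embedding to its nearest codeword and return
    the L1-normalized codeword histogram as the document representation."""
    d2 = ((word_vecs[:, None] - codebook[None]) ** 2).sum(-1)
    labels = np.argmin(d2, axis=1)
    hist = np.bincount(labels, minlength=len(codebook)).astype(float)
    return hist / hist.sum()
```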