Search results

chapter

PHOCNet: A Deep Convolutional Neural Network for Word Spotting in Handwritten Documents

Sebastian Sudholt, Gernot A. Fink

2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR) > 277 - 282

In recent years, deep convolutional neural networks have achieved state-of-the-art performance in various computer vision tasks such as classification, detection or segmentation. Due to their outstanding performance, CNNs are increasingly used in the field of document image analysis as well. In this work, we present a CNN architecture that is trained with the recently proposed PHOC representation...

chapter

ICFHR2016 Handwritten Keyword Spotting Competition (H-KWS 2016)

Ioannis Pratikakis, Konstantinos Zagoris, Basilis Gatos, Joan Puigcerver, et al.

2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR) > 613 - 618

The H-KWS 2016, organized in the context of the ICFHR 2016 conference, aims at setting up an evaluation framework for benchmarking handwritten keyword spotting (KWS), examining both the Query by Example (QbE) and the Query by String (QbS) approaches. Both KWS approaches were hosted in two different tracks, which in turn were split into two distinct challenges, namely, a segmentation-based and a segmentation-free...

chapter

Visual Aesthetic Analysis for Handwritten Document Images

Anshuman Majumdar, Praveen Krishnan, C.V. Jawahar

2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR) > 423 - 428

We present an approach for analyzing the visual aesthetic property of a handwritten document page that matches human perception. We formulate the problem at two independent levels: (i) a coarse level, which deals with the overall layout and the use of space between lines, words and margins, and (ii) a fine level, which analyses the construction of each word and deals with the aesthetic properties of writing...

chapter

Bag of Genres for Video Retrieval

Leonardo A. Duarte, Otavio A. B. Penatti, Jurandy Almeida

2016 29th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI) > 257 - 264

Often, videos are composed of multiple concepts or even genres. For instance, news videos may contain sports, action, nature, etc. Therefore, encoding the distribution of such concepts/genres in a compact and effective representation is a challenging task. In this sense, we propose the Bag of Genres representation, which is based on a visual dictionary defined by a genre classifier. Each visual word...

chapter

Semantic and Verbatim Word Spotting Using Deep Neural Networks

Tomas Wilkinson, Anders Brun

2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR) > 307 - 312

In the last few years, deep convolutional neural networks have become ubiquitous in computer vision, achieving state-of-the-art results on problems like object detection, semantic segmentation, and image captioning. However, they have not yet been widely investigated in the document analysis community. In this paper, we present a word spotting system based on convolutional neural networks. We train...

chapter

Gameplay Genre Video Classification by Using Mid-Level Video Representation

Renato Augusto de Souza, Raquel Pereira de Almeida, Arghir-Nicolae Moldovan, Zenilton Kleber G. do Patrocinio, et al.

2016 29th SIBGRAPI Conference on Graphics, Patterns and Images (SIBGRAPI) > 188 - 194

As video gameplay recording and streaming is becoming very popular on the Internet, there is an increasing need for automatic classification solutions to help service providers with indexing the huge amount of content and users with finding relevant content. The automatic classification of gameplay videos into specific genres is not a trivial task due to their high content diversity. This paper address...

chapter

Effect of adaptive thresholding on shot boundary detection performance

Soyoung Park, Jeongwoo Son, Sun-Joong Kim

2016 IEEE International Conference on Consumer Electronics-Asia (ICCE-Asia) > 1 - 2

2016 IEEE International Conference on Consumer Electronics - Asia (ICCE-Asia)

The effect of an adaptive threshold on shot boundary detection performance is analyzed in this paper, where the threshold is used to determine whether a target frame is a shot boundary in broadcast video content. The adaptive threshold is calculated using an input threshold and the visual similarities of the frames adjacent to a target frame. The experimental results show that application of adaptive threshold...

chapter

Underwater scene search scheme via similarity measure and sparse representation for autonomous underwater vehicle

Zhiyuan Wang, Yue Geng, Congcong Shi, Rui Nian, et al.

OCEANS 2016 MTS/IEEE Monterey > 1 - 5

Underwater scene search is one of the most challenging topics in underwater image analysis. In this paper, we present an underwater scene search scheme that combines a similarity measure with sparse representation. The color histogram is first adopted to classify the candidate image patches for each kind of underwater scene. At the same time, the feature similarity (FSIM) considers...

chapter

A bag of relevant regions model for visual place recognition in coral reefs

Alejandro Maldonado-Ramirez, L. Abril Torres-Mendez

OCEANS 2016 MTS/IEEE Monterey > 1 - 5

Vision-based place recognition in underwater environments is a key component for autonomous robotic exploration. However, this task can be very challenging due to the inherent properties of such places, including color distortion, poor visibility, perceptual aliasing and dynamic illumination. In this paper, we present a method for vision-based place recognition in coral reefs. Our method relies...

chapter

Elements of inferential statistics in a quantitative assessment of illuminations of architectural structures

Wieslawa Malska, Henryk Wachta

2016 IEEE Lighting Conference of the Visegrad Countries (Lumen V4) > 1 - 6

The topic presented in this paper covers statistical studies on illumination conducted using specialised software dedicated to such simulations. A pre-designed computer visualisation of illumination, which included zonal illuminations of a selected architectural structure, was modified by a selected group of respondents. As a result of the respondents' individual aesthetic preferences, sets of average luminance...

chapter

Visual Improvement for Dense Foggy & Hazy Weather Images, Using Multimodal Enhancement Techniques

P. K. Chaturvedi, Ritu Vijay, R. D. Nirala

2016 International Conference on Micro-Electronics and Telecommunication Engineering (ICMETE) > 620 - 628

Image enhancement processes consist of a collection of techniques that seek to improve the visual appearance of a degraded image. This paper introduces a multimodal enhancement technique for dense foggy images. Currently available techniques do not work in low-visibility conditions such as dense fog. The proposed method changes the intensity component among the converted HIS components from the RGB components...

chapter

Action Recognition Based on Local Fisher Discriminant Analysis and Mix Encoding

Lijun Li, Shuling Dai

2016 International Conference on Virtual Reality and Visualization (ICVRV) > 16 - 23

Action recognition has been one of the most popular fields of computer vision. This paper presents a novel approach to the action recognition problem that uses a dimension reduction method, local Fisher discriminant analysis, to reduce the dimension of feature descriptors as a preprocessing step after feature extraction. We propose to use sparse matrix and randomized kd-tree to modify and accelerate the...

chapter

Distributed object recognition in smart camera networks

Alireza Rahimpour, Ali Taalimi, Jiajia Luo, Hairong Qi

2016 IEEE International Conference on Image Processing (ICIP) > 669 - 673

Distributed object recognition is a significantly fast-growing research area, mainly motivated by the emergence of high performance cameras and their integration with modern wireless sensor network technologies. In wireless distributed object recognition, the bandwidth is limited and it is desirable to avoid transmitting redundant visual features from multiple cameras to the base station. In this...

chapter

Spatio-temporal saliency detection using abstracted fully-connected graphical models

A. H. Karimi, M. J. Shafiee, C. Scharfenberger, I. BenDaya, et al.

2016 IEEE International Conference on Image Processing (ICIP) > 694 - 698

A novel approach to spatio-temporal saliency detection in video is proposed. Saliency computation is considered as an optimization problem that maximizes the energy of a fully-connected graphical model based on spatio-temporal feature distinctiveness. Each pixel in a video is modeled by a node, and the spatio-temporal feature distinctiveness between pixels by edges connecting the nodes in the graph...

chapter

Fast earth mover's distance computation for catadioptric image sequences

O. Tahri, M. Usman, C. Demonceaux, D. Fofi, et al.

2016 IEEE International Conference on Image Processing (ICIP) > 2485 - 2489

Earth mover's distance is one of the most effective metrics for comparing histograms in various image retrieval applications. Its main drawback is its computational complexity, which hinders its usage in various comparison tasks. We propose fast earth mover's distance computation by providing better initialization to the transportation simplex algorithm. The new approach enables faster EMD computation...

chapter

A shape feature based bovw method for image classification using N-gram and spatial pyramid coding scheme

Elham Etemad, Gang Hu, Qigang Gao

2016 IEEE International Conference on Image Processing (ICIP) > 504 - 508

Image classification is a general visual analysis task based on the image content coded by its representation. In this research, we propose an image representation method that is based on perceptual shape features and their spatial distributions. A natural language processing concept, N-gram, is adopted to generate a set of perceptual shape visual words for encoding image features. By combining...

chapter

Visual attention inspired distant view and close-up view classification

Song Tong, Yuen Peng Loh, Xuefeng Liang, Takatsune Kumada

2016 IEEE International Conference on Image Processing (ICIP) > 2787 - 2791

Distant-view and close-up-view images indicate a photographer's attention, which can be further utilized for user behavior analysis and scene evaluation. As images may comprise arbitrary contexts, distant view and close-up view classification becomes non-trivial. In this work, we found that two cues can represent human visual attention, i.e. focus cue and scale cue. We model the focus cue in frequency...

chapter

Introducing temporal order of dominant visual word sub-sequences for human action recognition

N. Kardaris, V. Pitsikalis, E. Mavroudi, P. Maragos

2016 IEEE International Conference on Image Processing (ICIP) > 3061 - 3065

We present a novel video representation for human action recognition by considering temporal sequences of visual words. Based on state-of-the-art dense trajectories, we introduce temporal bundles of dominant, that is, most frequent, visual words. These are employed to construct a complementary action representation of ordered dominant visual word sequences that additionally incorporates fine-grained...

chapter

Smile detection in the wild with hierarchical visual feature

Jiahuiran Li, Junkai Chen, Zheru Chi

2016 IEEE International Conference on Image Processing (ICIP) > 639 - 643

Smile detection in the wild is an interesting and challenging problem. This paper presents an efficient approach that uses hierarchical visual features to handle this problem. In our approach, multi-scale, multi-orientation Gabor filters are first applied to extract facial textures, namely Gabor faces, from the input face image. After this, Histograms of Oriented Gradients (HOG) are employed to encode...

chapter

Content-based image retrieval based on cauchy density function histogram

Guang-Hai Liu

2016 12th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD) > 506 - 510

2016 12th International Conference on Natural Computation and 13th Fuzzy Systems and Knowledge Discovery (ICNC-FSKD)

We propose a novel representation that describes color, intensity, edge orientation, frequency and spatial layout as histogram-based features by simulating the human visual mechanism. In the representation, color volume is used as a low-level feature to detect salient regions. At the same time, a novel representation method of visual feature, namely Cauchy density function histogram, is used to...

INFONA - science communication portal
