Search results

chapter

Creation of a deep convolutional auto-encoder in Caffe

Volodymyr Turchenko, Artur Luczak

2017 9th IEEE International Conference on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications (IDAACS) > 2 > 651 - 659

2017 9th IEEE International Conference on Intelligent Data Acquisition and Advanced Computing Systems: Technology and Applications (IDAACS)

The development of a deep (stacked) convolutional auto-encoder in the Caffe deep learning framework is presented in this paper. We describe simple principles which we used to create this model in Caffe. The proposed model of convolutional auto-encoder does not have pooling/unpooling layers yet. The results of our experimental research show comparable accuracy of dimensionality reduction in comparison...

chapter

Robust visual tracking based on kernelized correlation filters

Min Jiang, Jianyu Shen, Jun Kong, Benxuan Wang

2017 IEEE International Conference on Information and Automation (ICIA) > 110 - 115

2017 IEEE International Conference on Information and Automation (ICIA)

Recently, kernelized correlation Filter-based trackers have aroused the interest of many researchers and achieved good results in the field of tracking. However, the current tracking model based on kernelized correlation filters can not deal with the changes of the target appearance and scale effectively. Therefore, in this paper, we intend to solve these two problems and improve the robustness of...

chapter

Generative Hierarchical Learning of Sparse FRAME Models

Jianwen Xie, Yifei Xu, Erik Nijkamp, Ying Nian Wu, more

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 1933 - 1941

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

This paper proposes a method for generative learning of hierarchical random field models. The resulting model, which we call the hierarchical sparse FRAME (Filters, Random field, And Maximum Entropy) model, is a generalization of the original sparse FRAME model by decomposing it into multiple parts that are allowed to shift their locations, scales and rotations, so that the resulting model becomes...

chapter

On Human Motion Prediction Using Recurrent Neural Networks

Julieta Martinez, Michael J. Black, Javier Romero

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR) > 4674 - 4683

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)

Human motion modelling is a classical problem at the intersection of graphics and computer vision, with applications spanning human-computer interaction, motion synthesis, and motion prediction for virtual and augmented reality. Following the success of deep learning methods in several computer vision tasks, recent work has focused on using deep recurrent neural networks (RNNs) to model human motion,...

chapter

Show, attend and interact: Perceivable human-robot social interaction through neural attention Q-network

Ahmed Hussain Qureshi, Yutaka Nakamura, Yuichiro Yoshikawa, Hiroshi Ishiguro

2017 IEEE International Conference on Robotics and Automation (ICRA) > 1639 - 1645

2017 IEEE International Conference on Robotics and Automation (ICRA)

For a safe, natural and effective human-robot social interaction, it is essential to develop a system that allows a robot to demonstrate the perceivable responsive behaviors to complex human behaviors. We introduce the Multimodal Deep Attention Recurrent Q-Network using which the robot exhibits human-like social interaction skills after 14 days of interacting with people in an uncontrolled real world...

chapter

Semantic-free attributes for image classification

Quentin Oliveau, Hichem Sahbi

2016 23rd International Conference on Pattern Recognition (ICPR) > 1577 - 1582

2016 23rd International Conference on Pattern Recognition (ICPR)

Attributes are defined as mid-level image characteristics shared among different categories. These characteristics are suitable in order to handle classification problems especially when training data are scarce. In this paper, we design discriminative real-valued attributes by learning nonlinear inductive maps. Our method is based on solving a constrained optimization problem that mixes three criteria;...

chapter

Study of Dynamic Scheduling Strategy for Large-Scale Terrain Visualization in Flight Simulation System

Gao Qiang, Ji Ming, Pang Lan, Wang Jing, more

2016 9th International Symposium on Computational Intelligence and Design (ISCID) > 2 > 361 - 365

2016 9th International Symposium on Computational Intelligence and Design (ISCID)

Analyze the characteristic of three-dimensional scene visualization and regulation of scene change in flight simulation system, which is the basis of implementation of terrain reconstruction. Give the related mathematics model of terrain visualization. In order to resolve the problem that three-dimensional scene could not be reconstructed fast because of large-scale terrain data, present a procedure...

chapter

Multiple scaling factors based Semi-Blind watermarking of grayscale images using OS-ELM neural network

Ankit Rajpal, Anurag Mishra, Rajni Bala

2016 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC) > 1 - 6

2016 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC)

In this paper, a multiple scaling factor based Semi-Blind watermarking scheme for grayscale image watermarking using Online Sequential Extreme Learning Machine (OS-ELM) is proposed. Four-level DWT is applied on three standard test images of size 512 × 512. LL4 sub-band coefficients are chosen for watermark embedding. OS-ELM is initially tuned with a fixed number of training data used in its initial...

chapter

SAM: A rethinking of prominent convolutional neural network architectures for visual object recognition

Zhenyang Wang, Zhidong Deng, Shiyao Wang

2016 International Joint Conference on Neural Networks (IJCNN) > 1008 - 1014

2016 International Joint Conference on Neural Networks (IJCNN)

Convolutional neural networks play an increasingly important role in computer vision tasks, especially in the field of visual object recognition. Many prominent models, such as Inception, Maxout, ResNet, and NIN, have been proposed to significantly improve recognition performance. Inspired from those models, we propose a novel module called self-adaptive module (SAM). SAM consists of four passes and...

chapter

Learning partial differential equations for saliency detection

Zhenyu Zhao, Chenping Hou, Yi Wu, Yuanyuan Jiao

2016 IEEE International Conference on Big Data Analysis (ICBDA) > 1 - 5

2016 IEEE International Conference on Big Data Analysis (ICBDA)

Learning-based partial differential equations (PDEs), which combine fundamental differential invariants into a nonlinear regressor, have been successfully applied to several computer vision and image processing problems. However, it cannot apply to saliency detection directly. In this paper, we present a novel learning-based PDEs model and learn the PDEs from training samples. We simplify the current...

chapter

A texture retrieval scheme based on perceptual features

Yanhai Gan, Xiaoxu Cai, Jun Liu, Shengke Wang

2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA) > 897 - 900

2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA)

Procedural textures have been widely used as they can be easily generated from various mathematical models. However, the model parameters are not perceptually meaningful or uniform for non-expert users; therefore it is difficult for general users to obtain a desired texture by tuning the parameters. In order to satisfy users' requirement, we propose a novel procedural texture retrieval scheme that...

chapter

Natural language image descriptor

Anurag Kishore, Sanjay Singh

2015 IEEE Recent Advances in Intelligent Computational Systems (RAICS) > 110 - 115

2015 IEEE Recent Advances in Intelligent Computational Systems (RAICS)

Generating descriptions for visual data (images and video) automatically has been a complicated task in the field of Computer Vision and Artificial Intelligence. This paper discusses the working of and improvements on an algorithm called Neural Image Captioner (NIC) by Oriol Vinyals and his team, which uses a deep convolutional and recurrent architecture to generate natural language sentences to describe...

chapter

Choosing Basic-Level Concept Names Using Visual and Language Context

Alexander Mathews, Lexing Xie, Xuming He

2015 IEEE Winter Conference on Applications of Computer Vision > 595 - 602

2015 IEEE Winter Conference on Applications of Computer Vision (WACV)

We study basic-level categories for describing visual concepts, and empirically observe context-dependant basic level names across thousands of concepts. We propose methods for predicting basic-level names using a series of classification and ranking tasks, producing the first large scale catalogue of basic-level names for hundreds of thousands of images depicting thousands of visual concepts. We...

chapter

Multi-image morphing: Summarizing visual information from similar ancient coin image regions

Stefan Hodlmoser, Sebastian Zambanini, Martin Kampel

2014 International Conference on Virtual Systems & Multimedia (VSMM) > 161 - 168

2014 International Conference on Virtual Systems & Multimedia (VSMM)

The process of synthetically producing an image illustrating merged parts of multiple source images is usually known as image morphing. In this work a system is presented which morphs more than two source images to one output image. Its focus lies on using ancient coin images belonging to a common coin type. Nowadays, these coins can be worn or damaged. The goal of the presented morphing framework...

chapter

LDA Analyzer: A Tool for Exploring Topic Models

Chunyao Zou, Daqing Hou

2014 IEEE International Conference on Software Maintenance and Evolution > 593 - 596

2014 IEEE International Conference on Software Maintenance and Evolution (ICSME)

Online technical forums are valuable sources for mining useful software engineering information. LDA (Latent Dirichlet Allocation) is an unsupervised machine learning method which can be used for extracting underlying topics out of such large forums. However, the main output of LDA forum learning are usually huge matrices that contain millions of numbers, which is impossible for researchers to directly...

chapter

Improving Automatic Image Annotation with Google Semantic Link

Haijiao Xu, Peng Pan, Yansheng Lu, Chunyan Xu, more

2014 10th International Conference on Semantics, Knowledge and Grids > 177 - 184

2014 Tenth International Conference on Semantics, Knowledge and Grids (SKG)

During the past few years, there has been a massive explosion of multimedia content such as un-annotated images on the web. Automatic image annotation is an important task for multimedia retrieval. By automatically allocating semantic concepts to un-annotated images, image retrieval can be performed over annotation concepts. In this work, we address the problem of automatic image annotation, namely...

chapter

Development of a water ski simulator for indoor training with proprioceptive and visual feedback

Roberto Oboe, Riccardo Antonello, Francesco Biral

2014 IEEE 13th International Workshop on Advanced Motion Control (AMC) > 428 - 433

2014 IEEE 13th International Workshop on Advanced Motion Control (AMC)

This paper reports the preliminary development of a water-ski simulator for indoor training. Compared to existing training systems, the proposed simulator is capable of recreating a more realistic and immersive simulation experience, by providing both a proprioceptive and visual feedback to the practicing skier. In addition, it allows to practically test any desired skiing manoeuvre, since the ski...

chapter

Biological Image Temporal Stage Classification via Multi-layer Model Collaboration

Tao Meng, Mei-Ling Shyu

2013 IEEE International Symposium on Multimedia > 30 - 37

2013 IEEE International Symposium on Multimedia (ISM)

In current biological image analysis, the temporal stage information, such as the developmental stage in the Drosophila development in situ hybridization images, is important for biological knowledge discovery. Such information is usually gained through visual inspection by experts. However, as the high-throughput imaging technology becomes increasingly popular, the demand for labor effort on annotating,...

chapter

Enhancing object recognition for humanoid robots through time-awareness

Andreas Holzbach, Gordon Cheng

2013 13th IEEE-RAS International Conference on Humanoid Robots (Humanoids) > 246 - 251

2013 13th IEEE-RAS International Conference on Humanoid Robots (Humanoids 2013)

In this paper, we present a biologically-inspired object recognition system for humanoid robots. Our approach is based on a hierarchical model of the visual cortex for feature extraction and rapid scene categorization of natural images. We enhanced the model to be entropy-aware and real-time capable, to be able to realize object recognition over time. We integrate time in our system to model uncertainty...

chapter

Dynamic scene models for incremental, long-term, appearance-based localisation

Edward Johns, Guang-Zhong Yang

2013 IEEE International Conference on Robotics and Automation > 2731 - 2736

2013 IEEE International Conference on Robotics and Automation (ICRA)

In this paper we present a new appearance-based localisation system that is able to deal with dynamic elements in the scene. By independently modelling the properties of local features observed in a scene over long periods of time, we show that feature appearances and geometric relationships can be learned more accurately than when representing a location by a single image. We also present a new dataset...

INFONA - science communication portal

Search results

Creation of a deep convolutional auto-encoder in Caffe

Robust visual tracking based on kernelized correlation filters

Generative Hierarchical Learning of Sparse FRAME Models

On Human Motion Prediction Using Recurrent Neural Networks

Show, attend and interact: Perceivable human-robot social interaction through neural attention Q-network

Semantic-free attributes for image classification

Study of Dynamic Scheduling Strategy for Large-Scale Terrain Visualization in Flight Simulation System

Multiple scaling factors based Semi-Blind watermarking of grayscale images using OS-ELM neural network

SAM: A rethinking of prominent convolutional neural network architectures for visual object recognition

Learning partial differential equations for saliency detection

A texture retrieval scheme based on perceptual features

Natural language image descriptor

Choosing Basic-Level Concept Names Using Visual and Language Context

Multi-image morphing: Summarizing visual information from similar ancient coin image regions

LDA Analyzer: A Tool for Exploring Topic Models

Improving Automatic Image Annotation with Google Semantic Link

Development of a water ski simulator for indoor training with proprioceptive and visual feedback

Biological Image Temporal Stage Classification via Multi-layer Model Collaboration

Enhancing object recognition for humanoid robots through time-awareness

Dynamic scene models for incremental, long-term, appearance-based localisation

Filter options

Publication date

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options