With systems performing Simultaneous Localization And Mapping (SLAM) from a single robot reaching considerable maturity, the possibility of employing a team of robots to collaboratively perform a task has been attracting increasing interest. Promising great impact in a plethora of tasks ranging from industrial inspection to digitization of archaeological structures, collaborative scene perception...
Visual Question Answering is a complex problem that fuses natural language and image processing to answer a question based on information from the image. The basic architecture for accomplishing this uses a CNN to extract features from the image and an RNN for the language processing, then combines the two in an MLP to produce an answer. These architectures perform well at identifying content,...
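The CNN-plus-RNN fusion pattern this abstract describes can be sketched in a few lines. Everything below (the 512/256 feature dimensions, the random stand-in features, the single linear layer over 10 candidate answers) is an illustrative assumption, not the paper's actual model:

```python
import numpy as np

rng = np.random.default_rng(0)

def mlp_fusion(img_feat, q_feat, W, b):
    """Concatenate image and question features, then score candidate
    answers with a single linear layer followed by a softmax."""
    fused = np.concatenate([img_feat, q_feat])
    logits = W @ fused + b
    exp = np.exp(logits - logits.max())
    return exp / exp.sum()

# Hypothetical dimensions: 512-d CNN image feature, 256-d RNN question
# encoding, 10 candidate answers.
img_feat = rng.standard_normal(512)   # stand-in for a CNN output
q_feat = rng.standard_normal(256)     # stand-in for an RNN hidden state
W = rng.standard_normal((10, 768)) * 0.01
b = np.zeros(10)

probs = mlp_fusion(img_feat, q_feat, W, b)
print(probs.shape)  # one probability per candidate answer
```

In a trained system W and b would be learned jointly with the CNN and RNN; here they only demonstrate the fusion step.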
Image aesthetics assessment has been challenging due to its subjective nature. Inspired by Chatterjee's visual neuroscience model, we design the Deep Chatterjee's Machine (DCM), tailored for this task. DCM first learns attributes through parallel supervised pathways, on a variety of selected feature dimensions. A high-level synthesis network is then trained to associate and transform those attributes...
Although the introduction of deep learning has led to significant performance improvements in many machine learning applications, several recent studies have revealed that deep feedforward models are easily fooled. Fooling, in effect, results from the overgeneralization of neural networks over regions far from the training data. To circumvent this problem, this paper proposes a novel elaboration of standard...
In this article, we propose a new optimized embedded architecture, based on soft-core processors, oriented toward visual-attention-based object recognition applications. Our recognition approach relies mainly on two specific modules for online, real-time processing of acquired images: a novel saliency-based feature detector/descriptor module followed by an object classifier module. To deal with such parallel/pipeline...
Accurate prediction of vehicle ego-motion in real time is crucial for an autonomous driving system. In this paper, we formulate the problem of ego-motion classification as video event detection, and we propose an end-to-end deep model to address this problem. In this model, we utilize Convolutional Neural Networks (CNNs) to extract semantic visual features from each video frame, and employ a Long Short...
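The frame-features-into-recurrence pattern behind such models can be illustrated with a single hand-written LSTM step. The dimensions and random weights below are made up for illustration, and a real CNN would supply the per-frame features:

```python
import numpy as np

def lstm_step(x, h, c, W, U, b):
    """One LSTM time step: x is the current frame's feature vector,
    (h, c) are the hidden and cell states carried between frames."""
    n = h.shape[0]
    z = W @ x + U @ h + b                 # all four gates computed at once
    i = 1 / (1 + np.exp(-z[:n]))          # input gate
    f = 1 / (1 + np.exp(-z[n:2*n]))       # forget gate
    o = 1 / (1 + np.exp(-z[2*n:3*n]))     # output gate
    g = np.tanh(z[3*n:])                  # candidate cell state
    c = f * c + i * g
    return np.tanh(c) * o, c

rng = np.random.default_rng(0)
feat_dim, hidden = 64, 16                 # hypothetical sizes
W = rng.standard_normal((4 * hidden, feat_dim)) * 0.1
U = rng.standard_normal((4 * hidden, hidden)) * 0.1
b = np.zeros(4 * hidden)

h = c = np.zeros(hidden)
for _ in range(8):                        # 8 "frames" of stand-in CNN features
    frame_feat = rng.standard_normal(feat_dim)
    h, c = lstm_step(frame_feat, h, c, W, U, b)
print(h.shape)  # final hidden state summarizing the clip
```

A classifier head over the final hidden state (or over all of them) would then produce the ego-motion label.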
Programming tasks on personal service robots in multi-disciplinary teams is challenging. The goal of this research is to enable roboticists and non-programmer domain experts to co-develop robot service scenarios in real world environments using a visual programming environment called RoboStudio. The first key contribution of this paper is presenting the implementation architecture of RoboStudio. This...
This paper documents a pilot study evaluating a simple approach allowing users to eat real food while exploring a virtual environment (VE) through a head-mounted display (HMD). Two cameras mounted on the HMD allowed for video-based stereoscopic see-through when the user’s head orientation pointed toward the food, and the VE would appear when the user turned elsewhere. The pilot study revealed that...
We present an approach to model the deployment costs, including compute and IO costs, of microservice-based applications deployed to a public cloud. Our model, which we dub CostHat, supports both microservices deployed on traditional IaaS or PaaS clouds and services that make use of novel cloud programming paradigms, such as AWS Lambda. CostHat is based on a network model and allows for what-if...
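The kind of what-if analysis such a network cost model enables can be sketched with a toy call-graph calculation. The service names, per-call costs, and fan-out counts below are invented for illustration, and the traversal assumes an acyclic call graph:

```python
# Hypothetical microservice call graph: each service has a per-call
# compute and I/O cost (in arbitrary cost units) and a map of downstream
# services it invokes per incoming call.
services = {
    "frontend": {"compute": 2.0, "io": 0.5, "calls": {"catalog": 1, "cart": 1}},
    "catalog":  {"compute": 1.0, "io": 1.5, "calls": {}},
    "cart":     {"compute": 0.8, "io": 1.0, "calls": {"catalog": 2}},
}

def total_cost(entry, n_requests, services):
    """Propagate n_requests through the call graph and sum the per-call
    compute + I/O cost at every service reached."""
    cost, frontier = 0.0, [(entry, n_requests)]
    while frontier:
        name, n = frontier.pop()
        svc = services[name]
        cost += n * (svc["compute"] + svc["io"])
        for callee, per_call in svc["calls"].items():
            frontier.append((callee, n * per_call))
    return cost

base = total_cost("frontend", 1000, services)
# What-if: cart stops re-fetching the catalog on every call.
services["cart"]["calls"]["catalog"] = 1
print(base, total_cost("frontend", 1000, services))
```

Changing a single edge weight and re-evaluating the model is exactly the style of what-if question (e.g. "what if this service caches?") that a cost model over a service network makes cheap to answer.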
One of the fundamental functionalities for the autonomous navigation of Unmanned Aerial Vehicles (UAVs) is the hovering capability. State-of-the-art techniques for implementing hovering on standard-size UAVs process the camera stream to determine position and orientation (visual odometry). Similar techniques are considered unaffordable in the context of nano-scale UAVs (i.e., a few centimeters in diameter),...
We design an Enriched Deep Recurrent Visual Attention Model (EDRAM), an improved attention-based architecture for multiple object recognition. The proposed model is a fully differentiable unit that can be optimized end-to-end using Stochastic Gradient Descent (SGD). A Spatial Transformer (ST) is employed as the visual attention mechanism, which allows the model to learn the geometric transformation of objects...
Visual odometry is a challenging task, related to simultaneous localization and mapping, that aims to generate a map of the traveled environment from a visual data stream. Based on one or two cameras, motion is estimated from features and pixel differences between frames. Because of the frame rate of the cameras, there are generally small, incremental changes between subsequent frames, where optical flow can be assumed...
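The small-motion assumption can be made concrete in one dimension: under brightness constancy, a small shift d satisfies I_t ≈ -d·I_x, which yields the least-squares estimate below. This is a toy sketch of the idea, not the pipeline of any particular system:

```python
import math

def estimate_shift(f1, f2):
    """Estimate a small 1-D shift between two signals via the optical
    flow constraint: d = -sum(I_x * I_t) / sum(I_x^2)."""
    num = den = 0.0
    for i in range(1, len(f1) - 1):
        ix = (f1[i + 1] - f1[i - 1]) / 2.0   # spatial gradient (central diff)
        it = f2[i] - f1[i]                   # temporal difference
        num += ix * it
        den += ix * ix
    return -num / den

# Synthetic "frames": the second is the first shifted by 0.5 pixels.
frame1 = [math.sin(i / 10.0) for i in range(100)]
frame2 = [math.sin((i - 0.5) / 10.0) for i in range(100)]
print(round(estimate_shift(frame1, frame2), 2))
```

The linearization only holds for sub-pixel motion, which is why high frame rates (small inter-frame displacement) make the assumption reasonable.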
This paper presents the integration of diverse modules for fallen-person detection by a mobile service robot. The integration has been achieved in the ROS middleware (Robot Operating System). The proposed implementation is arranged over a modular architecture of three layers: Hardware, Processing, and Decision. The implemented modules reside in the processing layer. The first module uses...
An architecture for hybrid language systems is presented. A hybrid language has features of both textual languages and visual languages. Textual languages are computer-oriented and are geared toward storage, syntax analysis, and editing. On the other hand, visual languages are human-oriented and are geared toward expressive power, understandability, direct manipulation, and learning cost. Although...
Research on Offline Handwritten Signature Verification has explored a large variety of handcrafted feature extractors, ranging from graphology and texture descriptors to interest points. In spite of advancements in the last decades, the performance of such systems is still far from optimal when tested against skilled forgeries - signature forgeries that target a particular individual. In previous...
Most research in image classification has focused on applications such as face, object, scene and character recognition. This paper presents a comparative study of deep convolutional neural networks (CNNs) and bag-of-visual-words (BOW) variants for recognizing animals. We develop two variants of the bag of visual words (BOW and HOG-BOW) and examine the use of gray and color information as well...
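The core bag-of-visual-words step, assigning each local descriptor to its nearest codeword and histogramming the assignments, can be sketched with a toy 1-D codebook. The codewords and descriptor values below are invented for illustration; real systems use high-dimensional descriptors (e.g. HOG patches) and a codebook learned by clustering:

```python
def bow_histogram(descriptors, codebook):
    """Represent an image as the normalized histogram of nearest-codeword
    assignments of its local descriptors."""
    hist = [0] * len(codebook)
    for d in descriptors:
        # nearest codeword by squared distance
        nearest = min(range(len(codebook)), key=lambda k: (d - codebook[k]) ** 2)
        hist[nearest] += 1
    total = sum(hist)
    return [h / total for h in hist]

codebook = [0.0, 5.0, 10.0]             # toy 1-D "visual words"
descriptors = [0.2, 0.1, 4.9, 5.3, 9.8, 10.1, 9.9, 4.8]
print(bow_histogram(descriptors, codebook))  # → [0.25, 0.375, 0.375]
```

The resulting fixed-length histogram is what a classifier (e.g. an SVM) consumes, regardless of how many descriptors the image produced.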
Currently, supervised deep neural networks (DNNs) have been successfully applied in several image classification tasks. However, how to extract powerful data representations and discover semantic concepts from unlabeled data is a more practical issue. Unsupervised feature learning methods aim at extracting abstract representations from unlabeled data. A large body of research illustrates...
Analyzing and visualizing the large datasets generated by real-time spatio-temporal activities (e.g. vehicle mobility or large crowd movement) is a very challenging task. Recursive delays, both at the middleware and at front-end applications, limit the usefulness of real-time analysis. In this paper, we present "Spatial-Crowd", a framework that first handles spatio-temporal data acquisition and processing...
These days, some robots are given an emotional state (expression and recognition) to improve Human-Robot Interaction (HRI) and Robot-Robot Interaction (RRI). In this article we analyze what it means for a robot to have emotion, distinguishing an emotional state used for communication from an emotional state that serves as a mechanism for organizing its behavior with humans and robots, using a convolutional neural network...
Computing systems increasingly comprise large numbers of heterogeneous subsystems, each with its own local perspective and goals, connected in dynamic networks, and interacting with each other and with humans in ways that are difficult to predict. Nevertheless, users engaging with different parts of the system still expect high performance, reliability, security and other qualities, provided in a way...