Search results

Items from 1 to 20 out of 57 results

chapter

Learning to walk with prior knowledge

Martin Gottwald, Dominik Meyer, Hao Shen, Klaus Diepold

2017 IEEE International Conference on Advanced Intelligent Mechatronics (AIM) > 1369 - 1374

2017 IEEE International Conference on Advanced Intelligent Mechatronics (AIM)

In this work a novel approach to Transfer Learning for the use in Deep Reinforcement Learning is introduced. The agent is realized as an actor-critic framework, namely the Deep Deterministic Policy Gradient algorithm. The Q-function and the policy are represented as deep feed-forward networks, that are trained by minimizing the mean squared Bellman error and by maximizing the expected reward, respectively...

chapter

End-to-End Driving in a Realistic Racing Game with Deep Reinforcement Learning

Etienne Perot, Maximilian Jaritz, Marin Toromanoff, Raoul de Charette

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) > 474 - 475

2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)

We address the problem of autonomous race car driving. Using a recent rally game (WRC6) with realistic physics and graphics we train an Asynchronous Actor Critic (A3C) in an end-to-end fashion and propose an improved reward function to learn faster. The network is trained simultaneously on three very different tracks (snow, mountain, and coast) with various road structures, graphics and physics. Despite...

chapter

Learning how to drive in a real world simulation with deep Q-Networks

Peter Wolf, Christian Hubschneider, Michael Weber, Andre Bauer, et al.

2017 IEEE Intelligent Vehicles Symposium (IV) > 244 - 250

2017 IEEE Intelligent Vehicles Symposium (IV)

We present a reinforcement learning approach using Deep Q-Networks to steer a vehicle in a 3D physics simulation. Relying solely on camera image input the approach directly learns steering the vehicle in an end-to-end manner. The system is able to learn human driving behavior without the need of any labeled training data. An action-based reward function is proposed, which is motivated by a potential...

chapter

Training neural networks with policy gradient

Sourabh Bose, Manfred Huber

2017 International Joint Conference on Neural Networks (IJCNN) > 3998 - 4005

2017 International Joint Conference on Neural Networks (IJCNN)

Neural networks are a powerful function approximation tool which has the ability to model any function with arbitrary precision. For any function as a black box, it is able to reconstruct the function given the target and the input data. However, there are problems where the target is at least partially unknown. In such cases it is impossible for a traditional neural network to compute the gradient...

chapter

Heterogeneous team deep q-learning in low-dimensional multi-agent environments

Mateusz Kurek, Wojciech Jaskowski

2016 IEEE Conference on Computational Intelligence and Games (CIG) > 1 - 8

2016 IEEE Conference on Computational Intelligence and Games (CIG)

Deep Q-Learning is an effective reinforcement learning method, which has recently obtained human-level performance for a set of Atari 2600 games. Remarkably, the system was trained on the high-dimensional raw visual data. Is Deep Q-Learning equally valid for problems involving a low-dimensional state space? To answer this question, we evaluate the components of Deep Q-Learning (deep architecture,...

chapter

How to not get frustrated with neural networks

B M Wilamowski

2011 IEEE International Conference on Industrial Technology > 5 - 11

2011 IEEE International Conference on Industrial Technology (ICIT 2011)

In the presentation major difficulties of designing neural networks are shown. It turns out that popular MLP (Multi Layer Perceptron) networks in most cases produce far from satisfactory results. Also, the popular EBP (Error Back Propagation) algorithm is very slow and often is not capable of training the best neural network architectures. The very powerful and fast LM (Levenberg-Marquardt) algorithm was unfortunately...

chapter

Fast cell detection in high-throughput imagery using GPU-accelerated machine learning

D Mayerich, Jaerock Kwon, A Panchal, J Keyser, et al.

2011 IEEE International Symposium on Biomedical Imaging: From Nano to Macro > 719 - 723

2011 8th IEEE International Symposium on Biomedical Imaging: From Nano to Macro (ISBI 2011)

High-throughput microscopy allows fast imaging of large tissue samples, producing an unprecedented amount of sub-cellular information. The size and complexity of these data sets often out-scale current reconstruction algorithms. Overcoming this computational bottleneck requires extensive parallel processing and scalable algorithms. As high-throughput imaging techniques move into mainstream research,...

chapter

Support Vector Machines on GPU with Sparse Matrix Format

Tsung-Kai Lin, Shao-Yi Chien

2010 Ninth International Conference on Machine Learning and Applications > 313 - 318

2010 Ninth International Conference on Machine Learning and Applications (ICMLA 2010)

Emerging general-purpose Graphics Processing Unit (GPU) provides a multi-core platform for wide applications, including machine learning algorithms. In this paper, we proposed several techniques to accelerate Support Vector Machines (SVM) on GPUs. Sparse matrix format is introduced into parallel SVM to achieve better performance. Experimental results show that the speedup of 55x-133.8x over LIBSVM...

chapter

A novel FPGA-based SVM classifier

M Papadonikolakis, C Bouganis

2010 International Conference on Field-Programmable Technology > 283 - 286

2010 International Conference on Field-Programmable Technology (FPT 2010)

Support Vector Machines (SVMs) are a powerful supervised learning tool, providing state-of-the-art accuracy at a cost of high computational complexity. The SVM classification suffers from linear dependencies on the number of the Support Vectors and the problem's dimensionality. In this work, we propose a scalable FPGA architecture for the acceleration of SVM classification, which exploits the device...

chapter

Intelligent Task Mapping Using Machine Learning

D Tetzlaff, S Glesner

2010 International Conference on Computational Intelligence and Software Engineering > 1 - 4

2010 International Conference on Computational Intelligence and Software Engineering (CiSE 2010)

Task scheduling and task allocation, which are vital parts of mapping parallel programs to concurrent architectures, must take into account the interprocessor communication, whose overheads have emerged as the major performance limitation in parallel applications. Furthermore, its power consumption is an important research focus which must be addressed. Finding an optimal solution requires information...

chapter

Deep Spatiotemporal Feature Learning with Application to Image Classification

Thomas P Karnowski, Itamar Arel, Derek Rose

2010 Ninth International Conference on Machine Learning and Applications > 883 - 888

2010 Ninth International Conference on Machine Learning and Applications (ICMLA 2010)

Deep machine learning is an emerging framework for dealing with complex high-dimensionality data in a hierarchical fashion which draws some inspiration from biological sources. Despite the notable progress made in the field, there remains a need for an architecture that can represent temporal information with the same ease that spatial information is discovered. In this work, we present new results...

chapter

Evolving Arbitrarily Connected Feedforward Neural Networks via Genetic Algorithms

W J Puma-Villanueva, F J V Zuben

2010 Eleventh Brazilian Symposium on Neural Networks > 127 - 132

2010 Eleventh Brazilian Symposium on Neural Networks (SBRN 2010)

Though several approaches have already been proposed in the literature to evolve neural network topologies for solving a wide range of machine learning tasks, this paper presents an alternative one, capable of evolving arbitrarily connected feed forward neural networks (ACFNNs), including linear and nonlinear neurons. A genetic algorithm is conceived to adjust the topology and also to perform variable...

chapter

Fuzzy Neural Network for Malware Detect

Yichi Zhang, Jianmin Pang, Feng Yue, Jinxian Cui

2010 International Conference on Intelligent System Design and Engineering Application > 1 > 780 - 783

2010 International Conference on Intelligent System Design and Engineering Application (ISDEA 2010)

The current commercial anti-virus software detects a virus only after the virus has appeared and caused damage. Motivated by the inference technique for detecting viruses, and a recent successful classification method, we explore a system (Radux: Reverse Analysis for Detecting Unsafe eXecutables) for automatically detecting malicious code using the collected dataset of the benign and malicious code...

chapter

Incremental adaptive integration of layers of a hybrid control architecture

M Powers, T Balch

2010 IEEE/RSJ International Conference on Intelligent Robots and Systems > 2012 - 2017

2010 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2010)

Hybrid deliberative-reactive control architectures are a popular and effective approach to the control of robotic navigation applications. However, due to the fundamental differences in the design of the reactive and deliberative layers, the design of hybrid control architectures can pose significant difficulties. We propose a novel approach to improving system-level performance of hybrid control...

chapter

Machine learning and windowed subsecond event detection on PMU data via Hadoop and the openPDC

Paul Trachian

IEEE PES General Meeting > 1 - 5

2010 IEEE Power & Energy Society General Meeting

The high rate of data samples reported by devices that support PMU functionality forces the use of non-traditional methods in order to attempt realtime anomaly detection. Two methods discussed are offline machine learning and a realtime sliding window procedure. In using machine learning techniques it is possible to assert a classifier algorithm, which to a certain degree of accuracy can flag incoming...

chapter

Opposition-based differential evolution for beta basis function neural network

Habib Dhahri, Adel M Alimi

IEEE Congress on Evolutionary Computation > 1 - 8

2010 IEEE Congress on Evolutionary Computation

Many methods for solving optimization problems, whether direct or indirect, rely upon gradient information and therefore may converge to a local optimum. Global optimization methods like evolutionary algorithms overcome this problem, although these techniques are computationally expensive due to the slow nature of the evolutionary process. In this work, a new concept is investigated to accelerate the...

chapter

Dynamic and adaptive self organizing maps applied to high dimensional large scale text clustering

Zhonghui Feng, Junpeng Bao, Junyi Shen

2010 IEEE International Conference on Software Engineering and Service Sciences > 348 - 351

2010 IEEE International Conference on Software Engineering and Service Sciences (ICSESS 2010)

The self-organizing map (SOM) has been used as a tool for mapping high-dimensional input data into a low-dimensional feature map, which has significant advantages for text clustering applications. In this paper, a novel dynamic and adaptive SOM algorithm applied to high dimensional large scale text clustering is proposed. The characteristic feature of this novel neural network model is its dynamic...

chapter

First-order logic learning in Artificial Neural Networks

Mathieu Guillame-Bert, Krysia Broda, Artur d'Avila Garcez

The 2010 International Joint Conference on Neural Networks (IJCNN) > 1 - 8

2010 International Joint Conference on Neural Networks (IJCNN 2010)

Artificial Neural Networks have previously been applied in neuro-symbolic learning to learn ground logic program rules. However, there are few results of learning relations using neuro-symbolic learning. This paper presents the system PAN, which can learn relations. The inputs to PAN are one or more atoms, representing the conditions of a logic rule, and the output is the conclusion of the rule. The...

chapter

Neural network architecture selection analysis with application to cryptography location

J L Wright, M Manic

The 2010 International Joint Conference on Neural Networks (IJCNN) > 1 - 6

2010 International Joint Conference on Neural Networks (IJCNN 2010)

When training a neural network it is tempting to experiment with architectures until a low total error is achieved. The danger in doing so is the creation of a network that loses generality by over-learning the training data; lower total error does not necessarily translate into a low total error in validation. The resulting network may keenly detect the samples used to train it, without being able...

chapter

A Heterogeneous FPGA Architecture for Support Vector Machine Training

Markos Papadonikolakis, Christos-Savvas Bouganis

2010 18th IEEE Annual International Symposium on Field-Programmable Custom Computing Machines > 211 - 214

2010 IEEE 18th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM 2010)

Support Vector Machines is a powerful supervised learning tool. Its training phase, however, is a time-consuming task and heavily dependent on the training dataset size and dimensionality. In this work, we propose a scalable FPGA architecture for the acceleration of SVM training, which exploits the heterogeneous nature of the device and the diversities of the precision requirements among the dataset...

Data set:
ieee
Keywords:
COMPUTER ARCHITECTURE
TRAINING
LEARNING (ARTIFICIAL INTELLIGENCE)

INFONA - science communication portal
