Search results

Items from 61 to 80 out of 3,201 results

chapter

Inverse reinforcement learning with leveraged Gaussian processes

Kyungjae Lee, Sungjoon Choi, Songhwai Oh

2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) > 3907 - 3912

In this paper, we propose a novel inverse reinforcement learning algorithm with leveraged Gaussian processes that can learn from both positive and negative demonstrations. While most existing inverse reinforcement learning (IRL) methods suffer from the lack of information near low reward regions, the proposed method alleviates this issue by incorporating (negative) demonstrations of what not to do...

chapter

Modeling behavior of Computer Generated Forces with Machine Learning Techniques, the NATO Task Group approach

Armon Toubman, Jan Joris Roessingh, Joost van Oijen, Rikke Amilde Lovlid, et al.

2016 IEEE International Conference on Systems, Man, and Cybernetics (SMC) > 1906 - 1911

Commercial/Military-Off-The-Shelf (COTS/MOTS) Computer Generated Forces (CGF) packages are widely used in modeling and simulation for training purposes. Conventional CGF packages often include artificial intelligence (AI) interfaces, but lack behavior generation and other adaptive capabilities. We believe Machine Learning (ML) techniques can be beneficial to the behavior modeling process, yet such...

chapter

Playing the game of Congklak with reinforcement learning

Muhammad Firmansyah Kasim

2016 8th International Conference on Information Technology and Electrical Engineering (ICITEE) > 1 - 5

Reinforcement learning is a branch of machine learning that allows an agent to learn which actions to take based on its observations and the rewards it obtains. In this paper, reinforcement learning agents are trained to play the game of Congklak, a traditional game from Indonesia and Malaysia. Congklak is a deterministic board game played by two players who take turns. However, it was found that the common...

chapter

Incremental learning of neural network classifiers using reinforcement learning

Sourabh Bose, Manfred Huber

2016 IEEE International Conference on Systems, Man, and Cybernetics (SMC) > 2097 - 2103

With the availability of more data, classification is increasingly important. However, traditional classification algorithms do not scale well to large data sets and are often not suited when only limited samples of the dataset are available at any point in time. The latter arises, for example, in streaming data when the accumulation of data a priori is infeasible either due to limitations in memory...

chapter

An approach to interactive deep reinforcement learning for serious games

Aline Dobrovsky, Uwe M. Borghoff, Marko Hofmann

2016 7th IEEE International Conference on Cognitive Infocommunications (CogInfoCom) > 85 - 90

Serious games receive increasing interest in the area of e-learning. Their development, however, is often still a demanding, specialized and arduous process, especially when regarding reasonable non-player character behaviour. Reinforcement learning and, since recently, also deep reinforcement learning have proven to automatically generate successful AI behaviour to a certain degree. These methods...

chapter

Deep Q-learning using redundant outputs in visual doom

Hyunsoo Park, Kyung-Joong Kim

2016 IEEE Conference on Computational Intelligence and Games (CIG) > 1 - 2

Recently, there has been growing interest in applying deep learning in the game AI domain. Among these approaches, deep reinforcement learning is the best known in the game AI community. In this paper, we propose to use redundant outputs in order to adapt training progress in deep reinforcement learning. We compare our method with general ε-greedy on the ViZDoom platform. Since an AI player should select an action only based...

chapter

Evaluating real-time strategy game states using convolutional neural networks

Marius Stanescu, Nicolas A. Barriga, Andy Hess, Michael Buro

2016 IEEE Conference on Computational Intelligence and Games (CIG) > 1 - 7

Real-time strategy (RTS) games, such as Blizzard's StarCraft, are fast paced war simulation games in which players have to manage economies, control many dozens of units, and deal with uncertainty about opposing unit locations in real-time. Even in perfect information settings, constructing strong AI systems has been difficult due to enormous state and action spaces and the lack of good state evaluation...

chapter

Heterogeneous team deep q-learning in low-dimensional multi-agent environments

Mateusz Kurek, Wojciech Jaskowski

2016 IEEE Conference on Computational Intelligence and Games (CIG) > 1 - 8

Deep Q-Learning is an effective reinforcement learning method, which has recently obtained human-level performance for a set of Atari 2600 games. Remarkably, the system was trained on the high-dimensional raw visual data. Is Deep Q-Learning equally valid for problems involving a low-dimensional state space? To answer this question, we evaluate the components of Deep Q-Learning (deep architecture,...

chapter

Position-based reinforcement learning biased MCTS for General Video Game Playing

Chun-Yin Chu, Suguru Ito, Tomohiro Harada, Ruck Thawonmas

2016 IEEE Conference on Computational Intelligence and Games (CIG) > 1 - 8

This paper proposes an application of reinforcement learning and position-based features in rollout bias training of Monte-Carlo Tree Search (MCTS) for General Video Game Playing (GVGP). As an improvement on Knowledge-based Fast-Evo MCTS proposed by Perez et al., the proposed method is designated for both the GVG-AI Competition and improvement of the learning mechanism of the original method. The...

chapter

Monte-Carlo simulation balancing revisited

Tobias Graf, Marco Platzner

2016 IEEE Conference on Computational Intelligence and Games (CIG) > 1 - 7

Simulation Balancing is an optimization algorithm to automatically tune the parameters of a playout policy used inside a Monte Carlo Tree Search. The algorithm fits a policy so that the expected result of a policy matches given target values of the training set. Up to now it has been successfully applied to Computer Go on small 9 × 9 boards but failed for larger board sizes like 19 × 19. On these...

chapter

Maximum correntropy based attention-gated reinforcement learning designed for brain machine interface

Hongbao Li, Fang Wang, Qiaosheng Zhang, Shaomin Zhang, et al.

2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC) > 3056 - 3059

Reinforcement learning is an effective algorithm for brain machine interfaces (BMIs) which interprets the mapping between neural activities with plasticity and the kinematics. Exploring a large state-action space is difficult when complicated BMIs need to assign credit over both time and space. For BMIs, attention-gated reinforcement learning (AGREL) has been developed to classify multi-actions...

chapter

Optimal medication dosing from suboptimal clinical examples: A deep reinforcement learning approach

Shamim Nemati, Mohammad M. Ghassemi, Gari D. Clifford

2016 38th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC) > 2978 - 2981

Misdosing medications with sensitive therapeutic windows, such as heparin, can place patients at unnecessary risk, increase length of hospital stay, and lead to wasted hospital resources. In this work, we present a clinician-in-the-loop sequential decision making framework, which provides an individualized dosing policy adapted to each patient's evolving clinical phenotype. We employed retrospective...

chapter

Trust and privacy correlations in social networks: A deep learning framework

Shatha Jaradat, Nima Dokoohaki, Mihhail Matskin, Elena Ferrari

2016 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM) > 203 - 206

Online Social Networks (OSNs) remain the focal point of Internet usage. Since the beginning, networking sites have tried their best to put the right privacy mechanisms in place for users, enabling them to share the right content with the right audience. Despite these efforts, privacy customization remains hard for users across sites. Existing research that addresses this problem mainly focuses on semi-supervised...

chapter

Enhancing supervisory training signals with environmental reinforcement learning using adaptive dynamic programming and artificial neural networks

Niklas Melton, Donald C. Wunsch

2016 IEEE 15th International Conference on Cognitive Informatics & Cognitive Computing (ICCI*CC) > 331 - 335

A method for hybridizing supervised learning with adaptive dynamic programming was developed to increase the speed, quality, and robustness of on-line neural network learning from an imperfect teacher. Reinforcement learning is used to modify and enhance the original supervisory signal before learning occurs. This paper describes the method of hybridization and presents a model problem in which a...

chapter

Category driven deep recurrent neural network for video summarization

Xinhui Song, Ke Chen, Jie Lei, Li Sun, et al.

2016 IEEE International Conference on Multimedia & Expo Workshops (ICMEW) > 1 - 6

A large number of videos are generated and uploaded to video websites (such as Youku and YouTube) every day, and video websites play an increasingly important role in human life. While bringing convenience, this big video data raises the difficulty of video summarization, which allows users to browse a video easily. However, although there are many existing video summarization approaches, the key frames selected...

chapter

Application of Extreme Learning Machine Algorithm in the Regression Fitting

Gu-Xiong Li

2016 International Conference on Information System and Artificial Intelligence (ISAI) > 419 - 422

The ELM (extreme learning machine) algorithm has the advantages of fast learning speed and good generalization performance. It is suitable not only for regression and fitting problems but also for classification and pattern recognition. In this paper, the ELM algorithm is applied to nonlinear function fitting. Its performance and running speed are compared with those of other algorithms, showing the superiority...

chapter

Research on Q-ELM algorithm in robot path planning

Hongge Ren, Rui Yin, Fujin Li, Wei Wang, et al.

2016 Chinese Control and Decision Conference (CCDC) > 5975 - 5979

To address the high dimensionality, training difficulty, and slow learning speed of BP neural networks in mobile robot path planning, this paper proposes a reinforcement Q-learning algorithm based on the extreme learning machine (the Q-ELM algorithm). Firstly, the characteristic of reinforcement learning is combining the dynamic network with supervised learning, and the...

chapter

A fuzzy reinforcement learning algorithm using a predictor for pursuit-evasion games

Mostafa D. Awheda, Howard M. Schwartz

2016 Annual IEEE Systems Conference (SysCon) > 1 - 8

In a pursuit-evasion game, the pursuer learning its strategy by any learning algorithm usually captures the evader when the environment of the game is similar to the environment that the pursuer was trained on. However, the trained pursuer may not be able to capture the evader if the environment of the pursuit-evasion game is different from the training environment. In this paper, we propose a fuzzy...

chapter

Multi-robot target reaching using modified Q-learning and PSO

Orawan Watchanupaporn, Peerapun Pudtuan

2016 2nd International Conference on Control, Automation and Robotics (ICCAR) > 66 - 69

In this paper, a group of mobile robots learns to solve a target reaching problem in a simulated grid environment filled with obstacles. Each robot knows its distance to the target and can communicate with each other. The proposed learning algorithm combines a reinforcement learning algorithm and a swarm optimization algorithm. Q-learning, which is a reinforcement learning algorithm, is modified to...

chapter

Temporal Difference Learning for the Game Tic-Tac-Toe 3D: Applying Structure to Neural Networks

Michiel Van De Steeg, Madalina M. Drugan, Marco Wiering

2015 IEEE Symposium Series on Computational Intelligence (SSCI) > 564 - 570

When reinforcement learning is applied to large state spaces, such as those occurring in playing board games, the use of a good function approximator to learn to approximate the value function is very important. In previous research, multi-layer perceptrons have often been quite successfully used as function approximator for learning to play particular games with temporal difference learning. With...

Data set:
ieee
Keywords:
TRAINING
LEARNING (ARTIFICIAL INTELLIGENCE)

Content availability

Available (3,128)
None (73)

Keywords

ARTIFICIAL NEURAL NETWORKS (821)
CLASSIFICATION ALGORITHMS (791)
SUPPORT VECTOR MACHINES (762)
DATA MINING (755)
MACHINE LEARNING (739)
ACCURACY (685)
PATTERN CLASSIFICATION (654)
FEATURE EXTRACTION (640)
KERNEL (403)
NEURAL NETS (389)
IMAGE CLASSIFICATION (338)
TRAINING DATA (296)
NEURONS (273)
TESTING (258)
SUPPORT VECTOR MACHINE (252)
ALGORITHM DESIGN AND ANALYSIS (223)
DATA MODELS (218)
DATABASES (215)
OPTIMIZATION (206)
FACE (183)
FACE RECOGNITION (178)
MATHEMATICAL MODEL (177)
COMPUTATIONAL MODELING (171)
OBJECT DETECTION (167)
CLASSIFICATION (164)
PIXEL (162)
REGRESSION ANALYSIS (150)
BOOSTING (147)
SUPERVISED LEARNING (142)
PREDICTION ALGORITHMS (141)
PREDICTIVE MODELS (141)
NEURAL NETWORKS (140)
PRINCIPAL COMPONENT ANALYSIS (140)
HIDDEN MARKOV MODELS (136)
SVM (136)
HUMANS (131)
NEURAL NETWORK (129)
STATISTICAL ANALYSIS (129)
PATTERN CLUSTERING (127)
OPTIMISATION (126)
PATTERN RECOGNITION (123)
LEARNING SYSTEMS (121)
SUPPORT VECTOR MACHINE CLASSIFICATION (121)
GENETIC ALGORITHMS (120)
LEARNING (120)
RADIAL BASIS FUNCTION NETWORKS (120)
FUZZY SET THEORY (118)
TEXT ANALYSIS (117)
PROBABILITY (113)
VISUALIZATION (113)
CLUSTERING ALGORITHMS (110)
IMAGE SEGMENTATION (110)
ADAPTATION MODEL (109)
DECISION TREES (108)
BAYES METHODS (107)
EQUATIONS (103)
ERROR ANALYSIS (103)
NOISE (102)
SHAPE (102)
DISTANCE MEASUREMENT (101)
MACHINE LEARNING ALGORITHMS (100)
ESTIMATION (99)
CORRELATION (98)
DETECTORS (94)
NATURAL LANGUAGE PROCESSING (91)
IMAGE COLOR ANALYSIS (89)
CONVERGENCE (86)
ARTIFICIAL NEURAL NETWORK (85)
COMPUTER VISION (85)
GAUSSIAN PROCESSES (85)
INTERNET (85)
ROBOTS (83)
IMAGE RECOGNITION (82)
SPEECH (82)
ENTROPY (81)
FEATURE SELECTION (81)
LABELING (81)
OBJECT RECOGNITION (81)
ADABOOST (80)
MANIFOLDS (80)
PROBABILITY DENSITY FUNCTION (80)
LEARNING ALGORITHM (78)
MULTILAYER PERCEPTRONS (78)
BAYESIAN METHODS (76)
SEMI-SUPERVISED LEARNING (76)
CONTEXT (75)
HISTOGRAMS (74)
SPEECH RECOGNITION (74)
FUZZY NEURAL NETS (73)
BAGGING (72)
IMAGE RETRIEVAL (71)
INCREMENTAL LEARNING (71)
REINFORCEMENT LEARNING (70)
ACTIVE LEARNING (68)
MEDICAL IMAGE PROCESSING (68)
APPROXIMATION METHODS (67)
BIOLOGICAL SYSTEM MODELING (67)
COMPUTATIONAL COMPLEXITY (67)

INFONA - science communication portal
