Search results

Items from 81 to 100 out of 3,201 results

chapter

Simulating Human Behavior in Fighting Games Using Reinforcement Learning and Artificial Neural Networks

Matheus R. F. Mendonca, Heder S. Bernardino, Raul F. Neto

2015 14th Brazilian Symposium on Computer Games and Digital Entertainment (SBGames) > 152 - 159

2015 14th Brazilian Symposium on Computer Games and Digital Entertainment (SBGames)

The study of intelligent agent training is of great interest to the gaming industry due to its wide application in various game genres and its capabilities of simulating a human-like behavior. In this work two machine learning techniques, namely, a reinforcement learning approach and an Artificial Neural Network (ANN), are used in a fighting game in order to allow the agent/fighter to emulate a human...

chapter

Learning Trading Negotiations Using Manually and Automatically Labelled Data

Heriberto Cuayahuitl, Simon Keizer, Oliver Lemon

2015 IEEE 27th International Conference on Tools with Artificial Intelligence (ICTAI) > 904 - 911

2015 IEEE 27th International Conference on Tools with Artificial Intelligence (ICTAI)

Strategic conversational agents often need to trade resources with their opponent conversants -- and trading strategically can lead to better results. While rule-based or supervised agents can be used for such a purpose, here we explore a learning approach based on automatically labelled examples from human players for automatic trading in the game of Settlers of Catan. Our experiments are based on...

chapter

Continuous Action-Space Reinforcement Learning Methods Applied to the Minimum-Time Swing-Up of the Acrobot

Barry D. Nichols

2015 IEEE International Conference on Systems, Man, and Cybernetics > 2084 - 2089

2015 IEEE International Conference on Systems, Man, and Cybernetics (SMC)

Here I apply three reinforcement learning methods to the full, continuous action, swing-up acrobot control benchmark problem. These include two approaches from the literature, CACLA and NM-SARSA, and a novel approach which I refer to as Nelder Mead-SARSA. Nelder Mead-SARSA, like NM-SARSA, directly optimises the state-action value function for action selection, in order to allow continuous action reinforcement...

chapter

A Q-learning approach for aligning protein sequences

Ioan-Gabriel Mircea, Gabriela Czibula, Maria-Iuliana Bocicor

2015 IEEE International Conference on Intelligent Computer Communication and Processing (ICCP) > 51 - 58

2015 IEEE International Conference on Intelligent Computer Communication and Processing (ICCP)

Protein multiple sequence alignment is significant in the field of bioinformatics as it may reveal important information about the protein sequences' functional, structural or evolutionary relationships. It involves the alignment of three or more biological protein sequences and represents a real challenge both from a biological and a computational point of view. Q-learning is a reinforcement learning...

chapter

Learning compound multi-step controllers under unknown dynamics

Weiqiao Han, Sergey Levine, Pieter Abbeel

2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) > 6435 - 6442

2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

Applications of reinforcement learning for robotic manipulation often assume an episodic setting. However, controllers trained with reinforcement learning are often situated in the context of a more complex compound task, where multiple controllers might be invoked in sequence to accomplish a higher-level goal. Furthermore, training such controllers typically requires resetting the environment between...

chapter

Reinforcement learning of variable admittance control for human-robot co-manipulation

Fotios Dimeas, Nikos Aspragathos

2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) > 1011 - 1016

2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

In this paper, a variable admittance controller based on reinforcement learning is proposed for human-robot co-manipulation tasks. Setting as the goal of the reinforcement learning algorithm the minimisation of the jerk throughout a point-to-point movement, the proposed controller can learn the appropriate damping for effective cooperation without any prior knowledge of the target position or other...

chapter

Reinforcement learning and instance-based learning approaches to modeling human decision making in a prognostic foraging task

Suhas E. Chelian, Jaehyon Paik, Peter Pirolli, Christian Lebiere, et al.

2015 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob) > 116 - 122

2015 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)

Procedural memory and episodic memory are known to be distinct and both underlie the performance of many tasks. Reinforcement learning (RL) and instance-based learning (IBL) represent common approaches to modeling procedural and episodic memory in that order. In this work, we present a neural model utilizing RL dynamics and an ACT-R model utilizing IBL productions to the task of modeling human decision...

chapter

Seeing [u] aids vocal learning: Babbling and imitation of vowels using a 3D vocal tract model, reinforcement learning, and reservoir computing

Max Murakami, Bernd Kroger, Peter Birkholz, Jochen Triesch

2015 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob) > 208 - 213

2015 Joint IEEE International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)

We present a model of imitative vocal learning consisting of two stages. First, the infant is exposed to the ambient language and forms auditory knowledge of the speech items to be acquired. Second, the infant attempts to imitate these speech items and thereby learns to control the articulators for speech production. We model these processes using a recurrent neural network and a realistic vocal tract...

chapter

A deep reinforcement learning approach to character segmentation of license plate images

Farnaz Abtahi, Zhigang Zhu, Aaron M. Burry

2015 14th IAPR International Conference on Machine Vision Applications (MVA) > 539 - 542

2015 14th IAPR International Conference on Machine Vision Applications (MVA)

Automated license plate recognition (ALPR) has been applied to identify vehicles by their license plates and is critical in several important transportation applications. In order to achieve the recognition accuracy levels typically required in the market, it is necessary to obtain properly segmented characters. A standard method, projection-based segmentation, is challenged by substantial variation...

chapter

Adaptive traversability of partially occluded obstacles

Karel Zimmermann, Petr Zuzanek, Michal Reinstein, Tomas Petricek, et al.

2015 IEEE International Conference on Robotics and Automation (ICRA) > 3959 - 3964

2015 IEEE International Conference on Robotics and Automation (ICRA)

Controlling mobile robots with complex articulated parts and hence many degrees of freedom generates high cognitive load on the operator, especially under demanding conditions such as in Urban Search & Rescue missions. We propose a solution based on reinforcement learning in order to accommodate the robot morphology automatically to the terrain and the obstacles it traverses. In this paper, we...

chapter

Analyzing human's continuous learning processes with the reflection sub task

Tomohiro Yamaguchi, Kouki Takemori, Yuki Tamai, Keiki Takadama

2015 10th Asian Control Conference (ASCC) > 1 - 6

2015 10th Asian Control Conference (ASCC)

This paper reports our learning support system for a human learner to visualize his/her mental learning processes with invisible mazes for continuous learning. The objective of this research is to bring the learning ability of the learning agent close to that of a human. To fill in the missing piece of reinforcement learning whose learning process is mainly behavior change, we add two mental learning...

chapter

Differential Reward Mechanism Based Online Learning Algorithm for URL-based Topic Classification

Neetu Singh, Narendra S. Chaudhari

2014 International Conference on Computational Intelligence and Communication Networks > 589 - 596

2014 International Conference on Computational Intelligence and Communication Networks (CICN)

In this paper, we propose a differential reward based online learning algorithm for classifying web pages into predefined topics based on minimal text available in the URLs. It is then compared with two baseline methods, i.e., Support Vector Machine (SVM) and a state-of-the-art Reinforcement Learning Algorithm using recall, precision and F-measure scores. We conducted experiments on large scale Open...

chapter

Unsupervised neural controller for Reinforcement Learning action-selection: Learning to represent knowledge

Alexandros Gkiokas, Alexandra I. Cristea

12th Symposium on Neural Network Applications in Electrical Engineering (NEUREL) > 99 - 104

2014 12th Symposium on Neural Network Applications in Electrical Engineering (NEUREL 2014)

Constructing the correct Conceptual Graph representing some textual information requires a series of decisions, defined by vertex or edge creation. The process of creating Conceptual Graphs involves semiotics: the semantics, pragmatics and syntactics of the information, as well as graph structuralism and isomorphic projection, all described as decisions of a learning agent or system. The actual process...

chapter

RLRAUC: Reinforcement learning based ranking algorithm using user clicks

Vali Derhami, Javad Paksima, Homa Khajeh

2014 4th International Conference on Computer and Knowledge Engineering (ICCKE) > 29 - 34

2014 4th International eConference on Computer and Knowledge Engineering (ICCKE)

Because of the great volume of web information, the information retrieval process of a search engine is of great importance. For each user query, the number of queries can reach hundreds of thousands, whereas only the first few results have a chance of being checked by the user; therefore, a search engine must place relevant results in the top ranks. This paper introduces...

chapter

An effective hybrid model based on PSO-SVM algorithm with a new local search for feature selection

Ehsan Eslami, Mahdi Eftekhari

2014 4th International Conference on Computer and Knowledge Engineering (ICCKE) > 404 - 409

2014 4th International eConference on Computer and Knowledge Engineering (ICCKE)

Today, feature selection is an active research area in machine learning. The main idea of feature selection is to select a subset of available features by eliminating features with little or no predictive information. This paper presents a hybrid model with a new local search technique based on reinforcement learning for feature selection. We combined the particle swarm optimization (PSO) with support...

chapter

Improving reinforcement learning with interactive feedback and affordances

Francisco Cruz, Sven Magg, Cornelius Weber, Stefan Wermter

4th International Conference on Development and Learning and on Epigenetic Robotics > 165 - 170

2014 Joint IEEE International Conferences on Development and Learning and Epigenetic Robotics (ICDL-Epirob)

Interactive reinforcement learning constitutes an alternative for improving convergence speed in reinforcement learning methods. In this work, we investigate inter-agent training and present an approach for knowledge transfer in a domestic scenario where a first agent is trained by reinforcement learning and afterwards transfers selected knowledge to a second agent by instructions to achieve more...

chapter

Robot-assisted motor training: Assistance decreases exploration during reinforcement learning

Albert Sans-Muntadas, Jaime E. Duarte, David J. Reinkensmeyer

2014 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society > 3516 - 3520

2014 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC)

Reinforcement learning (RL) is a form of motor learning that robotic therapy devices could potentially manipulate to promote neurorehabilitation. We developed a system that requires trainees to use RL to learn a predefined target movement. The system provides higher rewards for movements that are more similar to the target movement. We also developed a novel algorithm that rewards trainees of different...

chapter

Adaptive reinforcement learning in box-pushing robots

K. S. Hwang, J. L. Ling, Wei-Han Wang

2014 IEEE International Conference on Automation Science and Engineering (CASE) > 1182 - 1187

2014 IEEE International Conference on Automation Science and Engineering (CASE)

In this paper, an adaptive state aggregation Q-Learning method, with the capability of multi-agent cooperation, was proposed to enhance the efficiency of reinforcement learning (RL) and applied to box-pushing tasks for humanoid robots. First, a decision tree was applied to partition the state space according to temporary differences in reinforcement learning, so that a real valued action domain could...

chapter

A connectionist actor-critic algorithm for faster learning and biological plausibility

Leonard Johard, Emanuele Ruffaldi

2014 IEEE International Conference on Robotics and Automation (ICRA) > 3903 - 3909

2014 IEEE International Conference on Robotics and Automation (ICRA)

We propose a novel biologically plausible actor-critic algorithm using policy gradients in order to achieve practical, model-free reinforcement learning. It does not rely on backpropagation and is the first neural actor-critic relying only on locally available information. We show it has an advantage over pure policy gradients methods for motor learning performance in the polecart problem. We are...

chapter

Adaptive Traversability of unknown complex terrain with obstacles for mobile robots

Karel Zimmermann, Petr Zuzanek, Michal Reinstein, Vaclav Hlavac

2014 IEEE International Conference on Robotics and Automation (ICRA) > 5177 - 5182

2014 IEEE International Conference on Robotics and Automation (ICRA)

In this paper we introduce the concept of Adaptive Traversability (AT), which we define as means of autonomous motion control adapting the robot morphology — configuration of articulated parts and their compliance — to traverse unknown complex terrain with obstacles in an optimal way. We verify this concept by proposing a reinforcement learning based AT algorithm for mobile robots operating in such...

Data set:
ieee
Keywords:
TRAINING
LEARNING (ARTIFICIAL INTELLIGENCE)


Content availability

Available (3,128)
None (73)

Keywords

ARTIFICIAL NEURAL NETWORKS (821)
CLASSIFICATION ALGORITHMS (791)
SUPPORT VECTOR MACHINES (762)
DATA MINING (755)
MACHINE LEARNING (739)
ACCURACY (685)
PATTERN CLASSIFICATION (654)
FEATURE EXTRACTION (640)
KERNEL (403)
NEURAL NETS (389)
IMAGE CLASSIFICATION (338)
TRAINING DATA (296)
NEURONS (273)
TESTING (258)
SUPPORT VECTOR MACHINE (252)
ALGORITHM DESIGN AND ANALYSIS (223)
DATA MODELS (218)
DATABASES (215)
OPTIMIZATION (206)
FACE (183)
FACE RECOGNITION (178)
MATHEMATICAL MODEL (177)
COMPUTATIONAL MODELING (171)
OBJECT DETECTION (167)
CLASSIFICATION (164)
PIXEL (162)
REGRESSION ANALYSIS (150)
BOOSTING (147)
SUPERVISED LEARNING (142)
PREDICTION ALGORITHMS (141)
PREDICTIVE MODELS (141)
NEURAL NETWORKS (140)
PRINCIPAL COMPONENT ANALYSIS (140)
HIDDEN MARKOV MODELS (136)
SVM (136)
HUMANS (131)
NEURAL NETWORK (129)
STATISTICAL ANALYSIS (129)
PATTERN CLUSTERING (127)
OPTIMISATION (126)
PATTERN RECOGNITION (123)
LEARNING SYSTEMS (121)
SUPPORT VECTOR MACHINE CLASSIFICATION (121)
GENETIC ALGORITHMS (120)
LEARNING (120)
RADIAL BASIS FUNCTION NETWORKS (120)
FUZZY SET THEORY (118)
TEXT ANALYSIS (117)
PROBABILITY (113)
VISUALIZATION (113)
CLUSTERING ALGORITHMS (110)
IMAGE SEGMENTATION (110)
ADAPTATION MODEL (109)
DECISION TREES (108)
BAYES METHODS (107)
EQUATIONS (103)
ERROR ANALYSIS (103)
NOISE (102)
SHAPE (102)
DISTANCE MEASUREMENT (101)
MACHINE LEARNING ALGORITHMS (100)
ESTIMATION (99)
CORRELATION (98)
DETECTORS (94)
NATURAL LANGUAGE PROCESSING (91)
IMAGE COLOR ANALYSIS (89)
CONVERGENCE (86)
ARTIFICIAL NEURAL NETWORK (85)
COMPUTER VISION (85)
GAUSSIAN PROCESSES (85)
INTERNET (85)
ROBOTS (83)
IMAGE RECOGNITION (82)
SPEECH (82)
ENTROPY (81)
FEATURE SELECTION (81)
LABELING (81)
OBJECT RECOGNITION (81)
ADABOOST (80)
MANIFOLDS (80)
PROBABILITY DENSITY FUNCTION (80)
LEARNING ALGORITHM (78)
MULTILAYER PERCEPTRONS (78)
BAYESIAN METHODS (76)
SEMI-SUPERVISED LEARNING (76)
CONTEXT (75)
HISTOGRAMS (74)
SPEECH RECOGNITION (74)
FUZZY NEURAL NETS (73)
BAGGING (72)
IMAGE RETRIEVAL (71)
INCREMENTAL LEARNING (71)
REINFORCEMENT LEARNING (70)
ACTIVE LEARNING (68)
MEDICAL IMAGE PROCESSING (68)
APPROXIMATION METHODS (67)
BIOLOGICAL SYSTEM MODELING (67)
COMPUTATIONAL COMPLEXITY (67)

INFONA - science communication portal
