Search results

Items from 1 to 20 out of 32 results

chapter

From Serve-on-Demand to Serve-on-Need: A Game Theoretic Approach

Yong Lin, F Makedon

2010 Ninth International Conference on Machine Learning and Applications > 31 - 36

2010 Ninth International Conference on Machine Learning and Applications (ICMLA 2010)

Everyone is familiar with the scenario, people demand or assign tasks to robots, and robots execute the tasks to serve people. We call such a model Serve-on-Demand. With the advancement of pervasive computing, machine learning and artificial intelligence, the robot service of the next generation will inevitably turn to actively and exactly meet people's needs, even without explicit demand. We call...

chapter

Public Goods Game Simulator with Reinforcement Learning Agents

ManChon U, Zhen Li

2010 Ninth International Conference on Machine Learning and Applications > 43 - 49

2010 Ninth International Conference on Machine Learning and Applications (ICMLA 2010)

As a famous game in the domain of game theory, both pervasive empirical studies as well as intensive theoretical analysis have been conducted and performed worldwide to research different public goods game scenarios. At the same time, computer game simulators are utilized widely for better research of game theory by providing easy but powerful visualization and statistics functionalities. However,...

chapter

Hierarchical learning approach for one-shot action imitation in humanoid robots

Yan Wu, Yiannis Demiris

2010 11th International Conference on Control Automation Robotics&Vision > 453 - 458

2010 11th International Conference on Control Automation Robotics & Vision (ICARCV 2010)

We consider the issue of segmenting an action in the learning phase into a logical set of smaller primitives in order to construct a generative model for imitation learning using a hierarchical approach. Our proposed framework, addressing the “how-to” question in imitation, is based on a one-shot imitation learning algorithm. It incorporates segmentation of a demonstrated template into a series of...

chapter

Towards a bounded-rationality model of multi-agent social learning in games

M Hemmati, N Sadati, M Nili

2010 10th International Conference on Intelligent Systems Design and Applications > 142 - 148

10th International Conference on Intelligent Systems Design and Applications (ISDA 2010)

This paper deals with the problem of multi-agent learning of a population of players, engaged in a repeated normal-form game. Assuming boundedly-rational agents, we propose a model of social learning based on trial and error, called “social reinforcement learning”. This extension of well-known Q-learning algorithm, allows players within a population to communicate and share their experiences with...

chapter

A study on personality identification using game based theory

C Y Yaakub, N Sulaiman, C W Kim

2010 2nd International Conference on Computer Technology and Development > 732 - 734

2nd International Conference on Computer Technology and Development (ICCTD 2010)

Game Based Personality Profiling Application had been develop as an application to solve the problems faced by a counselor in capturing and determine personality. There are many types of personality model and theory such as Jung's Sixteen personality, Myers Briggs Types Indicators (MBTI), 5 Big factors, and Kathrine Benziger's Personality. This application uses MBTI based concept as a guideline to...

chapter

Strategy and Fairness in Repeated Two-agent Interaction

Jianye Hao, Ho-fung Leung

2010 22nd IEEE International Conference on Tools with Artificial Intelligence > 2 > 3 - 6

2010 22nd International Conference on Tools with Artificial Intelligence (ICTAI 2010)

The criterion of fairness has not been given much attention in the research of multi-agent learning problem. We propose an adaptive strategy for agents to achieve fairness in repeated two-agent game with conflicting interests. In our strategy, each agent is equipped with inequity-averse based fairness model, and makes its decision according to its attractiveness for each action. Besides, each agent...

chapter

Imitation learning for task allocation

Felix Duvallet, Anthony Stentz

2010 IEEE/RSJ International Conference on Intelligent Robots and Systems > 3568 - 3573

2010 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2010)

At the heart of multi-robot task allocation lies the ability to compare multiple options in order to select the best. In some domains this utility evaluation is not straightforward, for example due to complex and unmodeled underlying dynamics or an adversary in the environment. Explicitly modeling these extrinsic influences well enough so that they can be accounted for in utility computation (and...

chapter

Evolving diverse Ms. Pac-Man playing agents using genetic programming

A M Alhejali, S M Lucas

2010 UK Workshop on Computational Intelligence (UKCI) > 1 - 6

2010 UK Workshop on Computational Intelligence (UKCI)

This paper uses genetic programming (GP) to evolve a variety of reactive agents for a simulated version of the classic arcade game Ms. Pac-Man. A diverse set of behaviours were evolved using the same GP setup in three different versions of the game. The results show that GP is able to evolve controllers that are well-matched to the game used for evolution and, in some cases, also generalise well to...

chapter

Rule fusion for the imitation of a human tutor

V Scesa, C Raievsky, S Sanchez, H Luga, more

Proceedings of the 2010 IEEE Conference on Computational Intelligence and Games > 154 - 161

2010 IEEE Information Theory Workshop (ITW 2010)

In virtual worlds, character credibility suffers from an increasing discrepancy between visual realism, physical modelling quality and behaviour simulation weakness. As behaviour credibility is firmly embedded in the eye of the human observer, it needs to be as close to human expectation as possible. In this study, we define a learning process able to build rule-based behaviour from the observation...

chapter

Learning the track and planning ahead in a car racing controller

J Quadflieg, M Preuss, O Kramer, Günter Rudolph

Proceedings of the 2010 IEEE Conference on Computational Intelligence and Games > 395 - 402

2010 IEEE Information Theory Workshop (ITW 2010)

We propose a robust approach for learning car racing track models from sensory data for the car racing simulator TORCS. Our track recognition system is based on the combination of an advanced preprocessing step of the sensory data and a simple classifier that delivers six types of track shapes similar to the ones a human would recognize. Out of these, establishing a complete track model is straightforward...

chapter

Learning Personal Agents with Adaptive Player Modeling in Virtual Worlds

Yilin Kang, Ah-Hwee Tan

2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology > 2 > 173 - 180

2010 IEEE/ACM International Conference on Web Intelligence-Intelligent Agent Technology (WI-IAT)

There has been growing interest in creating intelligent agents in virtual worlds that do not follow fixed scripts predefined by the developers, but react accordingly based on actions performed by human players during their interaction. In order to achieve this objective, previous approaches have attempted to model the environment and the user's context directly. However, a critical component for enabling...

chapter

Coevolutionary Temporal Difference Learning for small-board Go

Krzysztof Krawiec, Marcin Szubert

IEEE Congress on Evolutionary Computation > 1 - 8

2010 IEEE Congress on Evolutionary Computation

In this paper we apply Coevolutionary Temporal Difference Learning (CTDL), a hybrid of coevolutionary search and reinforcement learning proposed in our former study, to evolve strategies for playing the game of Go on small boards (5×5). CTDL works by interlacing exploration of the search space provided by one-population competitive coevolution and exploitation by means of temporal difference learning...

chapter

Learning of a ball-in-a-cup playing robot

Bojan Nemec, Matej Zorko, Leon Zlajpah

19th International Workshop on Robotics in Alpe-Adria-Danube Region (RAAD 2010) > 297 - 301

2010 IEEE 19th International Workshop on Robotics in Alpe-Adria-Danube Region (RAAD 2010)

In the paper we evaluate two learning methods applied to the ball-in-a-cup game. The first approach is based on imitation learning. The captured trajectory was encoded with Dynamic motion primitives (DMP). The DMP approach allows simple adaptation of the demonstrated trajectory to the robot dynamics. In the second approach, we use reinforcement learning, which allows learning without any previous...

chapter

Learning shape-proportion relationships from labeled humanoid cartoons

M T Islam, Yong Peng Why, G Ashraf

6th International Conference on Digital Content, Multimedia Technology and its Applications > 416 - 420

2010 6th International Conference on Digital Content, Multimedia Technology and its Applications (IDC 2010)

Character design artists typically use shape, pose and proportion as the first design layer to express role, physicality and personality traits. Inspired by this we approach the problem of automatic character synthesis by attempting to learn relations among the body-shape, proportions, pose, and trait labels from finished art. In our prior work, we have designed an online game framework to collect...

chapter

Could feedback-based self-learning help solve networked Prisoner's Dilemma?

Xiaojie Chen, Feng Fu, Long Wang

Proceedings of the 48h IEEE Conference on Decision and Control (CDC) held jointly with 2009 28th Chinese Control Conference > 1526 - 1531

2009 Joint 48th IEEE Conference on Decision and Control (CDC) and 28th Chinese Control Conference (CCC 2009)

We present a self-learning evolutionary Prisoner's Dilemma game model to study the evolution of cooperation in network-structured populations. During the evolutionary process, each agent updates its current strategy with a probability depending on the difference feedback between its actual score and score aspiration. Each agent's score is a weighed mean of its payoff coming from its neighbors (social...

chapter

The adaptive learning mechanism design for game agents' real-time behavior control

Yingying She, P. Grogono

2009 IEEE International Conference on Intelligent Computing and Intelligent Systems > 1 > 792 - 796

2009 IEEE International Conference on Intelligent Computing and Intelligent Systems (ICIS 2009)

In this paper, we present an approach of adaptive learning mechanism for game agents' real-time behavior control. This approach mainly focuses on how to generate game agent's adaptability in real-time. It is possible to apply our approach in complicated game character interactions by following the framework discussed in this paper. We consider the layered architecture, the behavior pattern and the...

chapter

Identification and Characteristic Descriptions of Procedural Chunks

J. Krivec, M. Guid, I. Bratko

2009 Computation World: Future Computing, Service Computation, Cognitive, Adaptive, Content, Patterns > 448 - 453

2009 Computation World: Future Computing, Service Computation, Cognitive, Adaptive, Content, Patterns (ComputationWorld 2009)

When dealing with cognitive architecture and behavior, chunks are one of the most well known and accepted constructs. Despite that, the nature of chunks still remains very elusive, especially with understanding chunks in procedural knowledge. Our attempt is to show the existence of chunks in procedural knowledge, define them, and describe their characteristics. With this purpose in mind, we use data...

chapter

Backpropagation without human supervision for visual control in Quake II

M. Parker, B.D. Bryant

2009 IEEE Symposium on Computational Intelligence and Games > 287 - 293

2009 IEEE Symposium on Computational Intelligence and Games (CIG)

Backpropagation and neuroevolution are used in a Lamarckian evolution process to train a neural network visual controller for agents in the Quake II environment. In previous work, we hand-coded a non-visual controller for supervising in backpropagation, but hand-coding can only be done for problems with known solutions. In this research the problem for the agent is to attack a moving enemy in a visually...

chapter

General game-playing systems

Y. Bjornsson

2009 IEEE Symposium on Computational Intelligence and Games > 1

2009 IEEE Symposium on Computational Intelligence and Games (CIG)

The aim of General Game Playing (GGP) is to create intelligent agents that can automatically learn how to play a wide variety of different games at an expert level without any human intervention. This requires that the agents be capable of learning diverse game-playing strategies from basic game rules without any game-specific knowledge being provided by their developers. A successful realization...

chapter

Learning to play Tic-tac-toe

D.H. Widyantoro, Y.G. Vembrina

2009 International Conference on Electrical Engineering and Informatics > 1 > 276 - 280

2009 International Conference on Electrical Engineering and Informatics (ICEEI)

This paper reports our experiment on applying Q Learning algorithm for learning to play Tic-tac-toe. The original algorithm is modified by updating the Q value only when the game terminates, propagating the update process from the final move backward to the first move, and incorporating a new update rule. We evaluate the agent performance using full-board and partial-board representations. In this...

Keywords:
LEARNING (ARTIFICIAL INTELLIGENCE)
HUMANS

Publication date

Set your own date range

INFONA - science communication portal

Search results

From Serve-on-Demand to Serve-on-Need: A Game Theoretic Approach

Public Goods Game Simulator with Reinforcement Learning Agents

Hierarchical learning approach for one-shot action imitation in humanoid robots

Towards a bounded-rationality model of multi-agent social learning in games

A study on personality identification using game based theory

Strategy and Fairness in Repeated Two-agent Interaction

Imitation learning for task allocation

Evolving diverse Ms. Pac-Man playing agents using genetic programming

Rule fusion for the imitation of a human tutor

Learning the track and planning ahead in a car racing controller

Learning Personal Agents with Adaptive Player Modeling in Virtual Worlds

Coevolutionary Temporal Difference Learning for small-board Go

Learning of a ball-in-a-cup playing robot

Learning shape-proportion relationships from labeled humanoid cartoons

Could feedback-based self-learning help solve networked Prisoner's Dilemma?

The adaptive learning mechanism design for game agents' real-time behavior control

Identification and Characteristic Descriptions of Procedural Chunks

Backpropagation without human supervision for visual control in Quake II

General game-playing systems

Learning to play Tic-tac-toe

Filter options

Publication date

Content availability

Publication type

Keywords

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options