Search results

Items from 1 to 9 out of 9 results

chapter

Real-valued Q-learning in multi-agent cooperation

Kao-Shing Hwang, Chia-Yue Lo, Kim-Joan Chen

2009 IEEE International Conference on Systems, Man and Cybernetics > 395 - 400

2009 IEEE International Conference on Systems, Man and Cybernetics. SMC 2009

In this paper, we propose Q-learning with a continuous action policy and extend the algorithm to a multi-agent system. We evaluate the algorithm on a task in which two robots act independently but are connected by a straight bar. The robots must cooperate to move to the goal and avoid the obstacles in the environment. Conventional Q-learning needs a pre-defined and discrete state space...

chapter

Positional consensus in multi-agent systems using a broadcast control mechanism

K. Das, D. Ghose

2009 American Control Conference > 5731 - 5736

2009 American Control Conference (ACC-09)

In this paper, a strategy for controlling a group of agents to achieve positional consensus is presented. The proposed technique is based on the constraint that every agent must be given the same control input through a broadcast communication mechanism. Although the control command is computed using state information in a global framework, the control input is implemented by the agents in a local...

chapter

Rapidly convergent leader-enabled multi-agent deployment into planar curves

P. Frihauf, M. Krstic

2009 American Control Conference > 1994 - 1999

2009 American Control Conference (ACC-09)

We introduce an approach for stable deployment of agents into planar curves (1-D formations in 2-D space) parameterized by the agent index. Stability is ensured by leader feedback, which is designed in a manner similar to boundary control of PDEs. By discretizing the model and the PDE controllers with respect to the continuous agent index, we obtain control laws for the discrete follower agents and...

chapter

Formation shape and orientation control using projected collinear tensegrity structures

D. Pais, Ming Cao, N.E. Leonard

2009 American Control Conference > 610 - 615

2009 American Control Conference (ACC-09)

The goal of this work is to stabilize the shape and orientation of formations of N identical and fully actuated agents, each governed by double-integrator dynamics. Using stability and rigidity properties inherent to tensegrity structures, we first design a tensegrity-based, globally exponentially stable control law in one dimension. This stabilizes given inter-agent spacing along the line, thereby...

chapter

Supervised self-organization of large homogeneous Swarms using Ergodic Projections of Markov Chains

I. Chattopadhyay, A. Ray

2009 American Control Conference > 2922 - 2927

2009 American Control Conference (ACC-09)

This paper formulates a self-organization algorithm to address the problem of emergent behavior supervision in engineered swarms of arbitrary population size. Based on collections of independent identical finite-state agents, the algorithm is derived to compute necessary perturbations in the local agents' behavior, which guarantees convergence to the desired observed state of the swarm. A simulation...

chapter

Decentralized centroid estimation for multi-agent systems in absence of any common reference frame

M. Franceschelli, A. Gasparri

2009 American Control Conference > 512 - 517

2009 American Control Conference (ACC-09)

In this paper, a novel distributed algorithm to deal with the problem of estimating the network centroid in a multi-agent system is proposed. In this scenario, agents are assumed to be lacking any global reference frame or absolute position information. The proposed algorithm can be thought of as a general tool to retrieve information about the centroid of a network of agents. Indeed, this allows to...

chapter

Analysis and Design of an Improved R-learning

Wei Chen, Zhenkun Zhai, Xiong Li, Jing Guo

2009 Eighth IEEE/ACIS International Conference on Computer and Information Science > 48 - 52

2009 8th IEEE/ACIS International Conference on Computer and Information Science (ICIS)

This paper presents a modified R-learning algorithm based on the traditional average reward reinforcement learning algorithm. Reinforcement learning problems constitute an important class of learning and control problems faced by artificial intelligence systems. The general framework of reinforcement learning can be divided into two forms, discounted reward reinforcement learning and average reward reinforcement...

chapter

Formation control of multiple marine vehicles based passivity-control design

J. Ghommam, F. Mnif, O. Calvo

2009 6th International Multi-Conference on Systems, Signals and Devices > 1 - 7

2009 6th International Multi-Conference on Systems, Signals and Devices

This paper addresses the problem of coordinated path-following control of multiple autonomous vehicles. Stated briefly, the problem consists in steering a group of vehicles along specified paths while holding a desired inter-ship formation pattern. Path-following for each vehicle amounts to reducing an appropriately defined geometric error to zero. We first show a passivity property for the path...

chapter

Behavior modes for randomized robotic coverage

J. Beal, N. Correll, L. Urbina, J. Bachrach

2009 Second International Conference on Robot Communication and Coordination > 1 - 6

2009 Second International Conference on Robot Communication and Coordination. RoboComm 2009

A basic primitive in a networked robotic swarm is to form a connected component that covers some area with relatively uniform density. Although most approaches to the problem require local coordinate information, it has been proposed that robots with only connectivity information do this instead with a generalized form of diffusion-limited aggregation, in which robots wander randomly until they find...

Filter options

Keywords:
CONVERGENCE
PROBABILITY DENSITY FUNCTION
MULTI-ROBOT SYSTEMS

