In this paper, we focus on the basic form of the autonomous follow-driving problem with one leader and one follower. A reinforcement learning based throttle and brake control approach is developed for the follower vehicle. A near-optimal control law is learned directly by “trial and error” with the neural dynamic programming algorithm. According to the timely updated following state, the learned control...
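The abstract's neural dynamic programming method learns the control law online; as a loose stand-in for that trial-and-error loop, the sketch below uses tabular Q-learning on a coarsely discretized car-following task. The state bins, actions, dynamics, and reward here are illustrative assumptions, not the paper's setup.

```python
import numpy as np

# Gap-error bins for the follower; bin 2 = desired spacing behind the leader.
N_BINS = 5
BRAKE, HOLD, THROTTLE = 0, 1, 2

def step(s, a):
    """Deterministic toy dynamics: braking widens the gap (s+1),
    throttle closes it (s-1), hold keeps it unchanged."""
    delta = 1 if a == BRAKE else -1 if a == THROTTLE else 0
    s2 = int(np.clip(s + delta, 0, N_BINS - 1))
    return s2, -abs(s2 - 2)  # penalize deviation from the desired gap

rng = np.random.default_rng(0)
Q = np.zeros((N_BINS, 3))
gamma, lr = 0.9, 0.5
for _ in range(20000):       # trial-and-error updates from random states
    s, a = rng.integers(N_BINS), rng.integers(3)
    s2, r = step(s, a)
    Q[s, a] += lr * (r + gamma * Q[s2].max() - Q[s, a])

policy = Q.argmax(axis=1)    # gap too small -> brake; too large -> throttle
```

After enough random trials the greedy policy brakes when the gap is too small (bin 0), applies throttle when it is too large (bin 4), and holds at the desired spacing.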
Routing in scenarios with dynamically changing node locations is quite challenging and time consuming. Emerging wireless communication networks, such as LTE-Advanced, 5G, and device-to-device communications, present such dynamically changing node locations. In mobile ad hoc networks, we very often come across such scenarios. In the Internet of Things (IoT), we will come...
We consider a modular approach to reinforcement learning that represents uncertainty in model parameters by maintaining probability distributions over them. The algorithm, which we call MBDP (model-based Bayesian dynamic programming), can be decomposed into two parallel types of inference: model learning and policy learning. During model learning, we update the posterior distributions of the model over observations...
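The two inference steps the abstract names can be sketched in a minimal tabular form: model learning as a Dirichlet posterior update over transition counts, and policy learning as value iteration on the posterior-mean model. The prior, the toy MDP, and the rewards below are illustrative assumptions, not the paper's formulation.

```python
import numpy as np

def mbdp_step(counts, rewards, gamma=0.95, alpha=1.0, iters=200):
    """One round of model-based Bayesian dynamic programming:
    model learning (Dirichlet posterior from transition counts),
    then policy learning (value iteration on the posterior mean)."""
    S, A, _ = counts.shape
    # Model learning: posterior-mean transition probabilities under a
    # symmetric Dirichlet(alpha) prior over next states.
    post = counts + alpha
    P = post / post.sum(axis=2, keepdims=True)   # shape (S, A, S)
    # Policy learning: value iteration on the learned model.
    V = np.zeros(S)
    for _ in range(iters):
        Q = rewards + gamma * (P @ V)            # shape (S, A)
        V = Q.max(axis=1)
    return Q.argmax(axis=1), V

# Toy 2-state, 2-action chain: action 1 tends to reach state 1,
# and state 1 is the rewarding state.
counts = np.array([[[5., 1.], [1., 5.]],
                   [[5., 1.], [1., 5.]]])
rewards = np.array([[0.0, 0.0], [1.0, 1.0]])
policy, V = mbdp_step(counts, rewards)
```

As observations accumulate in `counts`, the posterior concentrates and the planned policy tracks the true dynamics; here the learned policy prefers action 1 in both states, since it steers toward the rewarding state.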
Using reinforcement learning (RL), this paper addresses the problem of call admission control (CAC) and routing in differentiated-services Wavelength Division Multiplexing (WDM) networks, with the objective of maximizing system revenue. The problem is formulated as a finite-state discrete-time dynamic programming problem. Here we adopt the RL method together with a decomposition approach to solve this...
This paper proposes a new algorithm that employs Adaptive Dynamic Programming (ADP) to solve the distributed control problem of urban traffic over an infinite horizon. Urban traffic congestion leads to considerable wasted time and exhaust emissions, so alleviating congestion benefits both the economy and the environment. Signal control at urban intersections is an effective...
Research in reinforcement learning has produced algorithms for optimal decision making under uncertainty that fall into two main types. The first employs a Bayesian framework, where optimality improves with increased computational time, because the resulting planning task takes the form of a dynamic programming problem on a belief tree with an infinite number of states. The second type...
Task decomposition and state abstraction are crucial techniques in reinforcement learning. They allow an agent to ignore aspects of its current state that are irrelevant to its current decision, and therefore speed up dynamic programming and learning. This paper presents the SVI algorithm, which uses a dynamic Bayesian network model to construct an influence graph that indicates relationships between state...
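The state-abstraction idea the abstract describes can be sketched as a reachability check on the influence graph: a state variable is kept only if it can affect reward through the DBN's influence edges. The variable names and the tiny graph below are illustrative assumptions, not the paper's domain.

```python
def relevant_variables(parents, reward_parents):
    """parents[v] = state variables at time t that influence v at t+1.
    A variable is relevant if it can reach the reward by walking the
    influence edges backward; everything else can be abstracted away."""
    relevant, frontier = set(reward_parents), list(reward_parents)
    while frontier:
        v = frontier.pop()
        for p in parents.get(v, ()):
            if p not in relevant:
                relevant.add(p)
                frontier.append(p)
    return relevant

# Toy DBN: reward depends on 'position'; 'position' depends on 'velocity';
# 'music_volume' influences nothing that the reward can see.
parents = {"position": ["position", "velocity"],
           "velocity": ["velocity"],
           "music_volume": ["music_volume"]}
keep = relevant_variables(parents, ["position"])
```

Here the abstraction keeps `position` and `velocity` but drops `music_volume`, shrinking the state space dynamic programming must sweep.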
Storage virtualization provides abstraction over pervasive storage graphs. Data management operations require pervasive search utilities to discover entities and services. In search methods, algorithmic complexity, such as memory-bound problems, error-convergence issues, and the need for supervised training, is prohibitive for large state and solution spaces or high-dimensional state spaces. In addition, among popular...
In this paper, an analytical comparison is made between dynamic programming and reinforcement learning methods in dynamic two-player games. The emphasis is on the large number of states and actions available to each player and the conflicting optimization objectives that make these games complicated to model and analyze. Optimization and decision making are carried out by quantifying...