Search results

Items from 1 to 9 out of 9 results

chapter

Practical aspects of the model-free learning control initialization

Krzysztof Stebel

2015 20th International Conference on Methods and Models in Automation and Robotics (MMAR) > 453 - 458

2015 20th International Conference on Methods and Models in Automation and Robotics (MMAR )

The paper presents aspects of model-free learning control initialization. Model-free learning has several advantages as general purpose approach or adaptive capability. However, practical implementation is not intuitive in each step. Choice of time scale in order to provide the necessary reactivity or level of granularity of states or control actions is not an easy problem. Before it learns environment...

chapter

Dynamic enhanced Inter-Cell Interference Coordination using reinforcement learning approach in Heterogeneous Network

Qi Li, Hailun Xia, Zhimin Zeng, Tiankui Zhang

2013 15th IEEE International Conference on Communication Technology > 239 - 243

2013 15th IEEE International Conference on Communication Technology (ICCT)

This paper investigates enhanced Inter-Cell Interference Coordination (eICIC) techniques for Heterogeneous Networks (HetNets), and models this strategic coexistence as a multi-player system in which interference management strategies inspired from a form of reinforcement learning known as distributed Q-learning are devised. Specifically, this paper focuses on time domain eICIC techniques in which...

chapter

Autonomous Navigation in Dynamic Environments with Reinforcement Learning and Heuristic

E D S Costa, Maury M Gouvea

2010 Ninth International Conference on Machine Learning and Applications > 37 - 42

2010 Ninth International Conference on Machine Learning and Applications (ICMLA 2010)

Researchers have created machines which operate autonomously in complex and changing environments. An important problem that has been widely studied is that of autonomous navigation systems, through which attempts have been made to create mechanisms with their own decision making in complex environments. Ideally, an autonomous navigation agent must have an ability to learn while working in its environment...

chapter

Building the knowledge base of a buyer agent using reinforcement learning techniques

George Boulougaris, Kostas Kolomvatsos, Stathes Hadjiefthymiades

The 2010 International Joint Conference on Neural Networks (IJCNN) > 1 - 8

2010 International Joint Conference on Neural Networks (IJCNN 2010)

Electronic markets are places where entities not known in advance can negotiate and agree upon the exchange of products. Intelligent agents can be proved very advantageous when representing entities in markets. Mostly, such entities are based on reputation models in order to conclude a transaction. However, reputation is not the only parameter that they could be based on. In this work, we deal with...

chapter

The improvement of Q-learning applied to imperfect information game

Jing Lin, Xuan Wang, Lijiao Han, Jiajia Zhang, more

2009 IEEE International Conference on Systems, Man and Cybernetics > 1562 - 1567

2009 IEEE International Conference on Systems, Man and Cybernetics. SMC 2009

There exist problems of slow convergence and local optimum in standard Q-learning algorithm. Truncated TD estimate returns efficiency and simulated annealing algorithm increase the chance of exploration. To accelerate the algorithm convergence speed and to avoid results in local optimum, this paper combines Q-learning algorithm, truncated TD estimation and simulated annealing algorithm. We apply improved...

chapter

Reinforcement learning based Dynamic Network Self-optimization for heterogeneous networks

Zhiyong Feng, Li Tan, Wei Li, T.A. Gulliver

2009 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing > 319 - 324

2009 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing (PacRim)

The coexistence of different heterogeneous Radio Access Technologies (RATs) is a significant feature of current wireless networks. Thus, it is important for network elements, such as the Base Stations (BSs) of cellular networks or access points (APs) of wireless local area networks (WLANs) to be reconfigurable according to the real-time network environment. This will enable interconnection between...

chapter

A reinforcement learning approach to dynamic optimization of load allocation in AGC system

Y.M. Wang, Q.J. Liu, T. Yu

2009 IEEE Power&Energy Society General Meeting > 1 - 6

2009 IEEE Power & Energy Society General Meeting (PES)

A Reinforcement Learning (RL) method applied to the dynamic load allocation in AGC system is presented. The problem can be modeled as a Markov Decision Process (MDP). The Q-learning algorithm as a model-free learning algorithm is introduced. It learns an optimal action strategy by experience from exploring an unknown system and getting rewards. Rewards are chosen to express how well actions control...

chapter

Real Time Demand Learning-Based Q-learning Approach for Dynamic Pricing in E-retailing Setting

Yan Cheng

2009 International Symposium on Information Engineering and Electronic Commerce > 594 - 598

2009 International Symposium on Information Engineering and Electronic Commerce (IEEC)

Information technology has given e-retailers new capability of learning demand in real time. This paper investigates how to integrate this real time learning technology with Q-learning algorithm for the optimization of dynamic pricing in e-retailing setting. Especially, this paper studies the optimal dynamic pricing problem for seasonal and style products in e-retailing setting, and validate our approach...

chapter

Development of reinforcement learning methods in control and decision making in the large scale dynamic game environments

S. Orafa, M.J. Yazdanpanah, C. Lucas, A. Rahimikian, more

2006 IEEE Conference on Computer Aided Control System Design, 2006 IEEE International Conference on Control Applications, 2006 IEEE International Symposium on Intelligent Control > 850 - 855

2006 IEEE Conference on Computer Aided Control System Design, 2006 IEEE International Conference on Control Applications, 2006 IEEE International Symposium on Intelligent Control

In this paper, an analytical comparison is done between dynamic programming and reinforcement learning methods in dynamic two-player games. The emphasis is on the large number of states and actions available for each player and different conflictive optimization objectives of these games that make them complicated in modeling and analysis. Optimization and decision making is done through quantifying...

Filter options

Keywords:
HEURISTIC ALGORITHMS
LEARNING (ARTIFICIAL INTELLIGENCE)
Q-LEARNING ALGORITHM

Publication date

Set your own date range

Keywords

REINFORCEMENT LEARNING (5)
DATA MINING (3)
MATHEMATICAL MODEL (3)
Q-LEARNING (3)
ALGORITHM DESIGN AND ANALYSIS (2)
APPROXIMATION ALGORITHMS (2)
ELECTRONIC COMMERCE (2)
EQUATIONS (2)
GAME THEORY (2)
GAMES (2)
INTELLIGENT AGENT (2)
LEARNING (2)
MARKOV PROCESSES (2)
SOFTWARE AGENTS (2)
3G MOBILE COMMUNICATION (1)
ADAPTATION MODEL (1)
AGC SYSTEM (1)
ALGORITHM CONVERGENCE SPEED (1)
ALMOST BLANK SUBFRAME (ABS) CONFIGURATION (1)
AUTOMATIC GENERATION CONTROL (1)
AUTONOMOUS NAVIGATION (1)
AUTONOMOUS NAVIGATION AGENT (1)
AUTONOMOUS NAVIGATION SYSTEM (1)
BIOLOGICAL SYSTEM MODELING (1)
BUYER AGENT (1)
CHINA SOUTHERN POWER GRID MODEL (1)
CONFLICTIVE OPTIMIZATION OBJECTIVES (1)
CONSUMER ELECTRONICS (1)
CONTROL ENGINEERING COMPUTING (1)
CONTROL SYSTEMS (1)
CONVERGENCE (1)
COST ACCOUNTING (1)
CPS (1)
DECISION MAKING (1)
DECISION THEORY (1)
DEMAND LEARNING (1)
DISTRIBUTED ALGORITHMS (1)
DISTRIBUTED RECONFIGURATION ALGORITHM (1)
DYNAMIC ENVIRONMENT (1)
DYNAMIC ENVIRONMENTS (1)
DYNAMIC LOAD ALLOCATION (1)
DYNAMIC LOAD ALLOCATION OPTIMIZATION (1)
DYNAMIC NETWORK SELFOPTIMIZATION ALGORITHM (1)
DYNAMIC PROGRAMMING (1)
DYNAMIC Q-TABLE CREATION (1)
DYNAMIC TWO-PLAYER GAMES (1)
E-COMMERCE (1)
E-RETAILING (1)
ELECTRONIC MARKET (1)
ENHANCED INTER-CELL INTERFERENCE COORDINATION (EICIC) (1)
ESTIMATION (1)
ESTIMATION THEORY (1)
FORCE (1)
GREEDY HEURISTIC (1)
HETEROGENEOUS NETWORK (1)
HETEROGENEOUS NETWORKS (HETNET) (1)
HUMAN INTELLIGENCE (1)
IMPERFECT INFORMATION GAME (1)
INDEXES (1)
INTELLIGENT CONTROL (1)
INTERFERENCE (1)
KNOWLEDGE ACQUISITION (1)
KNOWLEDGE BASE (1)
LARGE SCALE DYNAMIC GAME ENVIRONMENTS (1)
LEARNING CONTROL (1)
LOAD MANAGEMENT (1)
LOAD MODELING (1)
MACROCELL NETWORKS (1)
MARKETING AND SALES (1)
MARKOV DECISION PROCESS (1)
MDP (1)
MILITARY COMPUTING (1)
MOBILE ROBOTS (1)
MODEL-FREE CONTROL (1)
MODEL-FREE LEARNING ALGORITHM (1)
MULTI-AGENT REINFORCEMENT LEARNING PROBLEM (1)
MULTI-AGENT SYSTEMS (1)
NAVIGATION (1)
ONLINE LEARNING (1)
OPTIMAL DYNAMIC PRICING PROBLEM (1)
OPTIMISATION (1)
OPTIMIZATION (1)
PATH PLANNING (1)
POWER GENERATION CONTROL (1)
POWER GRIDS (1)
POWER SYSTEM DYNAMICS (1)
POWER SYSTEM SIMULATION (1)
PRICING (1)
PROCESS CONTROL (1)
PRODUCT EXCHANGE (1)
PRODUCT NEGOTIATION (1)
QUALITY OF SERVICE (1)
RADIO ACCESS NETWORK (1)
RADIO ACCESS NETWORKS (1)
RADIO ACCESS TECHNOLOGY (1)
RATS (1)
REAL TIME DEMAND LEARNING TECHNOLOGY (1)
more

INFONA - science communication portal

Search results

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options