Search results for: Yu Zheng

Items from 1 to 6 out of 6 results

chapter

Question Classification Based on Incremental Modified Bayes

Li Ying-wei, Yu Zheng-tao, Meng Xiang-yan, Che Wen-gang, more

2008 Second International Conference on Future Generation Communication and Networking > 2 > 149 - 152

2008 Second International Conference on Future Generation Communication and Networking (FGCN)

How to use the incremental training corpus to improve the question classification accuracy rate in the process of question classification based on statistic learning. A question classification method based on the incremental modified Bayes was presented in this paper. The method used the modified Bayes and combined the incremental learning to correct the parameter by the incremental training set stage...

chapter

Active Exploration Planning in Reinforcement Learning for Inverted Pendulum System Control

Yu Zheng, Si-Wei Luo, Zi-Ang Lv

2006 International Conference on Machine Learning and Cybernetics > 2805 - 2809

Proceedings of 2006 International Conference on Machine Learning and Cybernetics

Reinforcement learning method usually require that all actions be tried in all state infinitely often for convergence. Such algorithms are impractical to be applied to sophisticated systems due to its low learning efficiency. This paper analyses the problem of limit cycles exist in reinforcement learning for inverted pendulum system control and proposed active exploration planning policy. The algorithm...

chapter

A New Geometric Approach to the Complexity of Model Selection

Ziang Lv, Siwei Luo, Yunhui Liu, Yu Zheng

2006 5th IEEE International Conference on Cognitive Informatics > 1 > 268 - 273

2006 5th IEEE International Conference on Cognitive Informatics

Model selection is one of the central problems of machine learning. The goal of model selection is to select from a set of competing explanations the best one that capture the underlying regularities of given observations. The criterion of a good model is generalizability. We must make balance between the goodness of fit and the complexity of the model to obtain good generalization. Most of present...

chapter

Control Double Inverted Pendulum by Reinforcement Learning with Double CMAC Network

Yu Zheng, Siwei Luo, Ziang Lv

18th International Conference on Pattern Recognition (ICPR'6) > 4 > 639 - 642

2006 18th International Conference on Pattern Recognition

To accelerate the learning of reinforcement learning, many types of function approximation are used to represent state value. However function approximation reduces the accuracy of state value, and brings difficulty in the convergence. To solve the problems of tradeoff between the generalization and accuracy in reinforcement learning, we represent state-action value by two CMAC networks with different...

chapter

Greedy exploration policy of Q-learning based on state balance

Yu Zheng, Siwei Luo, Jing Zhang

TENCON 2005 - 2005 IEEE Region 10 Conference > 1 - 4

TENCON 2005 - 2005 IEEE Region 10 Conference

Q-learning is one of the successfully established algorithms for the reinforcement learning, which has been widely used to the intelligent control system, such as the control of robot pose. However, curse of dimensionality and difficulty in convergence exist in Q-learning arising from random exploration policy. In this paper, we propose a greedy exploration policy of Q-learning with rule guidance...

chapter

The negative effect on the control of inverted pendulum caused by the limit cycle in reinforcement learning

Yu Zheng, Siwei Luo, Ziang Lv

2005 International Conference on Neural Networks and Brain > 2 > 772 - 775

Proceedings of 2005 International Conference on Neural Networks and Brain

Control inverted pendulum is one of important applied regions of reinforcement learning. This paper analyzes negative effect on the control of inverted pendulum caused by the limit cycle. It points out the limit cycle will make Q-value converge to zero, and destroy the stabilization of the optimal control policy. Moreover higher degree of exploration can not overcome this problem, but rather intensify...

Filter options

Keywords:
LEARNING (ARTIFICIAL INTELLIGENCE)

Publication date

Set your own date range

Keywords

REINFORCEMENT LEARNING (4)
GENERALISATION (ARTIFICIAL INTELLIGENCE) (2)
GENERALIZATION (2)
NONLINEAR CONTROL SYSTEMS (2)
PENDULUMS (2)
ACCURACY (1)
ACTIVE EXPLORATION PLANNING (1)
BAYES (1)
BAYES METHODS (1)
CEREBELLAR MODEL ARITHMETIC COMPUTERS (1)
COMPUTATIONAL COMPLEXITY (1)
DOUBLE CMAC NETWORK (1)
DOUBLE INVERTED PENDULUM CONTROL (1)
EXPLORATION POLICY (1)
FUNCTION APPROXIMATION (1)
GAUSS-KRONECKER CURVATURE (1)
GEOMETRIC APPROACH (1)
GEOMETRICAL COMPLEXITY (1)
GEOMETRY (1)
GREEDY EXPLORATION POLICY (1)
INCREMENTAL LEARNING (1)
INCREMENTAL MODIFIED BAYES (1)
INCREMENTAL TRAINING CORPUS (1)
INFORMATION PROCESSING (1)
INTELLIGENT CONTROL (1)
INTELLIGENT CONTROL SYSTEM (1)
INTRINSIC COMPLEXITY (1)
INVERTED PENDULUM (1)
INVERTED PENDULUM CONTROL (1)
INVERTED PENDULUM SYSTEM CONTROL (1)
LIMIT CYCLE (1)
MACHINE LEARNING (1)
MODEL SELECTION (1)
MODEL SELECTION COMPLEXITY (1)
NEUROCONTROLLERS (1)
NONLINEAR SYSTEMS (1)
NONOPTIMAL ACTION EXPLORATION (1)
OCCAM'S RAZOR (1)
OPTIMAL CONTROL POLICY (1)
PROBABILITY (1)
Q-LEARNING (1)
Q-VALUE CONVERGE (1)
QUESTION CLASSIFICATION (1)
RANDOM EXPLORATION (1)
REINFORCEMENT LEARNING METHOD (1)
RULE GUIDANCE (1)
STABILITY (1)
STATE BALANCE (1)
STATE-ACTION VALUE (1)
STATISTIC LEARNING (1)
STATISTICAL MANIFOLD (1)
SUBOPTIMAL CONTROL (1)
SUBOPTIMAL CONTROL ACTION (1)
SUPPORT VECTOR MACHINES (1)
TESTING (1)
TEXT ANALYSIS (1)
TEXT CATEGORIZATION (1)
TRAINING (1)
YUNNAN TOURISM (1)
more

INFONA - science communication portal

Search results for: Yu Zheng

Question Classification Based on Incremental Modified Bayes

Active Exploration Planning in Reinforcement Learning for Inverted Pendulum System Control

A New Geometric Approach to the Complexity of Model Selection

Control Double Inverted Pendulum by Reinforcement Learning with Double CMAC Network

Greedy exploration policy of Q-learning based on state balance

The negative effect on the control of inverted pendulum caused by the limit cycle in reinforcement learning

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options