Search results

Items from 1 to 5 out of 5 results

chapter

Effective lazy training method for deep q-network in obstacle avoidance and path planning

Juan Wu, Seabyuk Shin, Cheong-Gil Kim, Shin-Dug Kim

2017 IEEE International Conference on Systems, Man, and Cybernetics (SMC) > 1799 - 1804

2017 IEEE International Conference on Systems, Man and Cybernetics (SMC)

Deep reinforcement learning technique combines reinforcement learning and neural network for various applications. This paper is to propose an effective lazy training method for deep reinforcement learning, especially for deep Q-network combining neural network with Q-learning to be used for the obstacle avoidance and path planning applications. The proposed method can reduce the overall training...

chapter

Deep reinforcement learning for SPORADIC rewards with HUMAN experience

Harshit Sinha

2017 Second International Conference on Electrical, Computer and Communication Technologies (ICECCT) > 1 - 4

2017 Second International Conference on Electrical, Computer and Communication Technologies (ICECCT)

This paper presents the ideal approach to how to minimize the time taken by reinforcement learning to train the model. Similar to Computer vision the progress in reinforcement learning is not influenced by new ideas but mostly by the computation, large data, infrastructure and efficiency of algorithm. These 4 things only influenced the reinforcement learning RL model. How much time it will take to...

chapter

Network grafting: Transferring learned features from trained neural networks

Yang Li, Xiaoming Tao, Jianhua Lu

2012 IEEE International Conference on Computational Intelligence and Cybernetics (CyberneticsCom) > 40 - 44

2012 IEEE International Conference on Computational Intelligence and Cybernetics (CyberneticsCom)

The degree of abundance of labeled training data is an important factor in determining the performance of supervised machine learning systems. However, in some applications, labeled data are either costly to collect or easily outdated, resulting in poor generalization of trained machine learners. Nonetheless, there are often related domains where large corpuses of labeled data can be easily obtained...

chapter

Incremental Learning for Multitask Pattern Recognition Problems

S. Ozawa, A. Roy

2008 Seventh International Conference on Machine Learning and Applications > 747 - 751

2008 Seventh International Conference on Machine Learning and Applications

This paper presents a learning model of multitask pattern recognition (MTPR) which is constructed by several neural classifiers, long-term memories, and the detector of task changes. In the MTPR problem, several multi-class classification tasks are sequentially given to the learning model without notifying their task categories. This implies that the learning model is supposed to detect task changes...

chapter

A novel learning framework of CMAC via grey-area-time credit apportionment and grey learning rate

Po-Lun Chang, Ying-Kuei Yang, Horng-Lin Shieh

2008 International Conference on Machine Learning and Cybernetics > 6 > 3096 - 3101

2008 International Conference on Machine Learning and Cybernetics (ICMLC)

The advantages of CMAC neural network are fast learning convergence, capable of mapping nonlinear functions quickly due to its local generalization of weight updating, simple architecture, easily processing and hardware implementation. In the training phase, the disadvantage of some CMAC models with a larger fixed learning rate is the unstable phenomenon. The smaller learning rate would cause slower...

Filter options

Data set:
ieee
Keywords:
TRAINING
LEARNING SYSTEMS
MACHINE LEARNING
NEURAL NETWORK

Publication date

Set your own date range

Keywords

ACCURACY (2)
LEARNING (ARTIFICIAL INTELLIGENCE) (2)
ARTIFICIAL INTELLIGENCE (1)
AVATARS (1)
BACKPROPAGATION (1)
BARS (1)
BIOLOGICAL NEURAL NETWORKS (1)
CEREBELLAR MODEL ARITHMETIC COMPUTERS (1)
CLASSIFICATION ALGORITHMS (1)
CMAC (1)
COLLISION AVOIDANCE (1)
CONVERGENCE (1)
CREDIT APPORTIONMENT (1)
DEEP Q NETWORK (1)
DEEP REINFORCEMENT LEARNING (1)
EQUATIONS (1)
ERROR ANALYSIS (1)
FEATURE TRANSFER (1)
GAMES (1)
GREY LEARNING RATE (1)
GREY RELATIONAL ANALYSIS (1)
GREY RELATIONAL GRADE (1)
GREY SYSTEMS (1)
GREY-AREA-TIME CREDIT APPORTIONMENT (1)
INCREMENTAL LEARNING (1)
INCREMENTAL LEARNING MODEL (1)
INTERFERENCE (1)
KNOWLEDGE ACQUISITION (1)
LEARNING INTERFERENCE (1)
MACHINE LEARNING ALGORITHMS (1)
MATHEMATICAL MODEL (1)
MULTI-LAYER NEURAL NETWORK (1)
MULTICLASS CLASSIFICATION TASK (1)
MULTITASK LEARNING (1)
MULTITASK PATTERN RECOGNITION (1)
NEURAL CLASSIFIERS (1)
NEURAL NETWORKS (1)
NONLINEAR FUNCTIONS (1)
NONLINEAR FUNCTIONS MAPPING (1)
PATH PLANNING (1)
PATTERN CLASSIFICATION (1)
PATTERN RECOGNITION (1)
TRAINING DATA (1)
TRANSFER LEARNING (1)
VECTORS (1)
more

INFONA - science communication portal

Search results

Effective lazy training method for deep q-network in obstacle avoidance and path planning

Deep reinforcement learning for SPORADIC rewards with HUMAN experience

Network grafting: Transferring learned features from trained neural networks

Incremental Learning for Multitask Pattern Recognition Problems

A novel learning framework of CMAC via grey-area-time credit apportionment and grey learning rate

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options