Search results

chapter

Continuous Action-Space Reinforcement Learning Methods Applied to the Minimum-Time Swing-Up of the Acrobot

Barry D. Nichols

2015 IEEE International Conference on Systems, Man, and Cybernetics > 2084 - 2089

2015 IEEE International Conference on Systems, Man, and Cybernetics (SMC)

Here I apply three reinforcement learning methods to the full, continuous action, swing-up acrobot control benchmark problem. These include two approaches from the literature, CACLA and NM-SARSA, and a novel approach which I refer to as Nelder Mead-SARSA. Nelder Mead-SARSA, like NM-SARSA, directly optimises the state-action value function for action selection, in order to allow continuous action reinforcement...

chapter

Supervised learning for Neural Network using Ant Colony Optimization

Ravinder Rathee, Anita Dagar, Seema Rani

2014 International Conference on Reliability Optimization and Information Technology (ICROIT) > 331 - 334

2014 International Conference on Optimization, Reliability, and Information Technology (ICROIT)

To describe the approach of real-world activities we have proposed an idea of SLNA algorithm and its diagram. In this paper we use supervised learning to train the network. In supervised learning, the desired response is provided by the teacher for each particular input. To explain the concept of SLNNA algorithm we have used a real-world example of travel agency (make my trip agency)...

chapter

Detection of multiple cracks in beams using particle swarm optimization and artificial neural network

M A Kazemi, F Nazari, M Karimi, S Baghalian, et al.

2011 Fourth International Conference on Modeling, Simulation and Applied Optimization > 1 - 5

2011 Fourth International Conference on Modeling, Simulation and Applied Optimization (ICMSAO 2011)

This paper presents a new procedure for identification of multiple cracks in a beam. Natural frequency is frequently used as a parameter for detection of cracks in structures. The process of crack identification in the presented procedure consists of four stages. In the first stage, three natural frequencies of a cantilever beam for different locations and depths of cracks were obtained using Finite...

chapter

Design optimization of loop antenna using Competitive Learning ANN

K Sarmah, K K Sarma

2011 2nd National Conference on Emerging Trends and Applications in Computer Science > 1 - 4

2011 2nd National Conference on Emerging Trends and Applications in Computer Science (NCETACS 2011)

Of the several antenna design techniques available, the Artificial Neural Network (ANN) based method is suitable for predicting the characteristic parameters of a loop antenna by considering the transmit-receive conditions of practical communication set-ups. The predicted set of parameters can be used to fix the dimensions of a loop antenna, which involves theoretical calculations. This work proposes an approach to determine...

chapter

A Q-learning method based on Quantum-Behaved Particle Swarm Optimizer

Mingliang Xu, Xiaojian Yan

International Conference on Information Science and Technology > 163 - 167

2011 International Conference on Information Science and Technology (ICIST 2011)

A normalized radial basis function (NRBF) neural network is presented to directly approximate the Q-value function and to generalize the information learnt by the learning agent in continuous space. The action applied to the environment is the one with the maximum NRBF output in the current state, and it is generated by the Quantum-Behaved Particle Swarm Optimizer based on the current state. The effectiveness of...

chapter

High performance lithographic hotspot detection using hierarchically refined machine learning

Duo Ding, A J Torres, F G Pikus, D Z Pan

16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011) > 775 - 780

2011 16th Asia and South Pacific Design Automation Conference, ASP-DAC 2011

Under real and continuously improving manufacturing conditions, lithography hotspot detection faces several key challenges. First, real hotspots become fewer but harder to fix at post-layout stages; second, the false alarm rate must be kept low to avoid excessive and expensive post-processing hotspot removal; third, full chip physical verification and optimization require fast turn-around time. To address...

chapter

Multi-width fixed-point coding based on reprogrammable hardware implementation of a multi-layer perceptron neural network for alertness classification

A G Blaiech, Khaled Ben Khalifa, M Boubaker, M H Bedoui

2010 10th International Conference on Intelligent Systems Design and Applications > 610 - 614

10th International Conference on Intelligent Systems Design and Applications (ISDA 2010)

This paper presents an optimizing methodology for implementing a multi-layer perceptron (MLP) neural network in a Field Programmable Gate Array (FPGA) device. In order to obtain an efficient implementation, a compromise of time and area is needed. Starting from simulation in the learning phase with fixed point operators, we have developed a methodology which allows the automatic generation of a VHDL...

chapter

On a Multiobjective Training Algorithm for RBF Networks Using Particle Swarm Optimization

G R L Silva, D A G Vieira, A C Lisboa, Vasile Palade

2010 22nd IEEE International Conference on Tools with Artificial Intelligence > 2 > 282 - 285

2010 22nd International Conference on Tools with Artificial Intelligence (ICTAI 2010)

This paper presents a novel algorithm for multiobjective training of Radial Basis Function (RBF) networks based on least-squares and Particle Swarm Optimization methods. The formulation is based on the fundamental concept that supervised learning is a bi-objective optimization problem, in which two conflicting objectives should be minimized. The objectives are related to the empirical training error...

chapter

An enhanced workflow management for Utility Management Systems

S Vukmirovic, A Erdeljan, F Kulic, S Lukovic

International Congress on Ultra Modern Telecommunications and Control Systems > 429 - 436

2010 International Congress on Ultra Modern Telecommunications and Control Systems and Workshops (ICUMT 2010)

The emerging computational grid infrastructure consists of widely distributed heterogeneous resources, which makes mapping of increasingly complex applications a very challenging task. Utility Management Systems (UMS) manage a large number of workflows with high resource requirements, and the optimization of resource utilization therefore has to be adapted. In this work we propose an architecture that implements...

chapter

Statistically linearized least-squares temporal differences

M Geist, O Pietquin

International Congress on Ultra Modern Telecommunications and Control Systems > 450 - 457

2010 International Congress on Ultra Modern Telecommunications and Control Systems and Workshops (ICUMT 2010)

A common drawback of standard reinforcement learning algorithms is their inability to scale up to real-world problems. For this reason, an important current research trend is (state-action) value function approximation. A prominent value function approximator is the least-squares temporal differences (LSTD) algorithm. However, for technical reasons, linearity is mandatory: the parameterization of...

chapter

A particle swarm optimized Fuzzy Neural Network for bankruptcy prediction

Li Rui

2010 International Conference on Future Information Technology and Management Engineering > 2 > 557 - 560

2010 International Conference on Future Information Technology and Management Engineering (FITME 2010)

Owing to their excellent performance in handling nonlinear data and their self-learning capability, neural networks (NNs) are widely used in financial prediction problems. However, NNs suffer to some degree from slow convergence and from being a “black box”, i.e., it is almost impossible to analyze how they work. Fuzzy Neural Networks (FNNs) allow rules to be added to neural networks. This avoids the black-box problem but...

chapter

Optimization of learning the neuronetworking data processing system for non-stationary objects recognition and forecasting

O I Djumanov, S M Kholmonov

2010 4th International Conference on Application of Information and Communication Technologies > 1 - 4

2010 4th International Conference on Application of Information and Communication Technologies (AICT 2010)

The problem of constructing neural-network systems for adaptive processing of non-stationary information in various practical applications is formulated. The developed methods and algorithms for forming neural network training subsets make it possible to take into account the conditions of information transfer, the variation of statistical parameters, and the dynamic properties of the data. The controlling algorithms which...

chapter

Nonlinear System Control Using a Recurrent Neural Fuzzy Network Based on Reinforcement Particle Swarm Optimization

Cheng-Jian Lin, Ying-Ming Lin, Chi-Yung Lee

2010 International Symposium on Computational Intelligence and Design > 2 > 196 - 200

2010 3rd International Symposium on Computational Intelligence and Design (ISCID 2010)

This paper proposes a recurrent neural fuzzy network with reinforcement improved particle swarm optimization (R-IPSO) for solving various control problems. The R-IPSO, which consists of structure learning and parameter learning, is also proposed. The structure learning adopts several sub-swarms to constitute variable fuzzy systems and uses an elite-based structure strategy (ESS) to find suitable...

chapter

Constrained optimization of robot trajectory and obstacle avoidance

V Michna, P Wagner, J Cernohorsky

2010 IEEE 15th Conference on Emerging Technologies & Factory Automation (ETFA 2010) > 1 - 4

2010 IEEE 15th Conference on Emerging Technologies & Factory Automation (ETFA 2010)

This paper deals with time-optimization of trajectories of wheeled robots within the speed and other constraints. The cubic Hermite spline curve with the method of speed profile computation is used to determine the trajectory. This method is summarized and extended to allow the optimization with the described constraints. It ensures fulfilment of required initial parameters of motion. The parameters...

chapter

A Novel Hybrid Algorithm Based on Baldwinian Learning and PSO

Wanliang Wang, Lili Chen, Jing Jie, Haiyan Wang, et al.

2010 International Conference on Computational Aspects of Social Networks > 299 - 302

2010 International Conference on Computational Aspects of Social Networks (CASoN 2010)

In the paper, a novel hybrid algorithm based on Baldwinian learning and PSO (BLPSO) is proposed to increase the diversity of the particles and to prevent premature convergence of PSO. Firstly, BLPSO adopts the Baldwinian operator to simulate the learning mechanism among the particles and employs the information of the swarm to alter the search space adaptively. Secondly, a mutation operation is introduced...

chapter

The study of fault diagnosis algorithm based on extension neural network

Lu Ming

2010 2nd IEEE International Conference on Information and Financial Engineering > 447 - 450

2010 2nd IEEE International Conference on Information and Financial Engineering (ICIFE 2010)

The extension neural network (ENN) is a new method based on Extenics and neural networks. It makes full use of the advantages of Extenics in qualitative and quantitative description, while also taking into account the parallel structure of neural networks. This article describes the structure of the extension neural network, which fuses extension theory with neural networks, and introduces an ENN algorithm based on genetic...

chapter

A multi-swarm cooperative hybrid particle swarm optimizer

Ying Li, Jiaxi Liang, Jie Hu

2010 Sixth International Conference on Natural Computation > 5 > 2535 - 2539

2010 Sixth International Conference on Natural Computation (ICNC)

Cooperative approaches have proved to be very useful in evolutionary computation. In this paper, a novel multi-swarm cooperative particle swarm optimization (PSO) is proposed. It involves a collection of two sub-swarms that interact by exchanging information to solve a problem. The two swarms execute IPSO (improved PSO) independently to maintain the diversity of populations, while introducing extremal...

chapter

Performance enhancement of SVM ensembles using genetic algorithms in bankruptcy prediction

Dae-Ki Kang, Myoung-Jong Kim

2010 3rd International Conference on Advanced Computer Theory and Engineering (ICACTE) > 2 > V2-154 - V2-158

2010 3rd International Conference on Advanced Computer Theory and Engineering (ICACTE 2010)

Ensemble learning is a method to improve the performance of classification and prediction algorithms. It has received considerable attention because of its prominent generalization and performance improvement. However, its performance can be degraded by the multicollinearity problem, in which multiple classifiers of an ensemble are highly correlated with one another. This paper proposes genetic algorithm-based coverage...

chapter

A Cooperative Learning Algorithm for Multiclass Classification

Youshen Xia, Yu Ying

2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology > 3 > 223 - 226

2010 IEEE/ACM International Conference on Web Intelligence-Intelligent Agent Technology (WI-IAT)

In this paper, we propose a cooperative learning algorithm for Multi-category classification which is decomposed into two sub-optimization problems by using the support vector machine technique. The proposed cooperative learning algorithm consists of two single learning algorithms and each sub-optimization problem is solved by one of them. Unlike the cooperative neural network, the proposed cooperative...

chapter

An adaptive ensemble of fuzzy ARTMAP neural networks for video-based face classification

J Connolly, E Granger, R Sabourin

IEEE Congress on Evolutionary Computation > 1 - 8

2010 IEEE Congress on Evolutionary Computation

A key feature of population-based optimization algorithms is the ability to explore a search space and make a decision based on multiple solutions. In this paper, an incremental learning strategy based on a dynamic particle swarm optimization (DPSO) algorithm is used to produce heterogeneous ensembles of classifiers for video-based face recognition. This strategy is applied to an adaptive classification...

INFONA - scientific communication portal
