Wyniki wyszukiwania dla: Bo Wu

Pozycje od 1 do 4 spośród 4 wyników

rozdział

Policy Reuse for Learning and Planning in Partially Observable Markov Decision Processes

Bo Wu, Yanpeng Feng

2017 4th International Conference on Information Science and Control Engineering (ICISCE) > 549 - 552

2017 4th International Conference on Information Science and Control Engineering (ICISCE)

Learning and planning in partially bservable Markov decision processes (POMDPs) is computationally intractable in real-time system. In order to address this problem, this paper proposes a belief policy reuse (BPR) method to avoid repeated computation. Firstly, the policy reuse evaluation mechanism based on belief Kullback¨CLeibler divergence is presented as a similarity metric between beliefs in the...

rozdział

Monte-Carlo Bayesian Reinforcement Learning Using a Compact Factored Representation

Bo Wu, Yanpeng Feng

2017 4th International Conference on Information Science and Control Engineering (ICISCE) > 466 - 469

2017 4th International Conference on Information Science and Control Engineering (ICISCE)

Bayesian reinforcement learning provides an elegant solution to the optimal tradeoff between exploration and exploitation of the uncertainty in learning. Unfortunately, the size of the learning parameters grows exponentially with the problem horizon. In this paper, we propose a novel Monte Carlo tree search for Bayesian reinforcement learning approach using a compact factored representation, to solve...

rozdział

Point-Based Incremental Pruning for Monte-Carlo Tree Search

Bo Wu, Yanpeng Feng

2017 4th International Conference on Information Science and Control Engineering (ICISCE) > 545 - 548

2017 4th International Conference on Information Science and Control Engineering (ICISCE)

Monte-Carlo tree search (MCTS) combines the generality of stochastic simulation and the accuracy of tree search, which has attracted the great attention of scholars. However, the MCTS search requires a sufficient number of iterations to converge to a good solution, which is more difficult to optimize. In order to solve this problem, this paper presents a point-based incremental pruning (PIP) for Monte-Carlo...

rozdział

Boosted Markov Chain Monte Carlo Data Association for Multiple Target Detection and Tracking

Qian Yu, I. Cohen, G. Medioni, Bo Wu

18th International Conference on Pattern Recognition (ICPR'6) > 2 > 675 - 678

2006 18th International Conference on Pattern Recognition

In this paper, we present a probabilistic framework for automatic detection and tracking of objects. We address the data association problem by formulating the visual tracking as finding the best partition of a measurement graph containing all detected moving regions. In order to incorporate model information in tracking procedure, the posterior distribution is augmented with Adaboost image likelihood...

Opcje filtrowania

Słowa kluczowe:
MONTE CARLO METHODS
Typ publikacji:
książka

Data publikacji

Ustaw własny zakres dat

Słowa kluczowe

MARKOV PROCESSES (3)
PLANNING (3)
COMPUTATIONAL MODELING (2)
ADABOOST IMAGE LIKELIHOOD (1)
ALGORITHM DESIGN AND ANALYSIS (1)
AUTOMATIC OBJECT DETECTION (1)
AUTOMATIC OBJECT TRACKING (1)
BAYES METHODS (1)
BAYESIAN REINFORCEMENT LEARNING (1)
BUSINESS PROCESS RE-ENGINEERING (1)
COMPLEXITY THEORY (1)
DATA ASSOCIATION (1)
DATA-ORIENTED SAMPLING (1)
DYNAMIC BAYESIAN NETWORKS (1)
FACTORED REPRESENTATION (1)
GAMES (1)
GRAPH THEORY (1)
HEURISTIC ALGORITHMS (1)
IMAGE MOTION ANALYSIS (1)
IMAGE SAMPLING (1)
INCREMENTAL PRUNING (1)
JOINT PROBABILITY MODEL (1)
LEARNING (ARTIFICIAL INTELLIGENCE) (1)
LIBRARIES (1)
MARKOV CHAIN MONTE CARLO METHOD (1)
MARKOV RANDOM FIELD-BASED INTERACTION (1)
MEASUREMENT GRAPH (1)
MONTE CARLO TREE SEARCH (1)
MONTE-CARLO TREE SEARCH (MCTS) (1)
MOVING REGION DETECTION (1)
OBJECT DETECTION (1)
PARTIALLY OBSERVABLE MARKOV DECISION PROCESSES (1)
PARTIALLY OBSERVABLE MARKOV DECISION PROCESSES (POMPDS) (1)
POLICY REUSE (1)
POSTERIOR DISTRIBUTION (1)
PROBABILISTIC FRAMEWORK (1)
PROBABILITY (1)
REAL-TIME SYSTEMS (1)
REINFORCEMENT LEARNING (1)
ROBOTS (1)
ROCKS (1)
TRACKING (1)
więcej

INFONA - portal komunikacji naukowej

Wyniki wyszukiwania dla: Bo Wu

Policy Reuse for Learning and Planning in Partially Observable Markov Decision Processes

Monte-Carlo Bayesian Reinforcement Learning Using a Compact Factored Representation

Point-Based Incremental Pruning for Monte-Carlo Tree Search

Boosted Markov Chain Monte Carlo Data Association for Multiple Target Detection and Tracking

Dodaj adresata

Anulowanie wysłania wiadomości

Czy na pewno chcesz anulować wysłanie wiadomości?

Wyślij wiadomość

Opcje filtrowania

Data publikacji

Ustawianie zakresu dat

Podaj zakres dat dla filtrowania wyświetlonych wyników. Możesz podać datę początkową, końcową lub obie daty. Daty możesz wpisać ręcznie lub wybrać za pomocą kalendarza.

Słowa kluczowe

Zgłaszanie błędu / nadużycia

Nieudane wysłanie zgłoszenia

Ułatwienia dostępu