Learning and planning in partially observable Markov decision processes (POMDPs) is computationally intractable for real-time systems. To address this problem, this paper proposes a belief policy reuse (BPR) method to avoid repeated computation. Firstly, a policy reuse evaluation mechanism based on belief Kullback-Leibler divergence is presented as a similarity metric between beliefs in the...
Bayesian reinforcement learning provides an elegant solution to the optimal tradeoff between exploration and exploitation of the uncertainty in learning. Unfortunately, the number of learning parameters grows exponentially with the problem horizon. In this paper, we propose a novel Monte Carlo tree search approach for Bayesian reinforcement learning that uses a compact factored representation to solve...
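The selection step of a generic Monte Carlo tree search can be sketched as below; this is the standard UCB1 child-selection rule, not the factored Bayesian variant the abstract refers to, and the data layout is an assumption for illustration:

```python
import math

def ucb1(total_value, visits, parent_visits, c=1.414):
    """UCB1 score: mean value plus an exploration bonus.
    Unvisited children get infinite score so they are tried first."""
    if visits == 0:
        return float("inf")
    return total_value / visits + c * math.sqrt(math.log(parent_visits) / visits)

def select_child(children):
    """Pick the child node maximizing UCB1 (children are dicts with
    'value' and 'visits' keys; a hypothetical minimal node format)."""
    parent_visits = sum(ch["visits"] for ch in children) + 1
    return max(children, key=lambda ch: ucb1(ch["value"], ch["visits"], parent_visits))

children = [{"value": 3.0, "visits": 5}, {"value": 0.0, "visits": 0}]
print(select_child(children))
```

In full MCTS this selection step is followed by expansion, simulation, and backpropagation; only selection is shown here.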
Modern manufacturing systems are human-robot systems consisting of human operators and intelligent robots that collaborate to accomplish complex tasks. The performance of such human-robot systems relies heavily on reliable and efficient human-robot collaboration, which may be seriously compromised by temporal variations in human-to-robot trust. This paper proposes to model...
The Partially Observable Markov Decision Process (POMDP) has been widely used in robotics to model uncertainties from sensors, actuators, and the environment. However, such comprehensiveness makes planning in POMDPs generally very difficult. Existing work often searches for an optimal control policy with respect to predefined reward functions, which may require large memory and is computationally...
Formal methods in robotic motion planning have recently emerged as a hot research topic due to their correct-by-design nature, and most results have been based on nonprobabilistic discrete models. To better handle environment uncertainties, sensor noise, and actuator imperfections, control problems in probabilistic systems such as Markov Chains (MCs) and Markov Decision Processes (MDPs) have also been studied...
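A control problem on an MDP of the kind mentioned above is often solved by value iteration; the following is a textbook sketch under assumed data layouts (P[a][s][s'] transition probabilities, R[s][a] rewards), not any specific method from the cited work:

```python
import numpy as np

def value_iteration(P, R, gamma=0.95, tol=1e-6):
    """Compute the optimal value function and greedy policy of a finite MDP.
    P[a][s][s'] is the probability of reaching s' from s under action a;
    R[s][a] is the immediate reward for taking action a in state s."""
    n_actions, n_states = len(P), len(P[0])
    V = np.zeros(n_states)
    while True:
        # Bellman backup: Q(s,a) = R(s,a) + gamma * sum_s' P(s'|s,a) V(s')
        Q = np.array([[R[s][a] + gamma * np.dot(P[a][s], V)
                       for a in range(n_actions)] for s in range(n_states)])
        V_new = Q.max(axis=1)
        if np.max(np.abs(V_new - V)) < tol:
            return V_new, Q.argmax(axis=1)
        V = V_new

# Tiny 2-state example: action 1 moves to the rewarding state 1.
P = [[[1, 0], [0, 1]],   # action 0: stay put
     [[0, 1], [0, 1]]]   # action 1: go to state 1
R = [[0, 0],             # state 0: no reward
     [1, 1]]             # state 1: reward 1 under either action
V, pi = value_iteration(P, R)
print(V, pi)
```

Formal-methods approaches typically replace the reward objective with a temporal-logic specification, but the underlying fixed-point computation over the MDP is similar in spirit.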
Wireless networked control systems (WNCS), in which control loops are closed over a wireless network, are prevalent these days. However, they also introduce new challenges for stability analysis once the nuances of practical communication protocols are taken into account. The IEEE 802.15.4 protocol is among the most popular communication protocols used in WNCS. However, its medium access control (MAC) is usually...
In this paper, we present a probabilistic framework for automatic detection and tracking of objects. We address the data association problem by formulating visual tracking as finding the best partition of a measurement graph containing all detected moving regions. To incorporate model information into the tracking procedure, the posterior distribution is augmented with an AdaBoost image likelihood...