Advanced search

Advanced search in people

From:

To:

Items from 1 to 5 out of 5 results

article

Identification of Unknown Parameters for a Class of Two-Level Quantum Systems

Zhengui Xue, Hai Lin, Tong Heng Lee

IEEE Transactions on Automatic Control > 2013 > 58 > 7 > 1805 - 1810

This technical note studies the identification of unknown decoherence rates for a class of two-level quantum systems undergoing spontaneous emission. Our previous work shows that estimates of the unknown decoherence rates can be obtained from the ensemble averages by imposing constant control and monitoring a sequence of identical systems continuously. Inspired by the work, this technical note further...

chapter

RVI reinforcement learning for semi-Markov decision processes with average reward

Yanjie Li, Fang Cao

2010 8th World Congress on Intelligent Control and Automation > 1674 - 1679

2010 8th World Congress on Intelligent Control and Automation (WCICA 2010)

Based on the sensitivity-based approach, we discuss the reinforcement learning problem of semi-Markov decision processes (SMDPs) with average reward. First, we provide a new Bellman optimality equation. On this basis, we propose a relative value iteration (RVI) reinforcement learning algorithm. The new RVI reinforcement learning algorithm may avoid the estimation of optimal average reward in the process...

chapter

Alpha-EM gives fast Hidden Markov Model estimation: Derivation and evaluation of alpha-HMM

Y Matsuyama, R Hayashi

The 2010 International Joint Conference on Neural Networks (IJCNN) > 1 - 8

2010 International Joint Conference on Neural Networks (IJCNN 2010)

A fast learning algorithm for Hidden Markov Models is derived starting from convex divergence optimization. This method utilizes the alpha-logarithm as a surrogate function for the traditional logarithm to process the likelihood ratio. This enables the utilization of a stronger curvature than the logarithm. This paper's method includes the ordinary Baum-Welch re-estimation algorithm as a proper subset...

chapter

Near-Optimal Approximation Rates for Distribution Free Learning with Exponentially, Mixing Observations

Andrew J Kurdila, Bin Xu

Proceedings of the 2010 American Contrl Conference > 504 - 509

2010 American Control Conference (ACC 2010)

This paper derives the rate of convergence for the distribution free learning problem when the observation process is an exponentially strongly mixing (α-mixing with an exponential rate) Markov chain. If {z_k}_K=1^∞ = {(x_k, y_k)}_k=1^∞ ⊂ x × Y ≡ Z is an exponentially strongly mixing Markov chain with stationary measure ρ, it is shown that the empirical estimate f_z that minimizes the discrete quadratic risk...

chapter

MCMC for sequential flight object attitude estimation based on perfect coupling sampling

Zhang Jingmei, Zhai Yongzhi

2008 2nd International Symposium on Systems and Control in Aerospace and Astronautics > 1 - 4

ISSCAA 2008. 2nd International Symposium on Systems and Control in Aerospace and Astronautics

Aiming at large initial attitude errors of flight object, this paper presents perfect coupling sampling based on coupling from the past (CFTP) algorithm on MCMC (Markov chain Monte Carlo) to tackle the problem of sequential flight object attitude estimation. Based Bayesian theory, posterior distribution can be approximated by Monte Carlo likelihood function and conjunction prior distribution based...

Filter options

Keywords:
ESTIMATION
EQUATIONS
CONVERGENCE
MARKOV PROCESSES
Publication language:
English

Publication date

Set your own date range

Publication type

book (4)
article (1)

Keywords

COMPUTATIONAL COMPLEXITY (2)
CONVERGENCE OF NUMERICAL METHODS (2)
FUNCTION APPROXIMATION (2)
LEARNING (ARTIFICIAL INTELLIGENCE) (2)
MATHEMATICAL MODEL (2)
AIRCRAFT (1)
ALGORITHM DESIGN AND ANALYSIS (1)
ALPHA-EXPECTATION-MAXIMIZATION ALGORITHM (1)
ALPHA-HIDDEN MARKOV MODEL (1)
ALPHA-LOGARITHM (1)
APPROXIMATION METHODS (1)
APPROXIMATION THEORY (1)
ATTITUDE MEASUREMENT (1)
BAYES METHODS (1)
BAYESIAN THEORY (1)
BELLMAN OPTIMALITY EQUATION (1)
CONTROL SYSTEMS (1)
CONVERGENCE RATE (1)
CONVEX DIVERGENCE OPTIMIZATION (1)
CONVEX PROGRAMMING (1)
COUPLING FROM THE PAST (CFTP) (1)
COUPLING FROM THE PAST ALGORITHM (1)
COUPLINGS (1)
DIFFERENCE ENCODING (1)
DISCRETE QUADRATIC RISK (1)
ENCODING (1)
ERGODICITY (1)
FAST HIDDEN MARKOV MODEL ESTIMATION (1)
FAST LEARNING ALGORITHM (1)
FILTERING (1)
FLIGHT OBJECT ATTITUDE ESTIMATION (1)
HEURISTIC ALGORITHMS (1)
HIDDEN MARKOV MODELS (1)
ITERATIVE METHODS (1)
LEARNING (1)
LEARNING THEORY (1)
LIKELIHOOD FUNCTION APPROXIMATION (1)
MARKOV CHAIN (1)
MARKOV CHAIN MONTE CARLO METHOD (1)
MCMC (1)
MINIMISATION (1)
MONOTONOUS STATE-SPACE (1)
MONTE CARLO METHODS (1)
NEAR-OPTIMAL APPROXIMATION (1)
NOISE MEASUREMENT (1)
OPTIMAL AVERAGE REWARD (1)
OPTIMISATION (1)
ORDINARY BAUM-WELCH RE-ESTIMATION ALGORITHM (1)
PARAMETER ESTIMATION (1)
PARTICLE FILTERING (1)
PARTICLE FILTERING (NUMERICAL METHODS) (1)
PERFECT COUPLING SAMPLING (1)
PERFORMANCE POTENTIAL (1)
PROBABILISTIC LOGIC (1)
PROCESS CONTROL (1)
QUANTUM SYSTEMS (1)
REGRESSOR FUNCTION (1)
REINFORCEMENT LEARNING (1)
REINFORCEMENT LEARNING PROBLEM (1)
RELATIVE VALUE ITERATION (1)
RELATIVE VALUE ITERATION REINFORCEMENT LEARNING ALGORITHM (1)
RVI REINFORCEMENT LEARNING ALGORITHM (1)
SEMI-MARKOV DECISION PROCESSES (1)
SEMIMARKOV DECISION PROCESSES (1)
SENSITIVITY-BASED APPROACH (1)
SEQUENTIAL ESTIMATION (1)
SEQUENTIAL FLIGHT OBJECT ATTITUDE ESTIMATION (1)
SIGNAL SAMPLING (1)
SINGLE SYSTEM (1)
SMDP (1)
SOFTWARE ALGORITHMS (1)
STATIONARY DISTRIBUTION (1)
STATISTICAL DISTRIBUTION (1)
STATISTICAL DISTRIBUTIONS (1)
STATISTICAL LEARNING (1)
SURROGATE FUNCTION (1)
more

INFONA - science communication portal

Advanced search

Advanced search in people

Identification of Unknown Parameters for a Class of Two-Level Quantum Systems

RVI reinforcement learning for semi-Markov decision processes with average reward

Alpha-EM gives fast Hidden Markov Model estimation: Derivation and evaluation of alpha-HMM

Near-Optimal Approximation Rates for Distribution Free Learning with Exponentially, Mixing Observations

MCMC for sequential flight object attitude estimation based on perfect coupling sampling

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options