Qing Zhao

rozdział

Mean-variance and value at risk in multi-armed bandit problems

Sattar Vakili, Qing Zhao

2015 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton) > 1330 - 1335

2015 53rd Annual Allerton Conference on Communication, Control and Computing (Allerton)

We study risk-averse multi-armed bandit problems under different risk measures. We consider three risk mitigation models. In the first model, the variations in the reward values obtained at different times are considered as risk and the objective is to minimize the mean-variance of the observed rewards. In the second and the third models, the quantity of interest is the total reward at the end of...

rozdział

Online learning for network optimization under unknown models

Yixuan Zhai, Qing Zhao

2013 IEEE Global Conference on Signal and Information Processing > 575 - 578

2013 IEEE Global Conference on Signal and Information Processing (GlobalSIP)

We consider the shortest path problem in a communication network with random link costs drawn from unknown distributions. A realization of the total end-to-end cost is obtained when a path is selected for communication. The objective is an online learning algorithm that minimizes the total expected communication cost in the long run. The problem is formulated as a multi-armed bandit problem with dependent...

rozdział

Distributed node-weighted connected dominating set problems

Sattar Vakili, Qing Zhao

2013 Asilomar Conference on Signals, Systems and Computers > 238 - 241

2013 Asilomar Conference on Signals, Systems and Computers

The Minimum Connected Dominating Set (MCDS) problem is to find a subset of vertices in a given graph G such that the set is connected and any vertex of G is either in the set or adjacent to a node in the set. This problem is shown to be NP-Hard and the best polynomial time approximation ratio is O(log n) where n is the number of vertices. The MCDS problem and its derivations are of interest in many...

rozdział

Achieving complete learning in Multi-Armed Bandit problems

Sattar Vakili, Qing Zhao

2013 Asilomar Conference on Signals, Systems and Computers > 1778 - 1782

2013 Asilomar Conference on Signals, Systems and Computers

In the classic Multi-Armed Bandit (MAB) problem, there is a given set of arms with unknown reward distributions. At each time, a player selects one arm to play, aiming to maximize the total expected reward over a horizon of length T. It is known that the minimum growth rate of regret (defined as the total expected loss with respect to the ideal scenario of known reward models of all arms) is logarithmic...

rozdział

Online learning for stochastic linear optimization problems

Keqin Liu, Qing Zhao

2012 Information Theory and Applications Workshop > 363 - 367

2012 Information Theory and Applications Workshop (ITA)

We consider the stochastic online linear optimization problems under unknown cost models. At each time, an action is chosen from a compact subset in R^d and a random cost with an unknown distribution (depending on the action) is incurred. The expected value of the random cost is assumed to be a (unknown) linear function over the action space. The objective is to minimize the growth rate of regret (i...

rozdział

Risk model in fuzzy environments

Xiao-yan Zhao, Jing-gui Gao, Ming-qing Zhao

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery > 2 > 922 - 926

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery (FSKD)

In this paper, we consider a risk model in which the individual claim amount is assumed to be a random variable with fuzzy parameters and the claim number process is characterized as Poisson process with fuzzy intensity λ. The mean chance of the ultimate ruin is researched. Particularly, the expressions of the mean chance of the ultimate ruin are obtained for zero initial surplus and arbitrary initial...

INFONA - portal komunikacji naukowej

Wyniki wyszukiwania dla: Qing Zhao

Mean-variance and value at risk in multi-armed bandit problems

Online learning for network optimization under unknown models

Distributed node-weighted connected dominating set problems

Achieving complete learning in Multi-Armed Bandit problems

Online learning for stochastic linear optimization problems

Risk model in fuzzy environments

Opcje filtrowania

Data publikacji

Słowa kluczowe

INFONA - portal komunikacji naukowej

Wyniki wyszukiwania dla: Qing Zhao

Mean-variance and value at risk in multi-armed bandit problems

Online learning for network optimization under unknown models

Distributed node-weighted connected dominating set problems

Achieving complete learning in Multi-Armed Bandit problems

Online learning for stochastic linear optimization problems

Risk model in fuzzy environments

Dodaj adresata

Anulowanie wysłania wiadomości

Czy na pewno chcesz anulować wysłanie wiadomości?

Wyślij wiadomość

Opcje filtrowania

Data publikacji

Ustawianie zakresu dat

Podaj zakres dat dla filtrowania wyświetlonych wyników. Możesz podać datę początkową, końcową lub obie daty. Daty możesz wpisać ręcznie lub wybrać za pomocą kalendarza.

Słowa kluczowe

Zgłaszanie błędu / nadużycia

Nieudane wysłanie zgłoszenia

Ułatwienia dostępu