Qing Zhao

chapter

Mean-variance and value at risk in multi-armed bandit problems

Sattar Vakili, Qing Zhao

2015 53rd Annual Allerton Conference on Communication, Control, and Computing (Allerton) > 1330 - 1335

2015 53rd Annual Allerton Conference on Communication, Control and Computing (Allerton)

We study risk-averse multi-armed bandit problems under different risk measures. We consider three risk mitigation models. In the first model, the variations in the reward values obtained at different times are considered as risk and the objective is to minimize the mean-variance of the observed rewards. In the second and the third models, the quantity of interest is the total reward at the end of...

chapter

Online learning for network optimization under unknown models

Yixuan Zhai, Qing Zhao

2013 IEEE Global Conference on Signal and Information Processing > 575 - 578

2013 IEEE Global Conference on Signal and Information Processing (GlobalSIP)

We consider the shortest path problem in a communication network with random link costs drawn from unknown distributions. A realization of the total end-to-end cost is obtained when a path is selected for communication. The objective is an online learning algorithm that minimizes the total expected communication cost in the long run. The problem is formulated as a multi-armed bandit problem with dependent...

chapter

Distributed node-weighted connected dominating set problems

Sattar Vakili, Qing Zhao

2013 Asilomar Conference on Signals, Systems and Computers > 238 - 241

2013 Asilomar Conference on Signals, Systems and Computers

The Minimum Connected Dominating Set (MCDS) problem is to find a subset of vertices in a given graph G such that the set is connected and any vertex of G is either in the set or adjacent to a node in the set. This problem is shown to be NP-Hard and the best polynomial time approximation ratio is O(log n) where n is the number of vertices. The MCDS problem and its derivations are of interest in many...

chapter

Achieving complete learning in Multi-Armed Bandit problems

Sattar Vakili, Qing Zhao

2013 Asilomar Conference on Signals, Systems and Computers > 1778 - 1782

2013 Asilomar Conference on Signals, Systems and Computers

In the classic Multi-Armed Bandit (MAB) problem, there is a given set of arms with unknown reward distributions. At each time, a player selects one arm to play, aiming to maximize the total expected reward over a horizon of length T. It is known that the minimum growth rate of regret (defined as the total expected loss with respect to the ideal scenario of known reward models of all arms) is logarithmic...

chapter

Online learning for stochastic linear optimization problems

Keqin Liu, Qing Zhao

2012 Information Theory and Applications Workshop > 363 - 367

2012 Information Theory and Applications Workshop (ITA)

We consider the stochastic online linear optimization problems under unknown cost models. At each time, an action is chosen from a compact subset in R^d and a random cost with an unknown distribution (depending on the action) is incurred. The expected value of the random cost is assumed to be a (unknown) linear function over the action space. The objective is to minimize the growth rate of regret (i...

chapter

Risk model in fuzzy environments

Xiao-yan Zhao, Jing-gui Gao, Ming-qing Zhao

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery > 2 > 922 - 926

2010 Seventh International Conference on Fuzzy Systems and Knowledge Discovery (FSKD)

In this paper, we consider a risk model in which the individual claim amount is assumed to be a random variable with fuzzy parameters and the claim number process is characterized as Poisson process with fuzzy intensity λ. The mean chance of the ultimate ruin is researched. Particularly, the expressions of the mean chance of the ultimate ruin are obtained for zero initial surplus and arbitrary initial...

INFONA - science communication portal

Search results for: Qing Zhao

Mean-variance and value at risk in multi-armed bandit problems

Online learning for network optimization under unknown models

Distributed node-weighted connected dominating set problems

Achieving complete learning in Multi-Armed Bandit problems

Online learning for stochastic linear optimization problems

Risk model in fuzzy environments

Filter options

Publication date

Keywords

INFONA - science communication portal

Search results for: Qing Zhao

Mean-variance and value at risk in multi-armed bandit problems

Online learning for network optimization under unknown models

Distributed node-weighted connected dominating set problems

Achieving complete learning in Multi-Armed Bandit problems

Online learning for stochastic linear optimization problems

Risk model in fuzzy environments

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options