Qing Zhao

chapter

Online learning with side information

Xiao Xu, Sattar Vakili, Qing Zhao, Ananthram Swami

MILCOM 2017 - 2017 IEEE Military Communications Conference (MILCOM) > 303 - 308

An online learning problem with side information is considered. The problem is formulated as a graph structured stochastic Multi-Armed Bandit (MAB). Each node in the graph represents an arm in the bandit problem and an edge between two arms indicates closeness in their mean rewards. It is shown that such side information induces a Unit Interval Graph and several graph properties can be leveraged to...

chapter

Online learning and pricing for demand response in smart distribution networks

Sevi Baltaoglu, Lang Tong, Qing Zhao

2016 IEEE Statistical Signal Processing Workshop (SSP) > 1 - 5

The problem of online learning of consumer response to retail pricing of electricity in a distribution network is considered. In a two-settlement market, the retailer who sets the retail price is exposed to risks from the stochastic response of its consumers and the real-time price fluctuation in the wholesale market. The optimal price maximizing the expected profit is a function of consumer's response...

chapter

Online learning and optimization of Markov jump linear models

Sevi Baltaoglu, Lang Tong, Qing Zhao

2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) > 2289 - 2293

The problem of online learning and optimization of unknown Markov jump linear models is considered. A new online learning algorithm, referred to as Markovian simultaneous perturbations stochastic approximation (MSPSA), is proposed. It is shown that MSPSA achieves the minimax regret order of $\Theta(\sqrt{T})$. Using the Van Trees inequality (stochastic Cramér-Rao bound), it is shown that $\Theta(\sqrt{T})$ is the lowest regret...

chapter

Retail pricing for stochastic demand with unknown parameters: An online machine learning approach

Liyan Jia, Qing Zhao, Lang Tong

2013 51st Annual Allerton Conference on Communication, Control, and Computing (Allerton) > 1353 - 1358

The problem of dynamic pricing of electricity by a retailer for customers in a demand response program is considered. It is assumed that the retailer obtains electricity in a two-settlement wholesale market consisting of a day-ahead market and a real-time market. Under a day-ahead dynamic pricing mechanism, the retailer aims to learn the aggregated demand function of its customers while maximizing...

article

Learning in a Changing World: Restless Multiarmed Bandit With Unknown Dynamics

Haoyang Liu, Keqin Liu, Qing Zhao

IEEE Transactions on Information Theory > 2013 > 59 > 3 > 1902 - 1916

We consider the restless multiarmed bandit problem with unknown dynamics in which a player chooses one out of $N$ arms to play at each time. The reward state of each arm transits according to an unknown Markovian rule when it is played and evolves according to an arbitrary unknown random process when it is passive. The performance of an arm selection policy is measured by regret, defined as the reward...

chapter

Stochastic online learning under unknown time-varying models

Pouya Tehrani, Qing Zhao

2012 Conference Record of the Forty Sixth Asilomar Conference on Signals, Systems and Computers (ASILOMAR) > 1046 - 1050

An online learning problem under stochastic time-varying models is considered. The problem is treated as a generalization of the classic multi-armed bandit problem when the arm distributions are time-varying. The objective is to study the impact of time variation in arm distributions on the performance of the player's strategy. Sufficient conditions on the rate of model variations under which learning...

chapter

Multi-channel opportunistic spectrum access in unslotted primary systems with unknown models

Pouya Tehrani, Qing Zhao, Lang Tong

2011 4th IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive Processing (CAMSAP) > 157 - 160

Multi-channel opportunistic spectrum access in unslotted primary systems is considered. The primary occupancy of each channel is modeled as a general on-off renewal process. The distributions of the busy and idle times and the utilization factors of all channels are unknown to the secondary user. The objective of the secondary user is to identify and exploit the best channel (i.e., the channel with...

INFONA - science communication portal

Search results for: Qing Zhao
