Search results for: Aurélien Garivier

Items from 1 to 2 out of 2 results

article

Optimally Sensing a Single Channel Without Prior Information: The Tiling Algorithm and Regret Bounds

Sarah Filippi, Olivier Cappé, Aurélien Garivier

IEEE Journal of Selected Topics in Signal Processing > 2011 > 5 > 1 > 68 - 76

We consider the task of optimally sensing a two-state Markovian channel with an observation cost and without any prior information regarding the channel's transition probabilities. This task is of interest in the field of cognitive radio as a model for opportunistic access to a communication network by a secondary user. The optimal sensing problem may be cast into the framework of model-based reinforcement...

chapter

Optimism in reinforcement learning and Kullback-Leibler divergence

S Filippi, Olivier Cappé, Aurélien Garivier

2010 48th Annual Allerton Conference on Communication, Control, and Computing (Allerton) > 115 - 122

2010 48th Annual Allerton Conference on Communication, Control, and Computing (Allerton)

We consider model-based reinforcement learning in finite Markov Decision Processes (MDPs), focussing on so-called optimistic strategies. In MDPs, optimism can be implemented by carrying out extended value iterations under a constraint of consistency with the estimated model transition probabilities. The UCRL2 algorithm by Auer, Jaksch and Ortner (2009), which follows this strategy, has recently been...

Filter options

Publication date

Set your own date range

Publication type

article (1)
book (1)

Keywords

LEARNING (2)
LEARNING (ARTIFICIAL INTELLIGENCE) (2)
MARKOV DECISION PROCESSES (2)
MARKOV PROCESSES (2)
MODEL-BASED REINFORCEMENT LEARNING (2)
REGRET BOUNDS (2)
REINFORCEMENT LEARNING (2)
ALGORITHM DESIGN AND ANALYSIS (1)
BENCHMARK TESTING (1)
CHANNEL SENSING (1)
CHANNEL TRANSITION PROBABILITY (1)
COGNITIVE RADIO (1)
COMMUNICATION NETWORKS (1)
CONTEXT MODELING (1)
CONTRACTS (1)
COST FUNCTION (1)
EQUATIONS (1)
FINITE MARKOV DECISION PROCESS (1)
FINITE STATE MARKOV DECISION PROCESS (1)
FREQUENCY (1)
HARDWARE (1)
KULLBACK-LEIBLER DIVERGENCE (1)
LINEAR MAXIMIZATION PROBLEM (1)
MATHEMATICAL MODEL (1)
MODEL TRANSITION PROBABILITY (1)
MODEL-BASED APPROACHES (1)
MULTIARMED BANDIT (1)
OPPORTUNISTIC CHANNEL ACCESS (1)
OPTIMISM (1)
OPTIMISTIC STRATEGY (1)
PARTIALLY OBSERVABLE MARKOV DECISION PROCESSES (POMDPS) (1)
RADIOFREQUENCY IDENTIFICATION (1)
RESTLESS BANDIT (1)
TELECOMMUNICATIONS (1)
TILING ALGORITHM (1)
TWO-STATE MARKOVIAN CHANNEL (1)
WIRELESS CHANNELS (1)
WIRELESS NETWORKS (1)
more

INFONA - science communication portal

Search results for: Aurélien Garivier

Optimally Sensing a Single Channel Without Prior Information: The Tiling Algorithm and Regret Bounds

Optimism in reinforcement learning and Kullback-Leibler divergence

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Publication type

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options