Algorithmic Learning Theory

We consider the setting of stochastic bandit problems with a continuum of arms indexed by [0,1]^d. We first point out that the strategies considered so far in the literature only provided theoretical guarantees of the form: given some tuning parameters, the regret is small with respect to a class of environments that depends on these parameters. This is however not the right...

chapter

Deviations of Stochastic Bandit Regret

Antoine Salomon, Jean-Yves Audibert

Lecture Notes in Computer Science > Algorithmic Learning Theory > Bandit Problems > 159-173

This paper studies the deviations of the regret in a stochastic multi-armed bandit problem. When the total number of plays n is known beforehand by the agent, Audibert et al. (2009) exhibit a policy such that with probability at least 1-1/n, the regret of the policy is of order logn. They have also shown that such a property is not shared by the popular ucb1 policy of Auer et al. (2002). This work...

chapter

On Upper-Confidence Bound Policies for Switching Bandit Problems

Aurélien Garivier, Eric Moulines

Lecture Notes in Computer Science > Algorithmic Learning Theory > Bandit Problems > 174-188

Many problems, such as cognitive radio, parameter control of a scanning tunnelling microscope or internet advertisement, can be modelled as non-stationary bandit problems where the distributions of rewards changes abruptly at unknown time instants. In this paper, we analyze two algorithms designed for solving this issue: discounted UCB (D-UCB) and sliding-window UCB (SW-UCB). We establish an upper-bound...

chapter

Upper-Confidence-Bound Algorithms for Active Learning in Multi-armed Bandits

Alexandra Carpentier, Alessandro Lazaric, Mohammad Ghavamzadeh, Rémi Munos, more

Lecture Notes in Computer Science > Algorithmic Learning Theory > Bandit Problems > 189-203

In this paper, we study the problem of estimating the mean values of all the arms uniformly well in the multi-armed bandit setting. If the variances of the arms were known, one could design an optimal sampling strategy by pulling the arms proportionally to their variances. However, since the distributions are not known in advance, we need to design adaptive sampling strategies to select an arm at...

chapter

Editors’ Introduction

Jyrki Kivinen, Csaba Szepesvári, Esko Ukkonen, Thomas Zeugmann

Lecture Notes in Computer Science > Algorithmic Learning Theory > Editors’ Introduction > 1-13

The ALT-conference series is focuses on studies of learning from an algorithmic and mathematical perspective. During the last decades various models of learning emerged and a main goal is to investigate how various learning problems can be formulated and solved in some of the abstract models.

chapter

Erratum: Learning without Coding

Samuel E. Moelius, Sandra Zilles

Lecture Notes in Computer Science > Algorithmic Learning Theory > Erratum > 452-452

Our ALT’2010 paper claimed that every computably finitely thick [LZ96, Definition 9] class of languages can be identified by enumeration operator [MZ10, Definition 1(e) and Theorem 12]. However, this is, in fact, false. We intend to include a proof of the claim’s negation in the journal version of our paper, which has been submitted.

chapter

Iterative Learning from Positive Data and Counters

Timo Kötzing

Lecture Notes in Computer Science > Algorithmic Learning Theory > Inductive Inference > 40-54

We analyze iterative learning in the limit from positive data with the additional information provided by a counter. The simplest type of counter provides the current iteration number (counting up from 0 to infinity), which is known to improve learning power over plain iterative learning. We introduce five other (weaker) counter types, for example only providing some unbounded and non-decreasing...

chapter

Robust Learning of Automatic Classes of Languages

Sanjay Jain, Eric Martin, Frank Stephan

Lecture Notes in Computer Science > Algorithmic Learning Theory > Inductive Inference > 55-69

This paper adapts and investigates the paradigm of robust learning, originally defined in the inductive inference literature for classes of recursive functions, to learning languages from positive data. Robustness is a very desirable property, as it captures a form of invariance of learnability under admissible transformations on the object of study. The classes of languages of interest are automatic...

chapter

Learning and Classifying

Sanjay Jain, Eric Martin, Frank Stephan

Lecture Notes in Computer Science > Algorithmic Learning Theory > Inductive Inference > 70-83

We define and study a learning paradigm that sits between identification in the limit and classification. More precisely, we expect that a learner be able to identify in the limit which members of a set D of n possible data belong to a target language, where n and D are arbitrary. We show that Ex- and BC-learning are often more difficult than performing this classification task, taking into account...

chapter

Learning Relational Patterns

Michael Geilke, Sandra Zilles

Lecture Notes in Computer Science > Algorithmic Learning Theory > Inductive Inference > 84-98

Patterns provide a simple, yet powerful means of describing formal languages. However, for many applications, neither patterns nor their generalized versions of typed patterns are expressive enough. This paper extends the model of (typed) patterns by allowing relations between the variables in a pattern. The resulting formal languages are called Relational Pattern Languages (RPLs). We study the problem...

INFONA - science communication portal

Algorithmic Learning Theory
22nd International Conference, ALT 2011, Espoo, Finland, October 5-7, 2011. Proceedings

Invited Papers

Intelligent Agents

Inductive Inference

Bandit Problems

Erratum

Other Learning Models

Regression

Online Learning

Kernel and Margin Based Methods

Editors’ Introduction

Lipschitz Bandits without the Lipschitz Constant

Deviations of Stochastic Bandit Regret

On Upper-Confidence Bound Policies for Switching Bandit Problems

Upper-Confidence-Bound Algorithms for Active Learning in Multi-armed Bandits

Editors’ Introduction

Erratum: Learning without Coding

Iterative Learning from Positive Data and Counters

Robust Learning of Automatic Classes of Languages

Learning and Classifying

Learning Relational Patterns

Filter options

Publication date

Content availability

Keywords

INFONA - science communication portal

Algorithmic Learning Theory 22nd International Conference, ALT 2011, Espoo, Finland, October 5-7, 2011. Proceedings $("#expandableTitles").expandable();

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Content availability

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options

Algorithmic Learning Theory
22nd International Conference, ALT 2011, Espoo, Finland, October 5-7, 2011. Proceedings