Alexander Nazin

chapter

Adaptive mirror descent algorithm for the minimization of expected cumulative losses driven by a renewal process

Alexander Nazin, Svetlana Anulova, Andrey Tremba, Pavel Shcherbakov

2015 European Control Conference (ECC) > 1195 - 1199

2015 European Control Conference (ECC)

The problem considered in this paper is the minimization of expected cumulative losses in a stochastic system. The losses over time horizon are formed by the values of an unknown loss function at the consecutive jump times of a renewal process. The loss is assumed to be a convex function of a vector parameter, and the only available information is represented by an oracle which provides stochastic...

chapter

Local modelling with a priori known bounds using direct weight optimization

Jacob Roll, Alexander Nazin, Lennart Ljung

2003 European Control Conference (ECC) > 2138 - 2143

2003 European Control Conference (ECC)

In local modelling, function estimates are computed from observations in a local neighborhood of the point of interest. A central question is how to choose the size of the neighborhood. Often this question has been tackled using asymptotic (in the number of observations) arguments. The recently introduced direct weight optimization approach is a non-asymptotic approach, minimizing an upper bound on...

chapter

Application of the Mirror Descent Method to minimize average losses coming by a poisson flow

Alexander Nazin, Svetlana Anulova, Andrey Tremba

2014 European Control Conference (ECC) > 2194 - 2197

2014 European Control Conference (ECC)

We treat a convex problem to minimize average loss function for a stochastic system operating in continuous time. The losses on time horizon T arise at the jump times of a Poisson process with intensity being an unknown random process. The oracle gives randomly noised gradients of the loss function; the noises are additive, unbiased, with the bounded dual norm in average square sense. The goal consists...

chapter

Extension of a saddle point mirror descent algorithm with application to robust PageRank

Andrey Tremba, Alexander Nazin

52nd IEEE Conference on Decision and Control > 3691 - 3696

2013 IEEE 52nd Annual Conference on Decision and Control (CDC)

The paper is devoted to designing an efficient recursive algorithm for solving the robust PageRank problem recently proposed by Juditsky and Polyak (2012) [4]. To this end, we reformulate the problem to a specific convex-concave saddle point problem min_x∈X max_y∈Y q(x, y) with simple convex sets X ∈ ℝ^N and Y ∈ ℝ^N, i.e., standard simplex and Euclidean unit ball, respectively. Aiming this goal we develop...

chapter

On effectiveness of the Mirror Decent Algorithm for a stochastic multi-armed bandit governed by a stationary finite Markov chain

Alexander Nazin, Boris Miller

2013 Australian Control Conference > 244 - 250

2013 3rd Australian Control Conference (AUCC)

In this article, we study the effectiveness of the Mirror Descent Randomized Control Algorithm recently developed to a class of homogeneous finite Markov chains governed by the stochastic multi-armed bandit with unknown mean losses. We prove the explicit, non-asymptotic both upper and lower bounds for the mean losses at a given (finite) time horizon. These bounds are very similar as functions of problem...

chapter

Mirror decent algorithm for a multi-armed bandit governed by a stationary finite state Markov chain

Alexander Nazin, Boris Miller

2013 European Control Conference (ECC) > 371 - 375

2013 European Control Conference (ECC)

This article further develops an adaptive approach to the control of observable Markov chains with a finite number of states. We apply the Mirror Descent Randomized Control Algorithm (MDRCA) to a class of homogeneous finite Markov chains governed by the multi-armed bandit with unknown mean losses. The article develops the approach represented in [18]. As opposed to the partially observable Markov...

INFONA - science communication portal

Search results for: Alexander Nazin

Adaptive mirror descent algorithm for the minimization of expected cumulative losses driven by a renewal process

Local modelling with a priori known bounds using direct weight optimization

Application of the Mirror Descent Method to minimize average losses coming by a poisson flow

Extension of a saddle point mirror descent algorithm with application to robust PageRank

On effectiveness of the Mirror Decent Algorithm for a stochastic multi-armed bandit governed by a stationary finite Markov chain

Mirror decent algorithm for a multi-armed bandit governed by a stationary finite state Markov chain

Filter options

Publication date

Keywords

INFONA - science communication portal

Search results for: Alexander Nazin

Adaptive mirror descent algorithm for the minimization of expected cumulative losses driven by a renewal process

Local modelling with a priori known bounds using direct weight optimization

Application of the Mirror Descent Method to minimize average losses coming by a poisson flow

Extension of a saddle point mirror descent algorithm with application to robust PageRank

On effectiveness of the Mirror Decent Algorithm for a stochastic multi-armed bandit governed by a stationary finite Markov chain

Mirror decent algorithm for a multi-armed bandit governed by a stationary finite state Markov chain

Add recipient

Sending message cancelled

Are you sure you want to cancel sending this message?

Send message

Filter options

Publication date

Date range setting

Set the date range to filter the displayed results. You can set a starting date, ending date or both. You can enter the dates manually or choose them from the calendar.

Keywords

Reporting an error / abuse

Sending the report failed

Accessibility options