Wyniki wyszukiwania dla: Orhan Sonmez

Pozycje od 1 do 4 spośród 4 wyników

rozdział

Sequential Monte Carlo samplers for model-based reinforcement learning

Orhan Sonmez, A. Taylan Cemgil

2013 21st Signal Processing and Communications Applications Conference (SIU) > 1 - 4

2013 21st Signal Processing and Communications Applications Conference (SIU)

Reinforcement learning problems are generally solved by using fixed-point iterations that converge to the suboptimal solutions of Bellman equations. However, it is also possible to formalize this problem as an equivalent likelihood maximization problem and employ probabilistic inference methods. We proposed an expectation-maximization algorithm that utilizes sequential Monte Carlo samplers with Metropolis-Hastings...

rozdział

Importance sampling for model-based reinforcement learning

Orhan Sonmez, A. Taylan Cemgil

2012 20th Signal Processing and Communications Applications Conference (SIU) > 1 - 4

2012 20th Signal Processing and Communications Applications Conference (SIU)

Most of the state-of-the-art reinforcement learning algorithms are based on Bellman equations and make use of fixed-point iteration methods to converge to suboptimal solutions. However, some of the recent approaches transform the reinforcement learning problem into an equivalent likelihood maximization problem with using appropriate graphical models. Hence, it allows the adoption of probabilistic...

rozdział

Reinforcement learning for peer to peer video streaming applications

Muge Sayit, Orhan Sonmez

2012 20th Signal Processing and Communications Applications Conference (SIU) > 1 - 4

2012 20th Signal Processing and Communications Applications Conference (SIU)

In this study, a system with reinforcement learning for push-pull mesh based video streaming applications running over p2p networks is designed. In push-pull based video streaming systems, each node in the system may receive video data from more than one parent. In the proposed system, a node which started to receive insufficient video data from any parent selects a new parent with a probabilistic...

artykuł

Combined perception and control for timing in robotic music performances

Umut Şimşekli, Orhan Sönmez, Barş Kurt, Ali Taylan Cemgil

EURASIP Journal on Audio, Speech, and Music Processing > 2012 > 2012 > 1 > 1-20

Interaction with human musicians is a challenging task for robots as it involves online perception and precise synchronization. In this paper, we present a consistent and theoretically sound framework for combining perception and control for accurate musical timing. For the perception, we develop a hierarchical hidden Markov model that combines event detection and tempo tracking. The robot performance...

INFONA - portal komunikacji naukowej

Wyniki wyszukiwania dla: Orhan Sonmez

Sequential Monte Carlo samplers for model-based reinforcement learning

Importance sampling for model-based reinforcement learning

Reinforcement learning for peer to peer video streaming applications

Combined perception and control for timing in robotic music performances

Opcje filtrowania

Data publikacji

Typ publikacji

Słowa kluczowe

Zbiór danych

INFONA - portal komunikacji naukowej

Wyniki wyszukiwania dla: Orhan Sonmez

Sequential Monte Carlo samplers for model-based reinforcement learning

Importance sampling for model-based reinforcement learning

Reinforcement learning for peer to peer video streaming applications

Combined perception and control for timing in robotic music performances

Dodaj adresata

Anulowanie wysłania wiadomości

Czy na pewno chcesz anulować wysłanie wiadomości?

Wyślij wiadomość

Opcje filtrowania

Data publikacji

Ustawianie zakresu dat

Podaj zakres dat dla filtrowania wyświetlonych wyników. Możesz podać datę początkową, końcową lub obie daty. Daty możesz wpisać ręcznie lub wybrać za pomocą kalendarza.

Typ publikacji

Słowa kluczowe

Zbiór danych

Zgłaszanie błędu / nadużycia

Nieudane wysłanie zgłoszenia

Ułatwienia dostępu