Search results for: Orhan Sonmez

Items from 1 to 4 out of 4 results

chapter

Sequential Monte Carlo samplers for model-based reinforcement learning

Orhan Sonmez, A. Taylan Cemgil

2013 21st Signal Processing and Communications Applications Conference (SIU) > 1 - 4

Reinforcement learning problems are generally solved by using fixed-point iterations that converge to the suboptimal solutions of Bellman equations. However, it is also possible to formalize this problem as an equivalent likelihood maximization problem and employ probabilistic inference methods. We proposed an expectation-maximization algorithm that utilizes sequential Monte Carlo samplers with Metropolis-Hastings...

chapter

Importance sampling for model-based reinforcement learning

Orhan Sonmez, A. Taylan Cemgil

2012 20th Signal Processing and Communications Applications Conference (SIU) > 1 - 4

Most state-of-the-art reinforcement learning algorithms are based on Bellman equations and use fixed-point iteration methods to converge to suboptimal solutions. However, some recent approaches transform the reinforcement learning problem into an equivalent likelihood maximization problem by using appropriate graphical models. This allows the adoption of probabilistic...

chapter

Reinforcement learning for peer to peer video streaming applications

Muge Sayit, Orhan Sonmez

2012 20th Signal Processing and Communications Applications Conference (SIU) > 1 - 4

In this study, a reinforcement learning system is designed for push-pull, mesh-based video streaming applications running over P2P networks. In push-pull based video streaming systems, each node may receive video data from more than one parent. In the proposed system, a node that starts to receive insufficient video data from any parent selects a new parent with a probabilistic...

article

Combined perception and control for timing in robotic music performances

Umut Şimşekli, Orhan Sönmez, Barış Kurt, Ali Taylan Cemgil

EURASIP Journal on Audio, Speech, and Music Processing > 2012 > 2012 > 1 > 1-20

Interaction with human musicians is a challenging task for robots as it involves online perception and precise synchronization. In this paper, we present a consistent and theoretically sound framework for combining perception and control for accurate musical timing. For the perception, we develop a hierarchical hidden Markov model that combines event detection and tempo tracking. The robot performance...

INFONA - science communication portal
