莱奥本大学 Hossein Hajiabolhassan研究员 学术报告

时间:2022-11-08浏览:10设置

报告题目:An Introduction to Reinforcement Learning

人:Hossein Hajiabolhassan,莱奥本大学

报告时间:2022年1111日15:00-17:00

报告地点:腾讯会议,会议ID:620-901-689

摘要:

This talk provides an introduction to reinforcement learning and multiarmed bandit as a subclass of reinforcement learning problems. Reinforcement learning is a learning technique in which an agent has to interact with an environment by selecting and running actions, and progressively discovers the environment dynamics. Multi-armed bandit problem is derived from slot machines and an agent could pull the arms in order to maximize its cumulative reward in the long term. An agent learns optimal behavior through its interactions with arms. Multi-armed bandit problem has several interesting applications such as recommendation systems. In this talk, we review some of well-known algorithms to tackle multi-armed bandit problem.

邀请人:朱绪鼎


浙江师范大学离散数学研究中心版权所有 © 2018-2028
地址:浙江省金华市迎宾大道688号21幢 邮政编码:321004
联系电话:0579-82282629   电子邮箱:jcsx@zjnu.cn    管理登陆