Title

TDGammon Revisited: Integrating Invalid Actions and Dice Factor In Continuous Action and Observation Space

Abstract

Abstract

After TDGammon's success in 1991, the interest in game-playing agents has risen significantly. With the developments in Deep Learning and emulations for older games has been created, human-level control for Atari games has been achieved and Deep Reinforcement Learning has proven itself to be a success. However, the ancestor of DRL, TDGammon, and its game Backgammon got out of sight, because of the fact that Backgammon's actions are much more complex than other games (most of the Atari games has 2 or 4 different actions), the huge action space has much invalid actions, and there is a dice factor which involves stochasticity. Last but not least, the professional level in Backgammon has been achieved a long time ago. In this thesis, the latest methods in DRL will be tested against its ancestor game, Backgammon, while trying to teach how to select valid moves and considering the dice factor.

Biography:

Supervisor(s)

Supervisor(s)

ENGIN DENIZ USTA

Date and Location

Date and Location

2018-09-13;10:00:00-A105

Category

Category

MSc_Thesis