Supervised reinforcement learning in discrete environment domains

Boris Jensen, Daniel Ortiz-Arroyo, Nareli Cruz-Cortés, Francisco Rodríguez-Henríquez

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

This paper describes a supervised reinforcement learning-based model for discrete environment domains. The model was tested within the domain of backgammon game. Our results show that a supervised actor-critic based learning model is capable of improving the initial performance and then eventually reach similar performance levels as those obtained by TD-Gammon, an artificial neural network player (ANN) trained by temporal differences.

Original languageEnglish
Title of host publicationProceedings - 2010 2nd World Congress on Nature and Biologically Inspired Computing, NaBIC 2010
Pages215-220
Number of pages6
DOIs
StatePublished - 2010
Externally publishedYes
Event2010 2nd World Congress on Nature and Biologically Inspired Computing, NaBIC 2010 - Kitakyushu, Japan
Duration: 15 Dec 201017 Dec 2010

Publication series

NameProceedings - 2010 2nd World Congress on Nature and Biologically Inspired Computing, NaBIC 2010

Conference

Conference2010 2nd World Congress on Nature and Biologically Inspired Computing, NaBIC 2010
Country/TerritoryJapan
CityKitakyushu
Period15/12/1017/12/10

Keywords

  • Actorcritic
  • Automata player
  • Machine learning
  • Reinforcement learning

Fingerprint

Dive into the research topics of 'Supervised reinforcement learning in discrete environment domains'. Together they form a unique fingerprint.

Cite this