We are very excited to join forces with MLCommons and OctoML.ai! Contact Grigori Fursin for more details!

A Local Approach to Forward Model Learning: Results on the Game of Life Game

lib:a25a5c5c1b9cace0 (v1.0.0)

Authors: Simon M. Lucas,Alexander Dockhorn,Vanessa Volz,Chris Bamford,Raluca D. Gaina,Ivan Bravi,Diego Perez-Liebana,Sanaz Mostaghim,Rudolf Kruse
ArXiv: 1903.12508
Document:  PDF  DOI 
Abstract URL: http://arxiv.org/abs/1903.12508v1


This paper investigates the effect of learning a forward model on the performance of a statistical forward planning agent. We transform Conway's Game of Life simulation into a single-player game where the objective can be either to preserve as much life as possible or to extinguish all life as quickly as possible. In order to learn the forward model of the game, we formulate the problem in a novel way that learns the local cell transition function by creating a set of supervised training data and predicting the next state of each cell in the grid based on its current state and immediate neighbours. Using this method we are able to harvest sufficient data to learn perfect forward models by observing only a few complete state transitions, using either a look-up table, a decision tree or a neural network. In contrast, learning the complete state transition function is a much harder task and our initial efforts to do this using deep convolutional auto-encoders were less successful. We also investigate the effects of imperfect learned models on prediction errors and game-playing performance, and show that even models with significant errors can provide good performance.

Relevant initiatives  

Related knowledge about this paper Reproduced results (crowd-benchmarking and competitions) Artifact and reproducibility checklists Common formats for research projects and shared artifacts Reproducibility initiatives

Comments  

Please log in to add your comments!
If you notice any inapropriate content that should not be here, please report us as soon as possible and we will try to remove it within 48 hours!