ArXiv | 2021

Heuristics, Answer Set Programming and Markov Decision Process for Solving a Set of Spatial Puzzles

Abstract

Spatial puzzles composed of rigid objects, flexible strings and holes offer interesting domains for reasoning about spatial entities that are common in the human daily-life s activities. The goal of this work is to investigate the automated solution of this kind of puzzles adapting an algorithm that combines Answer Set Programming (ASP) with Markov Decision Process (MDP), algorithm oASP(MDP), to use heuristics accelerating the learning process. ASP is applied to represent the domain as an MDP, while a Reinforcement Learning algorithm (Q-Learning) is used to find the optimal policies. In this work, the heuristics were obtained from the solution of relaxed versions of the puzzles. Experiments were performed on deterministic, non-deterministic and non-stationary versions of the puzzles. Results show that the proposed approach can accelerate the learning process, presenting an advantage when compared to the non-heuristic versions of oASP(MDP) and Q-Learning.

Volume abs/1903.03411

ArXiv | 2021

Heuristics, Answer Set Programming and Markov Decision Process for Solving a Set of Spatial Puzzles

Abstract

Volume abs/1903.03411

Pages None

DOI 10.1007/s10489-021-02423-1

Language English

Journal ArXiv

Full Text