"Transition-based versus state-based reward functions for MDPs with ..."

Shuai Ma, Jia Yuan Yu (2017)

Details and statistics

DOI: 10.1109/ALLERTON.2017.8262843

access: closed

type: Conference or Workshop Paper

metadata version: 2019-12-13
