"A Reinforcement Learning Algorithm Based on Policy Iteration for Average ..."

Abhijit Gosavi (2004)

Details and statistics

DOI: 10.1023/B:MACH.0000019802.64038.6C

access: closed

type: Journal Article

metadata version: 2023-08-28
