"A Note on Target Q-learning For Solving Finite MDPs with A Generative Oracle."

Ziniu Li, Tian Xu, Yang Yu (2022)

Details and statistics

DOI: 10.48550/ARXIV.2203.11489

access: open

type: Informal or Other Publication

metadata version: 2022-03-29

a service of  Schloss Dagstuhl - Leibniz Center for Informatics