"An analysis of temporal-difference learning with function approximation."

John N. Tsitsiklis, Benjamin Van Roy (1997)

Details and statistics

DOI: 10.1109/9.580874

access: closed

type: Journal Article

metadata version: 2021-08-17

a service of  Schloss Dagstuhl - Leibniz Center for Informatics