"n-step temporal difference learning with optimal n."

Lakshmi Mandal, Shalabh Bhatnagar (2025)

DOI: 10.1016/J.AUTOMATICA.2025.112449

access: closed

type: Journal Article

metadata version: 2025-07-16