"Off-Policy Reinforcement Learning for Partially Unknown Nonzero-Sum Games."

Qichao Zhang, Dongbin Zhao, Sibo Zhang (2017)

Details and statistics

DOI: 10.1007/978-3-319-70087-8_84

access: closed

type: Conference or Workshop Paper

metadata version: 2021-10-14