"A temporal difference method for multi-objective reinforcement learning."

Manuela Ruiz-Montiel, Lawrence Mandow, José-Luis Pérez-de-la-Cruz (2017)

Details and statistics

DOI: 10.1016/J.NEUCOM.2016.10.100

access: closed

type: Journal Article

metadata version: 2017-08-07