"Risk Aversion Operator for Addressing Maximization Bias in Q-Learning."

Bi Wang et al. (2020)

Details and statistics

DOI: 10.1109/ACCESS.2020.2977400

access: open

type: Journal Article

metadata version: 2020-04-09

a service of  Schloss Dagstuhl - Leibniz Center for Informatics