"A functional mirror ascent view of policy gradient methods with function ..."

Sharan Vaswani et al. (2021)