"A Policy Iteration Algorithm for Learning from Preference-Based Feedback."

Christian Wirth, Johannes Fürnkranz (2013)


DOI: 10.1007/978-3-642-41398-8_37

access: closed

type: Conference or Workshop Paper

metadata version: 2024-02-28
