:facetid:toc:\"db/conf/ewrl/ewrl2011.bht\"OKScott SannerMarcus HutterRecent Advances in Reinforcement Learning - 9th European Workshop, EWRL 2011, Athens, Greece, September 9-11, 2011, Revised Selected PapersEWRLLecture Notes in Computer Science7188Springer2012Editorshipconf/ewrl/201110.1007/978-3-642-29946-9https://doi.org/10.1007/978-3-642-29946-9https://dblp.org/rec/conf/ewrl/2011URL#4515491Mauricio Araya-LópezOlivier BuffetVincent ThomasFrançois CharpilletActive Learning of MDP Models.EWRL42-532011Conference and Workshop Papersclosedconf/ewrl/Araya-LopezBTC1110.1007/978-3-642-29946-9_8https://doi.org/10.1007/978-3-642-29946-9_8https://dblp.org/rec/conf/ewrl/Araya-LopezBTC11URL#4668651Peter AuerInvited Talk: UCRL and Autonomous Exploration.EWRL12011Conference and Workshop Papersclosedconf/ewrl/Auer1110.1007/978-3-642-29946-9_1https://doi.org/10.1007/978-3-642-29946-9_1https://dblp.org/rec/conf/ewrl/Auer11URL#4668652Georgios BoutsioukisIoannis PartalasIoannis P. VlahavasTransfer Learning in Multi-Agent Reinforcement Learning Domains.EWRL249-2602011Conference and Workshop Papersclosedconf/ewrl/BoutsioukisPV1110.1007/978-3-642-29946-9_25https://doi.org/10.1007/978-3-642-29946-9_25https://dblp.org/rec/conf/ewrl/BoutsioukisPV11URL#4668653Pablo Samuel CastroDoina PrecupAutomatic Construction of Temporally Extended Actions for MDPs Using Bisimulation Metrics.EWRL140-1522011Conference and Workshop Papersclosedconf/ewrl/CastroP1110.1007/978-3-642-29946-9_16https://doi.org/10.1007/978-3-642-29946-9_16https://dblp.org/rec/conf/ewrl/CastroP11URL#4668654Kyriakos C. ChatzidimitriouIoannis PartalasPericles A. MitkasIoannis P. VlahavasTransferring Evolved Reservoir Features in Reinforcement Learning Tasks.EWRL213-2242011Conference and Workshop Papersclosedconf/ewrl/ChatzidimitriouPMV1110.1007/978-3-642-29946-9_22https://doi.org/10.1007/978-3-642-29946-9_22https://dblp.org/rec/conf/ewrl/ChatzidimitriouPMV11URL#4668655Christos DimitrakakisRobust Bayesian Reinforcement Learning through Tight Lower Bounds.EWRL177-1882011Conference and Workshop Papersclosedconf/ewrl/Dimitrakakis1110.1007/978-3-642-29946-9_19https://doi.org/10.1007/978-3-642-29946-9_19https://dblp.org/rec/conf/ewrl/Dimitrakakis11URL#4668656Christos DimitrakakisConstantin A. RothkopfBayesian Multitask Inverse Reinforcement Learning.EWRL273-2842011Conference and Workshop Papersclosedconf/ewrl/DimitrakakisR1110.1007/978-3-642-29946-9_27https://doi.org/10.1007/978-3-642-29946-9_27https://dblp.org/rec/conf/ewrl/DimitrakakisR11URL#4668657Charles ElkanReinforcement Learning with a Bilinear Q Function.EWRL78-882011Conference and Workshop Papersclosedconf/ewrl/Elkan1110.1007/978-3-642-29946-9_11https://doi.org/10.1007/978-3-642-29946-9_11https://dblp.org/rec/conf/ewrl/Elkan11URL#4668658Anestis FachantidisIoannis PartalasMatthew E. TaylorIoannis P. VlahavasTransfer Learning via Multiple Inter-task Mappings.EWRL225-2362011Conference and Workshop Papersclosedconf/ewrl/FachantidisPTV1110.1007/978-3-642-29946-9_23https://doi.org/10.1007/978-3-642-29946-9_23https://dblp.org/rec/conf/ewrl/FachantidisPTV11URL#4668659Matthieu GeistBruno Scherrerℓ1-Penalized Projected Bellman Residual.EWRL89-1012011Conference and Workshop Papersclosedconf/ewrl/GeistS1110.1007/978-3-642-29946-9_12https://doi.org/10.1007/978-3-642-29946-9_12https://dblp.org/rec/conf/ewrl/GeistS11URL#4668660Matthew W. HoffmanAlessandro LazaricMohammad GhavamzadehRémi MunosRegularized Least Squares Temporal Difference Learning with Nested ℓ2 and ℓ1 Penalization.EWRL102-1142011Conference and Workshop Papersclosedconf/ewrl/HoffmanLGM1110.1007/978-3-642-29946-9_13https://doi.org/10.1007/978-3-642-29946-9_13https://dblp.org/rec/conf/ewrl/HoffmanLGM11URL#4668661Kristian KerstingInvited Talk: Increasing Representational Power and Scaling Inference in Reinforcement Learning.EWRL22011Conference and Workshop Papersclosedconf/ewrl/Kersting1110.1007/978-3-642-29946-9_2https://doi.org/10.1007/978-3-642-29946-9_2https://dblp.org/rec/conf/ewrl/Kersting11URL#4668662Edouard KleinMatthieu GeistOlivier PietquinBatch, Off-Policy and Model-Free Apprenticeship Learning.EWRL285-2962011Conference and Workshop Papersclosedconf/ewrl/KleinGP1110.1007/978-3-642-29946-9_28https://doi.org/10.1007/978-3-642-29946-9_28https://dblp.org/rec/conf/ewrl/KleinGP11URL#4668663Seiya KurodaKazuteru MiyazakiHiroaki KobayashiIntroduction of Fixed Mode States into Online Profit Sharing and Its Application to Waist Trajectory Generation of Biped Robot.EWRL297-3082011Conference and Workshop Papersclosedconf/ewrl/KurodaMK1110.1007/978-3-642-29946-9_29https://doi.org/10.1007/978-3-642-29946-9_29https://dblp.org/rec/conf/ewrl/KurodaMK11URL#4668664Ioannis LambrouVassilis VassiliadesChris ChristodoulouAn Extension of a Hierarchical Reinforcement Learning Algorithm for Multiagent Settings.EWRL261-2722011Conference and Workshop Papersclosedconf/ewrl/LambrouVC1110.1007/978-3-642-29946-9_26https://doi.org/10.1007/978-3-642-29946-9_26https://dblp.org/rec/conf/ewrl/LambrouVC11URL#4668665Boris LesnerBruno ZanuttiniHandling Ambiguous Effects in Action Learning.EWRL54-652011Conference and Workshop Papersclosedconf/ewrl/LesnerZ1110.1007/978-3-642-29946-9_9https://doi.org/10.1007/978-3-642-29946-9_9https://dblp.org/rec/conf/ewrl/LesnerZ11URL#4668666Kfir Y. LevyNahum ShimkinUnified Inter and Intra Options Learning Using Policy Gradient Methods.EWRL153-1642011Conference and Workshop Papersclosedconf/ewrl/LevyS1110.1007/978-3-642-29946-9_17https://doi.org/10.1007/978-3-642-29946-9_17https://dblp.org/rec/conf/ewrl/LevyS11URL#4668667Yuxi LiDale SchuurmansMapReduce for Parallel Reinforcement Learning.EWRL309-3202011Conference and Workshop Papersclosedconf/ewrl/LiS1110.1007/978-3-642-29946-9_30https://doi.org/10.1007/978-3-642-29946-9_30https://dblp.org/rec/conf/ewrl/LiS11URL#4668668Francis MaesLouis WehenkelDamien ErnstAutomatic Discovery of Ranking Formulas for Playing with Multi-armed Bandits.EWRL5-172011Conference and Workshop Papersclosedconf/ewrl/MaesWE1110.1007/978-3-642-29946-9_5https://doi.org/10.1007/978-3-642-29946-9_5https://dblp.org/rec/conf/ewrl/MaesWE11URL#4668669Francis MaesLouis WehenkelDamien ErnstOptimized Look-ahead Tree Search Policies.EWRL189-2002011Conference and Workshop Papersclosedconf/ewrl/MaesWE11a10.1007/978-3-642-29946-9_20https://doi.org/10.1007/978-3-642-29946-9_20https://dblp.org/rec/conf/ewrl/MaesWE11aURL#4668670Tohgoroh MatsuiTakashi Goto 0004Kiyoshi IzumiYu Chen 0007Compound Reinforcement Learning: Theory and an Application to Finance.EWRL321-3322011Conference and Workshop Papersclosedconf/ewrl/MatsuiGIC1110.1007/978-3-642-29946-9_31https://doi.org/10.1007/978-3-642-29946-9_31https://dblp.org/rec/conf/ewrl/MatsuiGIC11URL#4668671Kazuteru MiyazakiMasaaki IdaProposal and Evaluation of the Active Course Classification Support System with Exploitation-Oriented Learning.EWRL333-3442011Conference and Workshop Papersclosedconf/ewrl/MiyazakiI1110.1007/978-3-642-29946-9_32https://doi.org/10.1007/978-3-642-29946-9_32https://dblp.org/rec/conf/ewrl/MiyazakiI11URL#4668672Phuong Minh NguyenPeter SunehagMarcus HutterFeature Reinforcement Learning in Practice.EWRL66-772011Conference and Workshop Papersclosedconf/ewrl/NguyenSH1110.1007/978-3-642-29946-9_10https://doi.org/10.1007/978-3-642-29946-9_10https://dblp.org/rec/conf/ewrl/NguyenSH11URL#4668673Sylvie C. W. OngYuri GrinbergJoelle PineauGoal-Directed Online Learning of Predictive Models.EWRL18-292011Conference and Workshop Papersclosedconf/ewrl/OngGP1110.1007/978-3-642-29946-9_6https://doi.org/10.1007/978-3-642-29946-9_6https://dblp.org/rec/conf/ewrl/OngGP11URL#4668674Cosmin PaduraruDoina PrecupJoelle PineauA Framework for Computing Bounds for the Return of a Policy.EWRL201-2122011Conference and Workshop Papersclosedconf/ewrl/PaduraruPP1110.1007/978-3-642-29946-9_21https://doi.org/10.1007/978-3-642-29946-9_21https://dblp.org/rec/conf/ewrl/PaduraruPP11URL#4668675Matthew W. RobardsPeter SunehagGradient Based Algorithms with Loss Functions and Kernels for Improved On-Policy Control.EWRL30-412011Conference and Workshop Papersclosedconf/ewrl/RobardsS1110.1007/978-3-642-29946-9_7https://doi.org/10.1007/978-3-642-29946-9_7https://dblp.org/rec/conf/ewrl/RobardsS11URL#4668676Munu SairameshBalaraman RavindranOptions with Exceptions.EWRL165-1762011Conference and Workshop Papersclosedconf/ewrl/SairameshR1110.1007/978-3-642-29946-9_18https://doi.org/10.1007/978-3-642-29946-9_18https://dblp.org/rec/conf/ewrl/SairameshR11URL#4668677Bruno ScherrerMatthieu GeistRecursive Least-Squares Learning with Eligibility Traces.EWRL115-1272011Conference and Workshop Papersclosedconf/ewrl/ScherrerG1110.1007/978-3-642-29946-9_14https://doi.org/10.1007/978-3-642-29946-9_14https://dblp.org/rec/conf/ewrl/ScherrerG11URL#4668678Matthijs SnelShimon WhitesonMulti-Task Reinforcement Learning: Shaping and Feature Selection.EWRL237-2482011Conference and Workshop Papersclosedconf/ewrl/SnelW1110.1007/978-3-642-29946-9_24https://doi.org/10.1007/978-3-642-29946-9_24https://dblp.org/rec/conf/ewrl/SnelW11URL#4668679Peter StoneInvited Talk: PRISM - Practical RL: Representation, Interaction, Synthesis, and Mortality.EWRL32011Conference and Workshop Papersclosedconf/ewrl/Stone1110.1007/978-3-642-29946-9_3https://doi.org/10.1007/978-3-642-29946-9_3https://dblp.org/rec/conf/ewrl/Stone11URL#4668680Csaba SzepesváriInvited Talk: Towards Robust Reinforcement Learning Algorithms.EWRL42011Conference and Workshop Papersclosedconf/ewrl/Szepesvari1110.1007/978-3-642-29946-9_4https://doi.org/10.1007/978-3-642-29946-9_4https://dblp.org/rec/conf/ewrl/Szepesvari11URL#4668681Nikolaos TziortziotisKonstantinos BlekasValue Function Approximation through Sparse Bayesian Modeling.EWRL128-1392011Conference and Workshop Papersclosedconf/ewrl/TziortziotisB1110.1007/978-3-642-29946-9_15https://doi.org/10.1007/978-3-642-29946-9_15https://dblp.org/rec/conf/ewrl/TziortziotisB11URL#4668682