Abstract is missing.
- PrefaceMarc Peter Deisenroth, Csaba Szepesvári, Jan Peters. [doi]
- Learning Exploration/Exploitation Strategies for Single Trajectory Reinforcement LearningMichael Castronovo, Francis Maes, Raphael Fonteneau, Damien Ernst. 1-10 [doi]
- Feature Reinforcement Learning using Looping Suffix TreesMayank Daswani, Peter Sunehag, Marcus Hutter. 11-24 [doi]
- Planning in Reward-Rich Domains via PAC BanditsSergiu Goschin, Ari Weinstein, Michael L. Littman, Erick Chastain. 25-42 [doi]
- Actor-Critic Reinforcement Learning with Energy-Based PoliciesNicolas Heess, David Silver, Yee Whye Teh. 43-58 [doi]
- Directed Exploration in Reinforcement Learning with Transferred KnowledgeTimothy Arthur Mann, Yoonsuck Choe. 59-76 [doi]
- Online Skill Discovery using Graph-based ClusteringJan Hendrik Metzen. 77-88 [doi]
- An Empirical Analysis of Off-policy Learning in Discrete MDPsCosmin Paduraru, Doina Precup, Joelle Pineau, Gheorghe Comanici. 89-102 [doi]
- Evaluation and Analysis of the Performance of the EXP3 Algorithm in Stochastic EnvironmentsYevgeny Seldin, Csaba Szepesvári, Peter Auer, Yasin Abbasi-Yadkori. 103-116 [doi]
- Gradient Temporal Difference NetworksDavid Silver. 117-130 [doi]
- Semi-Supervised Apprenticeship LearningMichal Valko, Mohammad Ghavamzadeh, Alessandro Lazaric. 131-142 [doi]
- An investigation of imitation learning algorithms for structured predictionAndreas Vlachos. 143-154 [doi]
- Rollout-based Game-tree Search Outprunes Traditional Alpha-betaAri Weinstein, Michael L. Littman, Sergiu Goschin. 155-167 [doi]