Abstract is missing.
- Lazy Planning under Uncertainty by Optimizing Decisions on an Ensemble of Incomplete Disturbance TreesBoris Defourny, Damien Ernst, Louis Wehenkel. 1-14 [doi]
- Exploiting Additive Structure in Factored MDPs for Reinforcement LearningThomas Degris, Olivier Sigaud, Pierre-Henri Wuillemin. 15-26 [doi]
- Algorithms and Bounds for Rollout Sampling Approximate Policy IterationChristos Dimitrakakis, Michail G. Lagoudakis. 27-40 [doi]
- Efficient Reinforcement Learning in Parameterized Models: Discrete Parameter CaseKirill Dyagilev, Shie Mannor, Nahum Shimkin. 41-54 [doi]
- Regularized Fitted Q-Iteration: Application to PlanningAmir Massoud Farahmand, Mohammad Ghavamzadeh, Csaba Szepesvári, Shie Mannor. 55-68 [doi]
- A Near Optimal Policy for Channel Allocation in Cognitive RadioSarah Filippi, Olivier Cappé, Fabrice Clérot, Eric Moulines. 69-81 [doi]
- Evaluation of Batch-Mode Reinforcement Learning Methods for Solving DEC-MDPs with Changing Action SetsThomas Gabel, Martin Riedmiller. 82-95 [doi]
- Bayesian Reward FilteringMatthieu Geist, Olivier Pietquin, Gabriel Fricout. 96-109 [doi]
- Basis Expansion in Natural Actor Critic MethodsSertan Girgin, Philippe Preux. 110-123 [doi]
- Reinforcement Learning with the Use of Costly FeaturesRobby Goetschalckx, Scott Sanner, Kurt Driessens. 124-135 [doi]
- Variable Metric Reinforcement Learning Methods Applied to the Noisy Mountain Car ProblemVerena Heidrich-Meisner, Christian Igel. 136-150 [doi]
- Optimistic Planning of Deterministic SystemsJean-François Hren, Rémi Munos. 151-164 [doi]
- Policy Iteration for Learning an Exercise Policy for American OptionsYuxi Li, Dale Schuurmans. 165-178 [doi]
- Tile Coding Based on Hyperplane TilesDaniele Loiacono, Pier Luca Lanzi. 179-190 [doi]
- Use of Reinforcement Learning in Two Real ApplicationsJosé David Martín-Guerrero, Emilio Soria-Olivas, Marcelino Martínez-Sober, Antonio J. Serrano-López, Rafael Magdalena-Benedito, Juan Gómez-Sanchís. 191-204 [doi]
- Applications of Reinforcement Learning to Structured PredictionFrancis Maes, Ludovic Denoyer, Patrick Gallinari. 205-219 [doi]
- Policy Learning - A Unified Perspective with Applications in RoboticsJan Peters, Jens Kober, Duy Nguyen-Tuong. 220-228 [doi]
- Probabilistic Inference for Fast Learning in ControlCarl Edward Rasmussen, Marc Peter Deisenroth. 229-242 [doi]
- United We Stand: Population Based Methods for Solving Unknown POMDPsNoel Welsh, Jeremy Wyatt. 243-252 [doi]
- New Error Bounds for Approximations from Projected Linear EquationsHuizhen Yu, Dimitri P. Bertsekas. 253-267 [doi]
- Markov Decision Processes with Arbitrary Reward ProcessesJia Yuan Yu, Shie Mannor, Nahum Shimkin. 268-281 [doi]