Daniela Pucci de Farias, Benjamin Van Roy. Approximate value iteration with randomized policies. In 39th IEEE Conference on Decision and Control, CDC 2000, Sydney, Australia, December 12-15, 2000. pages 3421-3426, IEEE, 2000. [doi]
Abstract is missing.