Customised pearlmutter propagation: A hardware architecture for trust region policy optimisation

Shengjia Shao, Wayne Luk. Customised pearlmutter propagation: A hardware architecture for trust region policy optimisation. In Marco D. Santambrogio, Diana Göhringer, Dirk Stroobandt, Nele Mentens, Jari Nurmi, editors, 27th International Conference on Field Programmable Logic and Applications, FPL 2017, Ghent, Belgium, September 4-8, 2017. pages 1-6, IEEE, 2017. [doi]

Abstract

Abstract is missing.