Hardware implementation of the upper confidence-bound algorithm for reinforcement learning

Nevena Radovic, Milena Zogovic Erceg. Hardware implementation of the upper confidence-bound algorithm for reinforcement learning. Computers & Electrical Engineering, 96(Part):107537, 2021. [doi]

Abstract

Abstract is missing.