Hardware implementation of the upper confidence-bound algorithm for reinforcement learning

Nevena Radovic, Milena Zogovic Erceg. Hardware implementation of the upper confidence-bound algorithm for reinforcement learning. Computers & Electrical Engineering, 96(Part):107537, 2021.

Authors

Nevena Radovic

Milena Zogovic Erceg
