Bandit Online Learning in Merely Coherent Games with Multi-Point Pseudo-Gradient Estimate

Yuanhanqing Huang, Jianghai Hu. Bandit Online Learning in Merely Coherent Games with Multi-Point Pseudo-Gradient Estimate. In 62nd IEEE Conference on Decision and Control, CDC 2023, Singapore, December 13-15, 2023. pages 1233-1238, IEEE, 2023.
