A Bandit Learning Method for Continuous Games Under Feedback Delays with Residual Pseudo-Gradient Estimate

Yuanhanqing Huang, Jianghai Hu. A Bandit Learning Method for Continuous Games Under Feedback Delays with Residual Pseudo-Gradient Estimate. In 62nd IEEE Conference on Decision and Control, CDC 2023, Singapore, December 13-15, 2023. Pages 1207-1212, IEEE, 2023.
