A Bandit Learning Method for Continuous Games Under Feedback Delays with Residual Pseudo-Gradient Estimate - researchr publication

researchr

You are not signed in
Sign in
Sign up

Yuanhanqing Huang, Jianghai Hu. A Bandit Learning Method for Continuous Games Under Feedback Delays with Residual Pseudo-Gradient Estimate. In 62nd IEEE Conference on Decision and Control, CDC 2023, Singapore, December 13-15, 2023. pages 1207-1212, IEEE, 2023. [doi]

Abstract is missing.

runs on WebDSL