Finite-Sample Analysis of Multi-Agent Policy Evaluation with Kernelized Gradient Temporal Difference

Paulo Heredia, Shaoshuai Mou. Finite-Sample Analysis of Multi-Agent Policy Evaluation with Kernelized Gradient Temporal Difference. In 59th IEEE Conference on Decision and Control, CDC 2020, Jeju Island, South Korea, December 14-18, 2020. pages 5647-5652, IEEE, 2020. [doi]

References

No references recorded for this publication.

Cited by

No citations of this publication recorded.