Speedup Training Artificial Intelligence for Mahjong via Reward Variance Reduction

Jinqiu Li, Shuang Wu, Haobo Fu, Qiang Fu, Enmin Zhao, Junliang Xing. Speedup Training Artificial Intelligence for Mahjong via Reward Variance Reduction. In IEEE Conference on Games, CoG 2022, Beijing, China, August 21-24, 2022. pages 345-352, IEEE, 2022.
