Bi-linear Value Networks for Multi-goal Reinforcement Learning

Zhang-Wei Hong, Ge Yang, Pulkit Agrawal. Bi-linear Value Networks for Multi-goal Reinforcement Learning. In The Tenth International Conference on Learning Representations, ICLR 2022, Virtual Event, April 25-29, 2022. OpenReview.net, 2022. [doi]

Authors

Zhang-Wei Hong

This author has not been identified. Look up 'Zhang-Wei Hong' in Google

Ge Yang

This author has not been identified. Look up 'Ge Yang' in Google

Pulkit Agrawal

This author has not been identified. Look up 'Pulkit Agrawal' in Google