Learning Heuristics for the TSP by Policy Gradient

Michel Deudon, Pierre Cournut, Alexandre Lacoste, Yossiri Adulyasak, Louis-Martin Rousseau. Learning Heuristics for the TSP by Policy Gradient. In Willem Jan van Hoeve, editor, Integration of Constraint Programming, Artificial Intelligence, and Operations Research - 15th International Conference, CPAIOR 2018, Delft, The Netherlands, June 26-29, 2018, Proceedings. Volume 10848 of Lecture Notes in Computer Science, pages 170-181, Springer, 2018. [doi]

Abstract

Abstract is missing.