Learning Heuristics for the TSP by Policy Gradient

Michel Deudon, Pierre Cournut, Alexandre Lacoste, Yossiri Adulyasak, Louis-Martin Rousseau. Learning Heuristics for the TSP by Policy Gradient. In Willem Jan van Hoeve, editor, Integration of Constraint Programming, Artificial Intelligence, and Operations Research - 15th International Conference, CPAIOR 2018, Delft, The Netherlands, June 26-29, 2018, Proceedings. Volume 10848 of Lecture Notes in Computer Science, pages 170-181, Springer, 2018. [doi]

Authors

Michel Deudon

This author has not been identified. Look up 'Michel Deudon' in Google

Pierre Cournut

This author has not been identified. Look up 'Pierre Cournut' in Google

Alexandre Lacoste

This author has not been identified. Look up 'Alexandre Lacoste' in Google

Yossiri Adulyasak

This author has not been identified. Look up 'Yossiri Adulyasak' in Google

Louis-Martin Rousseau

This author has not been identified. Look up 'Louis-Martin Rousseau' in Google