Mirko Stappert, Bernhard Lutz, Janis Brammer, Dirk Neumann. Solving the paint shop problem with flexible management of multi-lane buffers using reinforcement learning and action masking. European Journal of Operational Research, 332(1):52-65, 2026.