Browsing by Author "94349d45-2b1a-41c5-a4c1-87668e866944"
Now showing items 1-1 of 1
-
Ranked Reward: Enabling Self-Play Reinforcement Learning for Combinatorial Optimization
Laterre, A; Fu, Y; Jabri, MK; Cohen, A-S; Kas, D; Hajjar, K; Dahl, TS; Kerkeni, A; Beguir, KAdversarial self-play in two-player games has delivered impressive results when used with reinforcement learning algorithms that combine deep neural networks and tree search. Algorithms like AlphaZero and Expert Iteration ...