I am pretty sure there was a solution tagged as ‘RL’ that made it to the top with a leaderboard score < -0.1. I no longer see it. Just curious, what might have happened to it? I am not complaining; I just want to understand how high the score of a pure RL-based approach can go.
I am also curious about this line in the guidelines:
The top three teams in the final round which use a reinforcement learning approach for their winning submission will be awarded one travel grant each.
Does this mean only a pure multi-agent reinforcement learning approach qualifies, or would a hybrid approach, such as a mix of OR and RL (I still need to give some thought to how to do it), be acceptable too? Knowing this would help me focus my efforts: should I concentrate more on code efficiency (like last time’s winner, who used C++) or on experimenting with methodology?
I would really appreciate any guidance. Looking forward to your response.