My Agent is taking upto 50000 episodes to converge for the Catch Environment but by default Runner trains the agent only for 10000 episodes for the Catch Environment. Can the runner be enabled to run for more episodes?
BSuite gives the limit of 10000 as a fair amount episodes needed to converge.
Generally speaking, to have a level playing field among all competitors, while also not making it too easy, some constraint has to he applied. In this case, the number of episodes serves that role.
I know the number 10000 may be arbitrary, so if enough students feel it should be increased, we’ll do it. For now please try to improve your algorithm to get the highest score with 10000 episodes.