Sometimes, even when a submission is made without any changes to the code, the metrics vary drastically between evaluations. I can't figure out why. Could someone help me understand what might be causing this?
There’s inherent randomness in the training process; RL exacerbates this randomness.
We just pushed updates to the eval code and starter kit, including how the metrics are calculated.
Yes, that's right, it might be the case during training… but my submission uses a trained agent and only runs evaluation. Would that still happen even with a trained agent whose weights are frozen?
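It can: even with frozen weights, any stochasticity in the environment (or in action sampling) makes per-run metrics differ unless the evaluation seeds everything. A minimal toy sketch, assuming nothing about the competition's actual eval code (the environment and policy here are made up for illustration):

```python
import random

def evaluate(policy_value, seed=None, episodes=100):
    """Evaluate a fixed (frozen) policy in a toy stochastic environment.

    No learning happens here, yet the environment's random noise makes
    the average return vary run to run unless we fix the RNG seed.
    """
    rng = random.Random(seed)
    total = 0.0
    for _ in range(episodes):
        # Toy dynamics: the return is the policy's value plus Gaussian noise.
        total += policy_value + rng.gauss(0, 1)
    return total / episodes

frozen_policy = 1.0  # stands in for a trained agent with frozen weights

# Unseeded runs generally differ from each other...
run_a = evaluate(frozen_policy)
run_b = evaluate(frozen_policy)

# ...while seeded runs are reproducible.
assert evaluate(frozen_policy, seed=42) == evaluate(frozen_policy, seed=42)
```

If the evaluation server does not pin environment seeds (or the agent samples actions stochastically at inference time), this kind of run-to-run variance would show up even for byte-identical submissions.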