Does the evaluation metric (as shown on the Stage 2 leaderboard) include safety infractions that occurred during the 1-hour training session, or only those that occurred during the final 3-episode evaluation?