The final score will be computed by normalising the score for each of the 5 environments to the range 0 to 1 and then adding the normalised scores together.
For the Acrobot and Taxi environments, min-max normalisation will be used.
For the KBC problems, min-max normalisation after applying a log operator will be used.
The scores will be computed after the submission period has ended.
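To make the two normalisation schemes concrete, here is a minimal sketch in Python. It is only an illustration, not the official grading code: the function names are hypothetical, and it assumes that the minimum and maximum are taken over all participants' scores for a given environment, that the log is the natural logarithm, that KBC scores are strictly positive, and that tied scores all map to 0.

```python
import math

def min_max(scores):
    """Rescale a list of scores linearly so the lowest is 0 and the highest is 1."""
    lo, hi = min(scores), max(scores)
    if hi == lo:
        # Assumption: if every score is identical, map them all to 0.
        return [0.0 for _ in scores]
    return [(s - lo) / (hi - lo) for s in scores]

def log_min_max(scores):
    """Apply a log to each score, then min-max normalise the results.

    Assumption: natural log, and every score is strictly positive.
    """
    return min_max([math.log(s) for s in scores])

def final_scores(normalised_per_env):
    """Sum each participant's normalised scores across environments.

    normalised_per_env is a list with one entry per environment; each entry
    is a list of normalised scores, ordered the same way by participant.
    """
    return [sum(per_participant) for per_participant in zip(*normalised_per_env)]

# Made-up numbers for three participants, purely for illustration.
acrobot = min_max([-200.0, -100.0, -150.0])
kbc = log_min_max([1.0, 10.0, 100.0])
totals = final_scores([acrobot, kbc])
```

Taking the log first compresses large differences between KBC scores, so a single very high score does not push everyone else's normalised value close to 0.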
Hi. How can we select our best submission to be considered for grading?
This is crucial, given that the leaderboard only updates based on the Acrobot problem.