Thanks @CH_do for pointing out a bug in the evaluation loop which prevented done being passed to the agent after the env was reset.
This is fixed now. Please merge the changes from the latest starter kit.
Also, note that this means the data passed to an agent when done=True will be new observation from env.reset(). The evaluator on the server will follow the same pattern as local_evaluation.