In order to train a model locally and visualizably, using the following code we get an dict observation_2 that contains 4 keys and 339 numbers. These numbers contains an 2×11×11 2D target velocities on an 11×11 grid and the 97D body state(242+97=339).
Here it says S is a 97D vector representing the body state.
But in the evaluation environment which is not visualizable, we can use the following code to get an dict observation containing 14 keys and 688 numbers,
And when I use the first code to get the observation_2, it reports the following error:
Attempt to call step function after max_steps=1000 in a single simulation. Please reset your environment before calling the step function after max_step s
and here is the question: how can i get the 97D body state in the evaluation environment?
Sorry about the confusion. We recently noticed this issue and am working on it so that the evaluation environment will give the same observation dictionary as the current local environment. We will let you know once this is solved.
Hello! Can you please specify, will it be possible to obtain 14-keys observation dict from evaluation client in future (without errors), or should we operate with 4-keys in both training and evaluation? Thanks
Thanks a lot. The docker image aicrowd/neurips-learning-to-move-subcontractor:latest has updated and the local grader runs successfully now. And the new submit option is more convenient,