I need some guidance from you regarding the observation part.
I’m trying to understand how the observation is constructed from the “ flatland.observation.get(handle) ”, but I have difficulties understanding it from the comments. I’m not sure if I can find more details/examples somewhere else.
Some update is needed to the comments, due to the changes from 5 features to 8 features.
Moreover, I think a little bit clarification in the terminology would be really helpful for the readers and the challenge participants
For Example #2: “ distance to a target of another agent is detected between the previous node and the current one .” à is the calculated value the distance to another agent’s target? What does exactly the previous node mean?
Is the Current node where the agent is located, or it is the node that we are getting the features for?