I tested the Obstacle tower environment with local machines.
I confirmed that the action space is consist of 4 numbers in list, like [0, 0, 0, 1]
I submitted a starter kit agent for a test, and it evaluated successfully.
Then, I tested my agent for submission which slightly modified from starter kit.
The modification was to force jump action 0 from env.action_space.sample()
Actual source code is below. It is part of run.py
in run_episode(env) function
.
while not done:
action = env.action_space.sample()
action[2] = 0
obs, reward, done, info = env.step(action)
From evaluation log, it stuck at step 0.
It is my first try to participate in this kind of challenges, therefore I am not familiar with the environment.
What is the problem with my code?