Suggestions about the competition startkit

I think the competition startkit is not very clear, the upper bound of the submission time is 20, we don’t have enough chances to debug it.

  1. Your team can write a baseline (DQN, PPO) using startkit.
  2. Tell more details about the code format we should follow.