The following values will be set during the evaluations. Any changes that you make to these parameters will be dropped and replaced with the default values during the evaluations.
stop: timesteps_total: 8000000 time_total_s: 7200 checkpoint_freq: 25 checkpoint_at_end: True env_config: env_name: <accordingly> num_levels: 0 start_level: 0 paint_vel_info: False use_generated_assets: False distribution_mode: easy center_agent: True use_sequential_levels: False use_backgrounds: True restrict_themes: False use_monochrome_assets: False # We use this to generate the videos during training evaluation_interval: 25 evaluation_num_workers: 1 evaluation_num_episodes: 3 evaluation_config: num_envs_per_worker: 1 env_config: render_mode: rgb_array
During the rollouts, we will also pass a
rand_seed to the procgen env.