Demos 2020 are hard to understand, so we wonder if we can use demos 2019 to extract useful information ? (e.g., divide demos by actions and rewards, locate actions, …). Demos 2019 are only used to help agent training, final agents would be evaluated on vectorobf environments.
Best regards.