Rules for Agent Training

landon_chambers · June 13, 2019, 4:02am

I am unclear on the rules for training. Can I use all of the provided environments and provided demonstration data for training? Also, I believe the rules state that we are not allowed to specify an agent’s action policy by-hand, but how about an ‘environment training policy’ - something like train x number of times in Navigate, y number of times in Treechop, etc… ?

BrandonHoughton · June 14, 2019, 1:00am

You can use any environment you would like for training! For testing your agent will only be evaluate on MineRLObtainDiamond-v0, and you are welcome to chose the number of training examples by hand!

landon_chambers · June 14, 2019, 2:50pm

@BrandonHoughton Understood. Thank you!