Support for Monte Carlo algorithms

naakhash_cs17b020 · May 25, 2021, 3:51pm

Hi,

The current implementation of the Runner class does not tell the agent when an episode has ended, so we cannot implement algorithms that need this information, such as Monte Carlo methods, which update only once the full episode return is known. Please modify the Runner class to also pass done as an argument to the learn function.
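To make the request concrete, here is a rough sketch of what I have in mind. It is not the actual Runner code: `ToyEnv`, `MonteCarloAgent`, `run`, and the argument order of `learn` are all made-up names and assumptions for illustration. The only point is that the loop forwards `done` to `learn`, so the agent can tell when to compute returns.

```python
class ToyEnv:
    """Hypothetical 3-step environment: reward 1.0 per step, then the episode ends."""

    def reset(self):
        self.t = 0
        return self.t

    def step(self, action):
        self.t += 1
        done = self.t >= 3
        return self.t, 1.0, done


class MonteCarloAgent:
    """Every-visit Monte Carlo value estimation; relies on `done` to close an episode."""

    def __init__(self, gamma=1.0):
        self.gamma = gamma
        self.episode = []        # (state, reward) pairs of the episode in progress
        self.returns_sum = {}    # state -> sum of observed returns
        self.returns_count = {}  # state -> number of observed returns
        self.values = {}         # state -> average return

    def act(self, state):
        return 0  # single dummy action, enough for this sketch

    def learn(self, state, action, reward, next_state, done):
        self.episode.append((state, reward))
        if not done:
            return  # nothing to update until the episode is complete
        # Episode finished: walk backwards, accumulating the discounted return.
        g = 0.0
        for s, r in reversed(self.episode):
            g = r + self.gamma * g
            self.returns_sum[s] = self.returns_sum.get(s, 0.0) + g
            self.returns_count[s] = self.returns_count.get(s, 0) + 1
            self.values[s] = self.returns_sum[s] / self.returns_count[s]
        self.episode = []


def run(env, agent, episodes):
    """Hypothetical stand-in for the Runner loop, forwarding `done` to agent.learn."""
    for _ in range(episodes):
        state = env.reset()
        done = False
        while not done:
            action = agent.act(state)
            next_state, reward, done = env.step(action)
            agent.learn(state, action, reward, next_state, done)
            state = next_state


agent = MonteCarloAgent()
run(ToyEnv(), agent, episodes=2)
print(agent.values)
```

Without the `done` flag, the agent in this sketch would have no way of knowing when to stop buffering transitions and compute the returns.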

nimish_santosh · May 26, 2021, 8:40am

Hey,

We’re making the necessary changes to the code; it will be updated soon.