Is it true that soc(and soc_init) should always be no larger than capacity?
If this holds, then I guess I might found a bug in energy_model.py.
(else pls just ignore the following content)
In method get_max_input_power, the soc_normalized should be within [0, 1]. But there do exist such cases when self.soc_init gets a little bit larger than self.capacity, causing soc_normalized be like 1.00000001, which is NOT 1.0. And such slight difference could make the result of idx to be wholly different: soc_normalized==1.0 will lead to idx==1 while 1.00000001 will lead to idx==0, thereby totally affecting the result power and then essentially the episode afterwards.
[To reproduce the issue]
The above issue could be reproduced:
For example, using building_1, and follow the below action sequence in an episode:
action_series = [-0.6781939, 0.58110934, -0.73443955, 0.18472742, 0.18293902, -0.65686876, 0.5656311, -0.5761771, -0.46125087, 0.10670366, 0.07985443, 0.81957483, 0.9780224, 0.5936974]
You’ll get a wrong computation of energy at the last time step, cuz the current soc(6.399877760440494) is slightly larger than the current capacity(6.399874036444029).
Another option is, you could write a modified get_max_input_power function:
soc_normalized = np.clip(self.soc_init/self.capacity, 0, 1). Using a random agent and do control/treatment study with the unmodified function, you’ll easily get different observations in episodes.
[Suggestions to fix the bug]
- Clip the soc_normalizedinget_max_input_power;
 Or
- Since capacityis updated(degraded) every time step after updatingsocandenergy_balance, it is possible that it gets degraded to a value less thansoc, therefore I suggest always restricting thesocevery time we updatedcapacityin order to make suresoc <= capacityat all time.