Hi,
This is more a question. I am implementing some specific experiments using Recurrent PPO, but at some point I would like to set the number of BPTT steps, I mean in a truncated BPTT fashion (let's say I want a recurrence of 32 steps, for example). My questions are:
- In the current implementation, how many BPTT steps are performed?
- Is it possible to change this as an hyper-parameter?
I had a look in the code but haven't managed to figure out where this is performed.
Many thanks in advance!