@claudio1212 The RL example has been updated, and it should run without issues. The LSTM policy is indeed not available any more from SB3, but it is implemented in sb3-contrib. There is also an ...
I experienced when train_freq and save_frequency_hours do not align, that issues arise when writing rl params as outputs. Because the two rl_param outputs cannot be merged correctly, it results in ...