Callbacks¶
- class rl_zoo3.callbacks.ParallelTrainCallback(gradient_steps=100, verbose=0, sleep_time=0.0)[source]¶
Callback to explore (collect experience) and train (do gradient steps) at the same time using two separate threads. Normally used with off-policy algorithms and train_freq=(1, “episode”).
TODO: - blocking mode: wait for the model to finish updating the policy before collecting new experience at the end of a rollout - force sync mode: stop training to update to the latest policy for collecting new experience
- Parameters:
gradient_steps (
int) – Number of gradient steps to do before sending the new policyverbose (
int) – Verbosity levelsleep_time (
float) – Limit the fps in the thread collecting experience.
- class rl_zoo3.callbacks.RawStatisticsCallback(verbose=0)[source]¶
Callback used for logging raw episode data (return and episode length).
- class rl_zoo3.callbacks.SaveVecNormalizeCallback(save_freq, save_path, name_prefix=None, verbose=0)[source]¶
Callback for saving a VecNormalize wrapper every
save_freqsteps- Parameters:
save_freq (
int) – (int)save_path (
str) – (str) Path to the folder whereVecNormalizewill be saved, asvecnormalize.pklname_prefix (
Optional[str]) – (str) Common prefix to the savedVecNormalize, if None (default) only one file will be kept.