stable-baselines3-contrib
SB3-Contrib is a companion package to Stable-Baselines3 that collects experimental reinforcement learning algorithms and utilities. Its implementations include Augmented Random Search (ARS), Quantile Regression DQN (QR-DQN), and PPO with a recurrent policy (Recurrent PPO). It is aimed at researchers who want to try newer or more specialized methods, and the niche tools it provides extend Stable-Baselines3 rather than replace it.
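
As a rough illustration of how the algorithms named above might be used, the sketch below maps each one to the class name exported by the `sb3_contrib` Python package and trains QR-DQN for a few thousand steps. The helper names (`ALGORITHM_CLASSES`, `load_algorithm`, `train_qrdqn`), the `CartPole-v1` environment, and the step count are assumptions chosen for the example and are not part of the package. It assumes the package has been installed, for instance with `pip install sb3-contrib`.

```python
import importlib

# Class names exported by `sb3_contrib` for the algorithms mentioned above.
# The dictionary itself is a hypothetical helper for this example.
ALGORITHM_CLASSES = {
    "Augmented Random Search": "ARS",
    "Quantile Regression DQN": "QRDQN",
    "Recurrent PPO": "RecurrentPPO",
}


def load_algorithm(name: str):
    """Import sb3_contrib lazily and return the algorithm class for `name`."""
    module = importlib.import_module("sb3_contrib")
    return getattr(module, ALGORITHM_CLASSES[name])


def train_qrdqn(total_timesteps: int = 5_000):
    """Train QR-DQN briefly on CartPole-v1 (an assumed toy environment)."""
    qrdqn_cls = load_algorithm("Quantile Regression DQN")
    # Same constructor pattern as Stable-Baselines3: policy name, then environment.
    model = qrdqn_cls("MlpPolicy", "CartPole-v1", verbose=0)
    model.learn(total_timesteps=total_timesteps)
    return model
```

Calling `train_qrdqn()` returns a trained model whose `predict(observation)` method yields an action, as with Stable-Baselines3 models. Recurrent PPO would need the `"MlpLstmPolicy"` policy name in place of `"MlpPolicy"`. The import is deferred to `load_algorithm` so the module can be loaded even where the package is not installed.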