RL can be really difficult and frustrating. Feedback on “Modular RL” library I’m building?
RL sounds like a lot of fun from the outside. “AI for training robots to learn from experience”, sounds good. But when you dive in, it can be really frustrating and overwhelming to learn. Rather than being a single clear algorithm, there are many named algorithms: Actor Critic, A2C, PPO, DDPG, TD3, SAC etc.. it turns out that every named algorithm is the result of a research paper. But generally, these are not distinctive algorithms. For instance, if […]