Soft Actor Critic (Visualized) Part 2: Lunar Lander Example from Scratch in Torch
From-scratch SAC in PyTorch applied to Lunar Lander: extends the Inverted Pendulum implementation to a harder task with sparse rewards and a 2D continuous action space.
Read more



