RL-based Control of a Lunar Lander
Lunar Landing Algorithm Comparison
We compare the performance of four different algorithms for controlling lunar lander simulations:
- DQN
- Q-Learning
- SARSA
- Monte Carlo
- Random
For each algorithm, we provide both a simulation GIF and a trajectory plot illustrating the landing path.
DQN


DQN: Simulation (left) and trajectory plot (right).
Q-Learning


Q-Learning: Simulation (left) and trajectory plot (right).
SARSA


SARSA: Simulation (left) and trajectory plot (right).
Monte Carlo


Monte Carlo: Simulation (left) and trajectory plot (right).
Random


Random: Simulation (left) and trajectory plot (right).