RL-based Control of a Lunar Lander
Lunar Landing Algorithm Comparison
We compare the performance of four reinforcement-learning algorithms and a random baseline for controlling a simulated lunar lander:
- DQN
- Q-Learning
- SARSA
- Monte Carlo
- Random
For each algorithm, we provide both a simulation GIF and a trajectory plot illustrating the landing path.
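The three tabular methods listed above differ mainly in the target they learn toward. The following is a minimal sketch of those targets, not the code used for these experiments; the function names, the discount factor `gamma`, and the list-based value representation are assumptions made for illustration. DQN uses the same bootstrapped target as Q-Learning, but with a neural network in place of a table.

```python
def q_learning_target(reward, next_q_values, gamma=0.99, done=False):
    """Off-policy target: bootstrap from the best next action value."""
    if done:
        return reward
    return reward + gamma * max(next_q_values)


def sarsa_target(reward, next_q_values, next_action, gamma=0.99, done=False):
    """On-policy target: bootstrap from the action actually taken next."""
    if done:
        return reward
    return reward + gamma * next_q_values[next_action]


def monte_carlo_returns(rewards, gamma=0.99):
    """Discounted return from every step of one finished episode (no bootstrapping)."""
    returns = []
    g = 0.0
    # Walk the episode backwards so each return reuses the one after it.
    for r in reversed(rewards):
        g = r + gamma * g
        returns.append(g)
    returns.reverse()
    return returns
```

In each case the stored estimate would then be moved a small step toward the target, for example `q[s][a] += alpha * (target - q[s][a])` with a hypothetical learning rate `alpha`.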
DQN
DQN: Simulation (left) and trajectory plot (right).
Q-Learning
Q-Learning: Simulation (left) and trajectory plot (right).
SARSA
SARSA: Simulation (left) and trajectory plot (right).
Monte Carlo
Monte Carlo: Simulation (left) and trajectory plot (right).
Random
Random: Simulation (left) and trajectory plot (right).
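For context on how the random baseline relates to the learned controllers, here is a minimal sketch of epsilon-greedy action selection. It assumes a discrete action space and a list of per-action value estimates; the name `select_action` and its parameters are hypothetical, not taken from this project. With `epsilon=1.0` it reduces to a uniformly random policy, and with `epsilon=0.0` it acts greedily on the learned values.

```python
import random


def select_action(q_values, epsilon, rng=random):
    """Pick a random action with probability epsilon, else the greedy one."""
    n_actions = len(q_values)
    if rng.random() < epsilon:
        # Explore: uniform choice over all actions.
        return rng.randrange(n_actions)
    # Exploit: index of the highest estimated value (first one on ties).
    return max(range(n_actions), key=lambda a: q_values[a])
```

Passing a seeded `random.Random` instance as `rng` makes rollouts reproducible, which helps when regenerating GIFs or trajectory plots.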