RL-based Control of a Lunar Lander

Lunar Landing Algorithm Comparison

We compare the performance of four reinforcement learning algorithms, plus a random baseline, for controlling lunar lander simulations:

  • DQN
  • Q-Learning
  • SARSA
  • Monte Carlo
  • Random
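
The learning methods above differ mainly in the target that each value estimate is moved toward. The following is a minimal sketch of those targets in tabular form, not the implementation used for these results. The dictionary-based Q table, the function names, the four-action set, and the values of gamma and alpha are all assumptions made for illustration. DQN uses the same kind of greedy bootstrap target as Q-Learning, but fits a neural network to it instead of updating a table entry.

```python
from collections import defaultdict


def q_learning_target(Q, reward, next_state, actions, gamma, done):
    """Off-policy target: bootstrap from the best action in next_state."""
    if done:
        return reward
    return reward + gamma * max(Q[(next_state, a)] for a in actions)


def sarsa_target(Q, reward, next_state, next_action, gamma, done):
    """On-policy target: bootstrap from the action actually taken next."""
    if done:
        return reward
    return reward + gamma * Q[(next_state, next_action)]


def monte_carlo_returns(rewards, gamma):
    """Discounted return G_t for every step of a finished episode."""
    returns, g = [], 0.0
    for r in reversed(rewards):
        g = r + gamma * g
        returns.append(g)
    returns.reverse()
    return returns


def td_update(Q, state, action, target, alpha):
    """Move Q(state, action) a fraction alpha toward the target."""
    Q[(state, action)] += alpha * (target - Q[(state, action)])


# Hypothetical example values; states are plain labels here.
Q = defaultdict(float)
actions = [0, 1, 2, 3]      # assumed discrete action set
Q[("s1", 2)] = 1.0          # pretend action 2 looks best in state s1

ql = q_learning_target(Q, 1.0, "s1", actions, gamma=0.5, done=False)
sa = sarsa_target(Q, 1.0, "s1", next_action=0, gamma=0.5, done=False)
mc = monte_carlo_returns([1.0, 0.0, 2.0], gamma=0.5)

td_update(Q, "s0", 1, ql, alpha=0.5)
```

With these values, Q-Learning bootstraps from the best next action while SARSA bootstraps from the next action actually chosen, so the two targets differ whenever the behavior policy explores. Monte Carlo waits for the episode to end and uses the full discounted return instead of a bootstrap.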

For each algorithm, we provide both a simulation GIF and a trajectory plot illustrating the landing path.
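
A trajectory plot of this kind can be built from the lander's position at each step of an episode. The sketch below assumes each observation is a sequence whose first two entries are the horizontal and vertical position, as in the Gymnasium LunarLander environment. The source does not state which simulator or observation layout was used, so that layout and the helper name `extract_trajectory` are assumptions.

```python
def extract_trajectory(observations):
    """Return parallel lists of x and y positions for one episode.

    Assumes obs[0] is the horizontal position and obs[1] the vertical
    position (an assumption about the observation layout).
    """
    xs = [float(obs[0]) for obs in observations]
    ys = [float(obs[1]) for obs in observations]
    return xs, ys


# Hypothetical three-step episode; only the first two entries are used.
episode = [
    (0.10, 1.40, 0.0, -0.5),
    (0.05, 0.70, 0.0, -0.4),
    (0.00, 0.00, 0.0, 0.0),
]
xs, ys = extract_trajectory(episode)
```

The two lists can then be passed to any plotting library as the x and y series of a line plot, one line per episode.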


DQN

DQN: Simulation (left) and trajectory plot (right).

Q-Learning

Q-Learning: Simulation (left) and trajectory plot (right).

SARSA

SARSA: Simulation (left) and trajectory plot (right).

Monte Carlo

Monte Carlo: Simulation (left) and trajectory plot (right).

Random

Random: Simulation (left) and trajectory plot (right).