reward +0.82 loss -0.27 policy stable
Move cursor near center to calibrate policy.
DRONE RL
Learning to Fly, Optimally.
work in progress
DRONE RL / DASHBOARD policy: sac_per_v3