A single‑file demo showing a pendulum environment and a simple policy trained via CEM. Click Train and watch it learn to keep the pendulum upright.