Level 01

Navigation / Escape (proof of concept)

Train the bot by giving it one feedback action per AI turn, and guide it from the center to the escape.

Turn cadence: 0.25 s movement + 0.75 s wait.
Controls: D reward · A punish · R reset
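
The cadence and the one-feedback-per-turn rule can be sketched as below. This is only an illustration under stated assumptions, not the level's actual code: the names `turn_and_phase` and `FeedbackGate` are hypothetical, and the sketch assumes each 1.0 s turn begins with the movement portion, which the page does not state.

```python
from dataclasses import dataclass

MOVE_SECONDS = 0.25  # from the page: movement portion of a turn
WAIT_SECONDS = 0.75  # from the page: wait portion of a turn
TURN_SECONDS = MOVE_SECONDS + WAIT_SECONDS


def turn_and_phase(elapsed: float) -> tuple[int, str]:
    """Map elapsed seconds to (turn index, phase).

    Assumption: movement comes first within each turn.
    """
    turn = int(elapsed // TURN_SECONDS)
    offset = elapsed - turn * TURN_SECONDS
    phase = "move" if offset < MOVE_SECONDS else "wait"
    return turn, phase


@dataclass
class FeedbackGate:
    """Hypothetical gate allowing one feedback action (reward or punish) per turn."""

    last_turn_used: int = -1

    def try_spend(self, turn: int) -> bool:
        # Reject a second feedback action within the same turn.
        if turn == self.last_turn_used:
            return False
        self.last_turn_used = turn
        return True
```

With a gate like this, a second D or A press in the same turn would be ignored, and the token would become available again once the turn index advances.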


Status: running
Time: 0.0s
Turn: 0
Phase: wait
Score: 230
Best: 0
Last action: idle
Feedback token: spent

Training controls

The bot policy is intentionally hidden. You see behavior and outcomes, not internal action weights.
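
To illustrate what a hidden policy means in practice, here is a minimal sketch of a bot whose action weights are private while its chosen action is observable. Everything in it is an assumption made for illustration: the class `HiddenPolicyBot`, the four-direction action set, and the multiplicative update factor are hypothetical and are not taken from the level itself.

```python
import random


class HiddenPolicyBot:
    """Hypothetical bot: weights stay private, only the chosen action is visible."""

    ACTIONS = ("up", "down", "left", "right")  # assumed action set

    def __init__(self, seed: int = 0) -> None:
        self._rng = random.Random(seed)
        self._weights = {action: 1.0 for action in self.ACTIONS}
        self.last_action = "idle"

    def act(self) -> str:
        """Sample one action in proportion to the private weights."""
        actions = list(self._weights)
        weights = [self._weights[a] for a in actions]
        self.last_action = self._rng.choices(actions, weights=weights, k=1)[0]
        return self.last_action

    def feedback(self, reward: bool) -> None:
        """Scale the weight of the last action up (reward) or down (punish)."""
        if self.last_action == "idle":
            return  # nothing to reinforce yet
        factor = 1.5 if reward else 1 / 1.5  # assumed update rule
        self._weights[self.last_action] *= factor
```

A player interacting with such a bot would call `act()` once per turn, watch the result, and then send at most one `feedback(True)` or `feedback(False)`, never reading `_weights` directly.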

death
escape