ArenaRL: Scaling RL for Open-Ended Agents via Tournament-based Relative Ranking