Search for a command to run...
Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models