Search for a command to run...
πππ-Bench: Evaluating Proactive Personal Assistant Agents in Long-Horizon Workflows