SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks