Search for a command to run...
LifeSciBench: Evaluating Language Models on Realistic, Expert-Level Tasks in the Life Sciences