Check out our 🔬ScienceAgentBench, a new benchmark to rigorously assess language agents for data-driven scientific discovery.