Scientific discovery is not a single chain of thought @GaoShanghua @AdaFang_ . It is a long-running process of competing hypotheses, failed experiments, shared insights, and changing research directions AutoScientists lets AI agents do the same https://t.co/JHAkbEx2Ac đź§Ş We call this AutoScientists: self-organizing agent teams for long-running scientific experimentation Open science: Paper: https://t.co/mzEx5xwtSE Code: https://t.co/1OLxN4AW94 @HarvardDBMI @harvardmed @broadinstitute @KempnerInst
Harvard Researchers Launch AutoScientists Using Self-Organizing Agent Teams
- BioML-Bench performance
- 74.4% mean leaderboard percentile
- GPT optimization speedup
- 1.9x faster than autoresearch
- ProteinGym improvement
- +12.5% Spearman correlation
- Biomedical tasks evaluated
- 24 tasks
- Availability
- Open-source GitHub repository
This decentralized approach addresses the local optima problem where single-agent loops—exemplified by Andrej Karpathy's autoresearch launch—often plateau. By allowing agents to critique each other and share failed results, the system mimics human scientific discovery, outperforming prior agents by 8.3% on biomedical benchmarks and optimizing GPT training 1.9x faster.
You can use the framework to automate research in drug discovery, protein engineering, and ML optimization. The project is open science, with the research paper and code available for deployment. It provides a blueprint for building resilient multi-agent systems that sustain autonomous experimentation over extended periods without human intervention.
Still wondering? A few quick answers below.



