1. **A Scientific Human\-Agent Reproduction Pipeline**, Joschka Birk, Gregor Kasieczka, Siddharth Mishra\-Sharma, Benjamin Nachman, Dennis Noll, and Tanvi Wamorkar (2026) [arXiv:2604.18752](https://arxiv.org/abs/2604.18752)

2. **PRL\-Bench: A Comprehensive Benchmark Evaluating LLMs' Capabilities in Frontier Physics Research**, Tingjia Miao, and others (2026) [arXiv:2604.15411](https://arxiv.org/abs/2604.15411)
