1. A Scientific Human-Agent Reproduction Pipeline, Joschka Birk, Gregor Kasieczka, Siddharth Mishra-Sharma, Benjamin Nachman, Dennis Noll, and Tanvi Wamorkar, (2026), arXiv:2604.18752
2. PRL-Bench: A Comprehensive Benchmark Evaluating LLMs' Capabilities in Frontier Physics Research, Tingjia Miao et al., (2026), arXiv:2604.15411