
Columbia PhD (2024). Lead author on PRIME-RL (async & decentralized RL training at scale) and Verifiers (environments for LLM RL). Focus: multi-turn reasoning in LLM agents, credit assignment, distributed RL. Published at NeurIPS/ICLR. Ex-Morgan Stanley. willcb.com
Will Brown is Research Lead at Prime Intellect and the creator/maintainer of the open-source `verifiers` library and the Environments Hub, making him one of the most hands-on practitioners in agentic reinforcement learning. His session offers direct insight into the infrastructure decisions and research driving multi-turn RL training at scale.
RL Environments at Scale – Will Brown, Prime Intellect
AI Engineer Code Summit · Nov 2025
How Prime Intellect Builds Scalable Infrastructure for Agentic RL
Ray Summit 2025 · Nov 2025
Training Agentic Reasoners — Will Brown, Prime Intellect
CO/AI · Jul 2025
Open Questions in Agentic RL — Will Brown (Prime Intellect)
Intelligence Unbound · May 2025
Public activity researched automatically · as of Jun 2026