PRIME-RL: Async & Decentralized RL Training at Scale

SessionEngineering tracktentative

PRIME-RL: Async & Decentralized RL Training at Scale

Day
Day 3 — Session Day 2
Time
1:55pm-2:15pm
Room
Track 9
Track
Posttraining & Midtraining

Accessible with the Engineering pass and above.

About this session

Will Brown (Researcher at Prime Intellect) covers post-training for LLM agents: multi-turn reasoning, credit assignment, distributed RL, PRIME-RL, and verifier-driven environments for LLM RL.

Speaker