Intelligent Model Routing: Frontier Performance Without Frontier Bills

SessionLeadership trackconfirmed

Intelligent Model Routing: Frontier Performance Without Frontier Bills

Day: Day 3 — Session Day 2
Time: 2:50pm-3:10pm
Room: Leadership 2
Track: Sandbox & Platform Engineering

Accessible with the Leadership (All-Access) pass and above.

About this session

It is Summer 2026 and the world is burning for token consumption—figuratively and literally. Accelerating frontier model capabilities increasingly allow agents to operate across long-running, highly parallelized tasks at exponential inference growth. In this talk, I explain how dynamic model routing—intelligently directing agent requests to the best-suited model at the best price—can reduce token costs by up to 90% while maintaining or improving accuracy. I walk through how routing works, when it doesn't, and why the world (and your agent) need routing to scale intelligence to infinity and beyond.

Topics

LLM Production InfraAI ArchitectsCoding AgentsClaws (OpenClaw, Personal Agents)

Speaker

Tomás Hernando Kofman

Notdiamond