Preferences > Benchmarks: Model Routing for How Teams Actually Build: Model Routing for How Teams Actually Build

SessionLeadership trackconfirmed

Preferences > Benchmarks: Model Routing for How Teams Actually Build: Model Routing for How Teams Actually Build

Day: Day 4 — Session Day 3
Time: 12:05pm-12:25pm
Room: Leadership 2
Track: AI Architects: AI Factories

Accessible with the Leadership (All-Access) pass and above.

About this session

There is no best model. There's only the right model for a given task, and the right model depends on your team's preferences, not a benchmark score. This talk makes the case for preference-aligned routing: choosing models by the constraints that actually matter — cost, latency, task type, model preference — instead of a single leaderboard number. We'll demo a sub-200ms routing decision running on a purpose-built 30B MoE model with no application code changes, walk through real coding workflows routing most traffic to open models without losing accuracy, and show where this goes next: evals, caching, and personalization.

Speakers

Archana Kamath

Tyler Gillam