Vertical Mobility: Building an AI Inference Platform That Scales from MVP to Trillion-Parameter Workloads

SessionEngineering trackconfirmed

Vertical Mobility: Building an AI Inference Platform That Scales from MVP to Trillion-Parameter Workloads

Day: Day 4 — Session Day 3
Time: 12:05pm-12:25pm
Room: Track 9
Track: Inference

Accessible with the Engineering pass and above.

About this session

The future of AI inference is not one-size-fits-all. This talk explores a multi-tiered architecture that supports the full AI lifecycle, from rapid, pay-per-token experimentation to dedicated, SLO-bound production and extreme-scale, self-managed deployments. Learn about lessons learned from CoreWeave’s inference stack as performance, cost, and control requirements evolve.

Topics

LLM Production Infra

Speakers