The unreasonable effectiveness of BM25 for agentic search

SessionEngineering trackconfirmed

The unreasonable effectiveness of BM25 for agentic search

Day: Day 2 — Session Day 1
Time: 11:10am-11:30am
Room: Track 3
Track: Search & Retrieval

Accessible with the Engineering pass and above.

About this session

GPT-5 is shockingly good at search, and that changes the "BM25 as a baseline" story. Using GPT-5 search trajectories from BrowseComp-Plus, I'll show how default BM25 parameters and evaluation harnesses can make lexical retrieval look weak, while real agent queries often play directly to BM25's strengths. Much like grep became a core retrieval primitive for coding agents, BM25 is re-emerging as a powerful primitive for agentic search.

Topics

Search & Retrieval (RAG, Deep Research, Web search)

Speaker

Jo Kristian Bergum

CEO & co-founder · Hornet.dev