
Filip Makraduli is an applied AI researcher and founding ML Developer Relations engineer at Superlinked, where he designs and ships small‑LLM inference systems for search, retrieval, and agents in production. He holds a master’s degree in Biomedical Data Science from Imperial College London. Before Superlinked, Filip worked in machine learning, data science, and developer relations roles across early‑stage AI startups and larger enterprises, building language understanding, retrieval‑augmented generation (RAG), and LLM pipeline tooling while partnering closely with product and platform teams. He is a frequent open‑source contributor, with contributions to kernel libraries, model‑inference providers, and hands‑on demos used by practitioners. Filip is a co‑author of several publications on efficient transformer architectures and inference, including work on faster normalization for LLMs. He is an experienced speaker at meetups and conferences such as AI Engineer Europe and Berlin Buzzwords, sharing practical lessons on efficient transformers, retrieval systems, and embedding inference for production AI teams.
Filip is an ML/DevRel engineer at Superlinked focused on the unglamorous-but-critical layer of search and retrieval: small-model inference infrastructure, embedding-model performance, and mixture-of-encoders retrieval. Attend his session for concrete, production-grade engineering (GPU model hot-swapping, variable-length flash attention, multimodal encoders) rather than high-level AI hype.
Public activity researched automatically · as of Jun 2026