Ahmad Osman

AI researcher in SF · r/LocalLLaMA

Twitter / X

Bio

r/LocalLLaMA moderator and AI researcher in San Francisco; known for building a 14x RTX 3090 rig.

Sessions (2)

Day 111:05am-12:05pmTrack 6

Local LLMs and workstation agents: Part 1

Day 112:10pm-1:10pmTrack 6

Local LLMs and workstation agents: Part 2

Ahmad Osman is a hands-on AI infrastructure practitioner who has built a self-hosted 8x RTX 3090 inference server and moderates the GPUs community on r/LocalLLaMA, giving him rare ground-level credibility on running LLMs at scale outside the cloud. His session is worth attending for engineers who want practical, hardware-grounded insights on local AI inference rather than theoretical overviews.

GitHub

@TheAhmadOsman

Recent writing (3)

GPU Memory Math for LLMs (2026 Edition) · blog · May 2026
First Came The Tokenizer—Understanding The Unsung Hero of LLMs · blog · Jun 2025
So You Want to Learn LLMs? Here's the Roadmap · blog · Jun 2025

Public activity researched automatically · as of Jun 2026