
r/LocalLLaMA moderator and AI researcher in San Francisco; known for building a 14x RTX 3090 rig.
Ahmad Osman is a hands-on AI infrastructure practitioner who has built a self-hosted 8x RTX 3090 inference server and moderates the GPUs community on r/LocalLLaMA, giving him rare ground-level credibility on running LLMs at scale outside the cloud. His session is worth attending for engineers who want practical, hardware-grounded insights on local AI inference rather than theoretical overviews.
Public activity researched automatically · as of Jun 2026