Qingyang Wu

Staff Research Scientist · Together AI

Twitter / X

Session (1)

Day 19:00am-11:00amTrack 8

Open-Source Inference Engineering for the Agentic Era

Qingyang Wu is a Research Scientist at Together AI (and a Columbia University PhD) who has been a contributor to high-impact open-source RL post-training projects including DeepSWE (a state-of-the-art open-weight coding agent, which he co-led) and DeepCoder, as well as research on accelerating RL training via distribution-aware speculative decoding (an oral presentation at MLSys 2026). Attendees of his session can expect practical, systems-level insights on making reinforcement learning for LLMs faster and more scalable.

GitHub

@qywu

Recent writing (5)

DeepSWE: Training a Fully Open-sourced, State-of-the-Art Coding Agent by Scaling RL · blog · Jul 2025
DeepCoder: A Fully Open-Source 14B Coder at o3-mini Level · blog · Apr 2025
Beat the long tail: Distribution-Aware Speculative Decoding for RL Training · paper · Nov 2025
Think Deep, Think Fast: Investigating Efficiency of Verifier-free Inference-time-scaling Methods · paper · Apr 2025
V1: Unifying Generation and Self-Verification for Parallel Reasoners · paper · Mar 2026

Public activity researched automatically · as of Jun 2026