Zekun Qi
First-listed equal-contribution author and likely primary technical architect of Humanoid-GPT.
“Humanoid-GPT trains on a 2-billion-frame corpus, representing a 200x scale-up... achieves a 92.58% tracking success rate across unseen motions.”
Source→“their largest model (Humanoid-GPT-L, 80.4M parameters, trained on 2B frames) achieves a 92.58% tracking success rate across unseen motions, compared to MLP baselines trained on 6–9M frames that hover around 76–81% (Table 2). This is a 16+ point gap in real deployment stability.”
Source→“Our tracker reproduces these motions in real time without any task-specific fine-tuning, demonstrating strong zero-shot transfer from simulation to the real world.”
Source→“Zekun Qi — Tsinghua University. Co-author, also appears on SoFar (language-grounded spatial reasoning for manipulation) and ShapeLLM (3D object understanding for embodied interaction).”
Source→AI-extracted from podcast / newsletter / paper summaries. May contain errors.