2026 Small LLMs: Pruning vs. Training from Scratch Yufeng Xu, Taiming Lu, Kunjun Li, and 3 more authors 2026 Code Multi-Token Residual Prediction Yufeng Xu, Zishuo Bao, Qian Wang, and 6 more authors 2026 Code Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning Dylan Zhang, Yufeng Xu, Haojin Wang, and 2 more authors In ICML, 2026 Paper