介绍 SmolLM 模型的架构与训练方法
参考资料
论文
SmolLM 2:SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Modelarrow-up-right
SmolLM 3:SmolLM3: smol, multilingual, long-context reasonerarrow-up-right
Last updated 46 minutes ago
Was this helpful?