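[{"data":1,"prerenderedAt":-1},["ShallowReactive",2],{"news-596723f0-aad3-4bdc-ba42-76b0f7437b52":3},{"id":4,"title":5,"summary":6,"original_url":7,"source_id":8,"tags":9,"published_at":23,"created_at":24,"modified_at":25,"is_published":26,"publish_type":27,"image_url":13,"view_count":28},"596723f0-aad3-4bdc-ba42-76b0f7437b52","Ant Bailing Ling-2.6-flash: The 'Token Efficiency' Revolution Behind a 104B-Parameter Model","## Technical Background\n\nOn April 22, Bailing, Ant Group's large-model family, officially released Ling-2.6-flash, an Instruct model with 104B total parameters and 7.4B active parameters. The release marks an important step for Chinese-developed large models toward deployment at scale. Rather than pursuing parameter count alone, Ling-2.6-flash centers on the idea of **\"Token efficiency\"**, reflecting a strategic shift in large-model development from \"how big\" to \"how refined\".\n\n## Core Technical Highlights\n\n**Hybrid linear architecture optimization**: Ling-2.6-flash retains the hybrid linear architecture of Ling 2.5 and, while keeping its intelligence level competitive, significantly improves Token efficiency through architectural innovation. The design supports efficient inference on a four-card H20 setup, substantially lowering the barrier to enterprise deployment.\n\n**SOTA performance**: The model reaches state-of-the-art results for its size class on several agent-related benchmarks, including BFCL-V4, TAU2-bench, SWE-bench Verified, Claw-Eval, and PinchBench, demonstrating strong capability in real-world application scenarios.\n\n## Market Impact and Industry Significance\n\nLing-2.6-flash is priced very competitively: US$0.1 per million input tokens and US$0.3 per million output tokens. This price point reflects Ant Bailing's confidence in its technology and opens the door to commercial enterprise applications. More notably, its anonymous test version, Elephant Alpha, reached a daily average call volume on the order of 100B tokens within a week of launch and topped the OpenRouter Trending list for several consecutive days, indicating strong market recognition of its technical capabilities.","https:\u002F\u002F36kr.com\u002Fnewsflashes\u002F3777678069044231","5e4fd3d1-9cb4-44a6-bae5-9ffb449c05c1",[10,14,17,20],{"id":11,"name":12,"slug":12,"description":13,"color":13},"471c51be-e620-49df-bd6c-0b5504f53f00","ant-group",null,{"id":15,"name":16,"slug":16,"description":13,"color":13},"f72d264d-7fa0-458b-8d43-0ec5168d69db","instruct-model",{"id":18,"name":19,"slug":19,"description":13,"color":13},"01598627-1ea6-4b27-a5d8-874971571a71","llm",{"id":21,"name":22,"slug":22,"description":13,"color":13},"045c011e-e2bb-45ce-bdd6-0c927f8a3b87","token-efficiency","2026-04-22T10:03:00Z","2026-04-22T10:06:59.776503Z","2026-04-22T10:06:59.776530Z",true,"agent",3]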