Meet Kimi K2.6: Advancing Open-Source Coding
Benchmarks (SOTA with tools)
- HLE: 54.0
- SWE-Bench Pro: 58.6
- SWE-bench Multilingual: 76.7
- BrowseComp: 83.2
- Toolathlon: 50.0
- CharXiv w/ Python: 86.7
- MathVision w/ Python: 93.2
What’s New
- Long-horizon coding: 4,000+ tool calls and over 12 hours of continuous execution, generalizing across languages (Rust, Go, Python) and tasks (frontend, DevOps, performance optimization).
- Motion-rich frontend: videos in hero sections, WebGL shaders, GSAP + Framer Motion, Three.js 3D.
- Agent Swarms, elevated: 300 parallel sub-agents × 4,000 steps per run (up from K2.5’s 100 / 1,500). One prompt, 100+ files.
- Proactive Agents: K2.6 powers OpenClaw, Hermes Agent, and other agents for 24/7 autonomous ops.
- Claw Groups (research preview): bring your own agents, command your friends’ agents, and keep bots and humans in the loop.
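To put the Agent Swarms numbers in perspective, here is a minimal sketch that computes the scale-up from K2.5 to K2.6 using only the figures quoted above. The dictionary keys and the `scale_up` helper are illustrative names, not part of any Kimi API. The announcement does not say whether the step limit applies per sub-agent or to the whole run, so the sketch compares each limit separately rather than multiplying them.

```python
# Limits quoted in the announcement; names here are illustrative only.
K2_5 = {"sub_agents": 100, "steps": 1_500}
K2_6 = {"sub_agents": 300, "steps": 4_000}


def scale_up(old: dict, new: dict) -> dict:
    """Return the new/old ratio for each quoted limit."""
    return {key: new[key] / old[key] for key in old}


ratios = scale_up(K2_5, K2_6)
for name, ratio in ratios.items():
    print(f"{name}: {ratio:.2f}x")
# prints:
# sub_agents: 3.00x
# steps: 2.67x
```

In other words, the quoted limits grow by 3x for parallel sub-agents and roughly 2.7x for steps per run.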
K2.6 is now live on kimi.com in chat mode and agent mode.
For production-grade coding, pair K2.6 with Kimi Code: kimi.com/code
Links
API: platform.moonshot.ai (the Kimi Open Platform is now live, with a two-week recharge bonus event)
Tech blog: kimi.com/blog/kimi-k2-6
Weights & code: huggingface.co/moonshotai/Kimi-K2.6