[{"data":1,"prerenderedAt":-1},["ShallowReactive",2],{"article-kimi-k26-open-source-coding-agents-en":3,"article-related-kimi-k26-open-source-coding-agents-en":33,"series-industry-0c07ccc4-690c-475d-a36a-aad13c2756d4":78},{"id":4,"slug":5,"title":6,"content":7,"summary":8,"source":9,"source_url":10,"author":11,"image_url":12,"cover_image":12,"category":13,"language":14,"translated_content":11,"related_article_id":15,"keywords":16,"key_takeaways":25,"views":29,"created_at":30,"published_at":31,"topic_cluster_id":32},"0c07ccc4-690c-475d-a36a-aad13c2756d4","kimi-k26-open-source-coding-agents-en","Kimi K2.6 turns open-source coding into agents","\u003Cp data-speakable=\"summary\">Kimi K2.6 improves open-source coding with longer runs, faster tool use, and larger \u003Ca href=\"\u002Ftag\u002Fagent\">agent\u003C\u002Fa> swarms.\u003C\u002Fp>\u003Cp>Kimi’s K2.6 release is built for developers who want more than code completion: it ships with long-horizon execution, agent swarms, and \u003Ca href=\"\u002Ftag\u002Fbenchmark\">benchmark\u003C\u002Fa> gains that show up in real workflows. One internal run used 4,000+ tool calls over 12 hours and pushed throughput from about 15 to 193 tokens\u002Fsec.\u003C\u002Fp>\u003Ctable>\u003Cthead>\u003Ctr>\u003Cth>Item\u003C\u002Fth>\u003Cth>Scale\u003C\u002Fth>\u003Cth>Reported gain\u003C\u002Fth>\u003C\u002Ftr>\u003C\u002Fthead>\u003Ctbody>\u003Ctr>\u003Ctd>Kimi K2.6 long-horizon coding\u003C\u002Ftd>\u003Ctd>4,000+ tool calls, 12+ hours\u003C\u002Ftd>\u003Ctd>~15 to ~193 tokens\u002Fsec\u003C\u002Ftd>\u003C\u002Ftr>\u003Ctr>\u003Ctd>exchange-core optimization\u003C\u002Ftd>\u003Ctd>1,000+ tool calls, 13 hours\u003C\u002Ftd>\u003Ctd>0.43 to 1.24 MT\u002Fs medium throughput\u003C\u002Ftd>\u003C\u002Ftr>\u003Ctr>\u003Ctd>Agent Swarm\u003C\u002Ftd>\u003Ctd>300 sub-agents, 4,000 steps\u003C\u002Ftd>\u003Ctd>Up from 100 sub-agents and 1,500 steps\u003C\u002Ftd>\u003C\u002Ftr>\u003Ctr>\u003Ctd>CodeBuddy eval\u003C\u002Ftd>\u003Ctd>Internal benchmark\u003C\u002Ftd>\u003Ctd>+12% code generation accuracy, 96.60% tool success\u003C\u002Ftd>\u003C\u002Ftr>\u003C\u002Ftbody>\u003C\u002Ftable>\u003Ch2>1. Long-horizon coding that keeps going\u003C\u002Fh2>\u003Cp>K2.6 is aimed at tasks that last hours, not minutes. The blog says it handles front-end work, devops, performance tuning, and language shifts across \u003Ca href=\"\u002Ftag\u002Frust\">Rust\u003C\u002Fa>, Go, and Python with better generalization than K2.5.\u003C\u002Fp>\n\u003Cfigure class=\"my-6\">\u003Cimg src=\"https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1781667191813-f80q.png\" alt=\"Kimi K2.6 turns open-source coding into agents\" class=\"rounded-xl w-full\" loading=\"lazy\" \u002F>\u003C\u002Ffigure>\n\u003Cp>That matters when the model has to recover from dead ends, revisit earlier decisions, and keep the project moving without a human stepping in after every failure. In beta tests, K2.6 was described as better at navigating nuanced API behavior and staying productive longer before hitting a wall.\u003C\u002Fp>\u003Cul>\u003Cli>4,000+ tool calls in one run\u003C\u002Fli>\u003Cli>12+ hours of continuous execution\u003C\u002Fli>\u003Cli>14 iterations in a local Mac deployment test\u003C\u002Fli>\u003C\u002Ful>\u003Ch2>2. Faster tool use in real codebases\u003C\u002Fh2>\u003Cp>One of the clearest examples in the post is the local deployment of Qwen3.5-0.8B on a Mac. K2.6 implemented and optimized \u003Ca href=\"\u002Ftag\u002Finference\">inference\u003C\u002Fa> in Zig, then raised throughput from about 15 tokens\u002Fsec to 193 tokens\u002Fsec.\u003C\u002Fp>\u003Cp>The same theme appears in the exchange-core case, where K2.6 analyzed flame graphs, changed thread topology, and edited more than 4,000 lines of code. The result was a 185% lift in medium throughput and a 133% gain in performance throughput.\u003C\u002Fp>\u003Cul>\u003Cli>Qwen3.5-0.8B deployed locally on Mac\u003C\u002Fli>\u003Cli>Zig used for inference optimization\u003C\u002Fli>\u003Cli>exchange-core thread topology changed from 4ME+2RE to 2ME+1RE\u003C\u002Fli>\u003C\u002Ful>\u003Ch2>3. Agent swarms that split work across many specialists\u003C\u002Fh2>\u003Cp>Kimi describes Agent Swarm as scaling out instead of only scaling up. K2.6 can decompose a task into specialized sub-agents that run in parallel, then combine search, research, writing, and content generation into one run.\u003C\u002Fp>\n\u003Cfigure class=\"my-6\">\u003Cimg src=\"https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1781667171664-smqt.png\" alt=\"Kimi K2.6 turns open-source coding into agents\" class=\"rounded-xl w-full\" loading=\"lazy\" \u002F>\u003C\u002Ffigure>\n\u003Cp>The reported scale is large: up to 300 sub-agents and 4,000 coordinated steps. K2.5’s research preview reached 100 sub-agents and 1,500 steps, so the new version is not just a small tuning pass. It is meant for multi-output jobs like documents, websites, slides, and spreadsheets.\u003C\u002Fp>\u003Cul>\u003Cli>300 sub-agents executing in parallel\u003C\u002Fli>\u003Cli>4,000 coordinated steps\u003C\u002Fli>\u003Cli>Outputs can include docs, slides, spreadsheets, and websites\u003C\u002Fli>\u003C\u002Ful>\u003Ch2>4. Coding-driven design for front ends and light full-stack work\u003C\u002Fh2>\u003Cp>K2.6 is not limited to backend or terminal tasks. The blog shows it turning prompts into structured interfaces with hero sections, animation, and interactive elements, while also handling simple full-stack flows such as authentication, user interaction, and database operations.\u003C\u002Fp>\u003Cp>That makes it useful for teams that need a fast first pass on product pages or internal tools. The internal Kimi Design Bench covers visual input tasks, landing pages, full-stack apps, and creative programming, and K2.6 is reported to perform well across those categories.\u003C\u002Fp>\u003Cul>\u003Cli>Landing page construction\u003C\u002Fli>\u003Cli>Full-stack application development\u003C\u002Fli>\u003Cli>Image and video generation tool use for richer assets\u003C\u002Fli>\u003C\u002Ful>\u003Ch2>5. Benchmarks and partner feedback point to stronger reliability\u003C\u002Fh2>\u003Cp>The post includes several external and internal signals that K2.6 is more dependable than K2.5. CodeBuddy reports a 12% rise in code generation accuracy, an 18% gain in long-context stability, and a 96.60% tool invocation success rate.\u003C\u002Fp>\u003Cp>Partner quotes also emphasize better instruction following, more careful task decomposition, and stronger performance on long multi-step sessions. For teams choosing an open model for \u003Ca href=\"\u002Ftag\u002Fagentic-coding\">agentic coding\u003C\u002Fa>, the pattern is clear: K2.6 is positioned as a safer pick when the job is complex, long, and expensive to retry.\u003C\u002Fp>\u003Cul>\u003Cli>Code generation accuracy: +12%\u003C\u002Fli>\u003Cli>Long-context stability: +18%\u003C\u002Fli>\u003Cli>Tool invocation success rate: 96.60%\u003C\u002Fli>\u003C\u002Ful>\u003Ch2>How to decide\u003C\u002Fh2>\u003Cp>Pick K2.6 if your work involves long-running coding tasks, multi-step tool use, or agent workflows that need to keep state across many iterations. It is also the better fit if \u003Ca href=\"\u002Fnews\u002Fmlops-is-not-optional-for-production-ml-en\">you want\u003C\u002Fa> open-source code generation that can stretch into design, docs, and light full-stack delivery.\u003C\u002Fp>\u003Cp>If your needs are narrower, a smaller model may be enough. But if you care about sustained execution, larger swarms, and fewer interruptions during complex engineering work, K2.6 is the one in this release that most clearly targets that job.\u003C\u002Fp>","5 ways Kimi K2.6 improves open-source coding, from 4,000+ tool calls to 300-agent swarms and faster long-horizon execution.","www.kimi.com","https:\u002F\u002Fwww.kimi.com\u002Fblog\u002Fkimi-k2-6",null,"https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1781667191813-f80q.png","industry","en","93960de8-e70d-4541-8b1a-b7a3860ca5ac",[17,18,19,20,21,22,23,24],"Kimi K2.6","open-source coding","agent swarms","long-horizon execution","tool calling","benchmark","CodeBuddy","Kimi Code",[26,27,28],"K2.6 is built for long coding sessions with 4,000+ tool calls and hours of continuous execution.","Its Agent Swarm can scale to 300 sub-agents and 4,000 coordinated steps.","Internal and partner benchmarks point to better accuracy, stability, and tool success than K2.5.",0,"2026-06-17T03:32:23.749693+00:00","2026-06-17T03:32:23.741+00:00","d19fc184-5852-4c4d-9ec0-db0c4841ac17",{"tags":34,"relatedLang":37,"relatedPosts":41},[35],{"name":21,"slug":36},"tool-calling",{"id":15,"slug":38,"title":39,"language":40},"kimi-k26-open-source-coding-agents-zh","Kimi K2.6 把開源寫碼推向代理工作流","zh",[42,48,54,60,66,72],{"id":43,"slug":44,"title":45,"cover_image":46,"image_url":46,"created_at":47,"category":13},"bb949245-e6b8-4c22-8c62-967ef7de7914","qualcomm-bets-on-ai-devices-over-apps-en","Qualcomm is right to bet on AI devices, not just AI apps","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1781675270238-6cwm.png","2026-06-17T05:47:28.170103+00:00",{"id":49,"slug":50,"title":51,"cover_image":52,"image_url":52,"created_at":53,"category":13},"f083416d-8e9c-4774-90aa-df99f13fdaf2","china-open-source-ai-pressure-us-labs-en","China’s Open-Source AI Play Is Pressuring U.S. Labs","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1781668083197-jbio.png","2026-06-17T03:47:35.042626+00:00",{"id":55,"slug":56,"title":57,"cover_image":58,"image_url":58,"created_at":59,"category":13},"0f44e556-64c9-4bd5-880a-78d025607de2","free-open-source-software-powers-computing-en","Free and open-source software powers modern computing","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1781665372839-xxr1.png","2026-06-17T03:02:21.462795+00:00",{"id":61,"slug":62,"title":63,"cover_image":64,"image_url":64,"created_at":65,"category":13},"103ce84a-f91b-473c-8bb9-5f1e83f6e681","openalternative-software-replacement-comparison-en","OpenAlternative makes software replacement easier to compare","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1781662678272-242k.png","2026-06-17T02:17:27.620296+00:00",{"id":67,"slug":68,"title":69,"cover_image":70,"image_url":70,"created_at":71,"category":13},"f6eaeaff-a18f-4cfc-ae8f-01ce6daf66e6","james-ii-project-adds-tuesday-meal-site-en","James II Project adds a Tuesday meal site","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1781659988994-1z63.png","2026-06-17T01:32:42.419936+00:00",{"id":73,"slug":74,"title":75,"cover_image":76,"image_url":76,"created_at":77,"category":13},"d204283d-bd6c-4113-b603-a604fe071377","databricks-model-serving-adapts-not-tuned-by-hand-en","Databricks is right: model serving should adapt, not be tuned by hand","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1781659072726-ew6p.png","2026-06-17T01:17:22.175898+00:00",[79,84,89,94,99,104,109,114,119,124],{"id":80,"slug":81,"title":82,"created_at":83},"d35a1bd9-e709-412e-a2df-392df1dc572a","ai-impact-2026-developments-market-en","AI's Impact in 2026: Key Developments and Market Shifts","2026-03-25T16:20:33.205823+00:00",{"id":85,"slug":86,"title":87,"created_at":88},"5ed27921-5fd6-492e-8c59-78393bf37710","trumps-ai-legislative-framework-en","Trump's AI Legislative Framework: What's Inside?","2026-03-25T16:22:20.005325+00:00",{"id":90,"slug":91,"title":92,"created_at":93},"e454a642-f03c-4794-b185-5f651aebbaca","nvidia-gtc-2026-key-highlights-innovations-en","NVIDIA GTC 2026: Key Highlights and Innovations","2026-03-25T16:22:47.882615+00:00",{"id":95,"slug":96,"title":97,"created_at":98},"0ebb5b16-774a-4922-945d-5f2ce1df5a6d","claude-usage-diversifies-learning-curves-en","Claude Usage Diversifies, Learning Curves Emerge","2026-03-25T16:25:50.770376+00:00",{"id":100,"slug":101,"title":102,"created_at":103},"69934e86-2fc5-4280-8223-7b917a48ace8","openclaw-ai-commoditization-concerns-en","OpenClaw's Rise Raises Concerns of AI Model Commoditization","2026-03-25T16:26:30.582047+00:00",{"id":105,"slug":106,"title":107,"created_at":108},"b4b2575b-2ac8-46b2-b90e-ab1d7c060797","google-gemini-ai-rollout-2026-en","Google's Gemini AI Rollout Extended to 2026","2026-03-25T16:28:14.808842+00:00",{"id":110,"slug":111,"title":112,"created_at":113},"6e18bc65-42ae-4ad0-b564-67d7f66b979e","meta-llama4-fabricated-results-scandal-en","Meta's Llama 4 Scandal: Fabricated AI Test Results Unveiled","2026-03-25T16:29:15.482836+00:00",{"id":115,"slug":116,"title":117,"created_at":118},"bf888e9d-08be-4f47-996c-7b24b5ab3500","accenture-mistral-ai-deployment-en","Accenture and Mistral AI Team Up for AI Deployment","2026-03-25T16:31:01.894655+00:00",{"id":120,"slug":121,"title":122,"created_at":123},"5382b536-fad2-49c6-ac85-9eb2bae49f35","mistral-ai-high-stakes-2026-en","Mistral AI: Facing High Stakes in 2026","2026-03-25T16:31:39.941974+00:00",{"id":125,"slug":126,"title":127,"created_at":128},"9da3d2d6-b669-4971-ba1d-17fdb3548ed5","cursors-meteoric-rise-pressures-en","Cursor's Meteoric Rise Faces Industry Pressures","2026-03-25T16:32:21.899217+00:00"]