[{"data":1,"prerenderedAt":-1},["ShallowReactive",2],{"article-kimi-long-context-models-moonshot-ai-en":3,"article-related-kimi-long-context-models-moonshot-ai-en":30,"series-model-release-6288131d-64e3-47ff-aeec-add641c952e2":79},{"id":4,"slug":5,"title":6,"content":7,"summary":8,"source":9,"source_url":10,"author":11,"image_url":12,"cover_image":12,"category":13,"language":14,"translated_content":11,"related_article_id":15,"keywords":16,"key_takeaways":22,"views":26,"created_at":27,"published_at":28,"topic_cluster_id":29},"6288131d-64e3-47ff-aeec-add641c952e2","kimi-long-context-models-moonshot-ai-en","Kimi’s long-context push keeps getting bigger","\u003Cp data-speakable=\"summary\">\u003Ca href=\"\u002Ftag\u002Fmoonshot-ai\">Moonshot AI\u003C\u002Fa>’s Kimi chatbot has grown from a long-context assistant into a family of large agentic models.\u003C\u002Fp>\u003Cp>\u003Ca href=\"https:\u002F\u002Fwww.kimi.com\" target=\"_blank\" rel=\"noopener\">Kimi\u003C\u002Fa> started in October 2023 as Moonshot AI’s answer to the long-context problem, and it got attention fast because the first public version could handle 128,000 tokens. By January 2026, the line had expanded into \u003Ca href=\"https:\u002F\u002Fwww.kimi.com\" target=\"_blank\" rel=\"noopener\">Kimi K2.5\u003C\u002Fa>, a 1 trillion parameter mixture-of-experts model with 32 billion active parameters.\u003C\u002Fp>\u003Ctable>\u003Cthead>\u003Ctr>\u003Cth>Version\u003C\u002Fth>\u003Cth>Release\u003C\u002Fth>\u003Cth>Key number\u003C\u002Fth>\u003Cth>Why it matters\u003C\u002Fth>\u003C\u002Ftr>\u003C\u002Fthead>\u003Ctbody>\u003Ctr>\u003Ctd>Kimi chatbot\u003C\u002Ftd>\u003Ctd>November 2023\u003C\u002Ftd>\u003Ctd>128,000 tokens\u003C\u002Ftd>\u003Ctd>First public release with ultra-long context\u003C\u002Ftd>\u003C\u002Ftr>\u003Ctr>\u003Ctd>Kimi Explore Edition\u003C\u002Ftd>\u003Ctd>October 2024\u003C\u002Ftd>\u003Ctd>36 million+ MAU\u003C\u002Ftd>\u003Ctd>Search-driven edition reached mass usage\u003C\u002Ftd>\u003C\u002Ftr>\u003Ctr>\u003Ctd>Kimi K2\u003C\u002Ftd>\u003Ctd>July 2025\u003C\u002Ftd>\u003Ctd>1 trillion parameters, 32 billion active\u003C\u002Ftd>\u003Ctd>Open-weight model with strong coding results\u003C\u002Ftd>\u003C\u002Ftr>\u003Ctr>\u003Ctd>Kimi K2.5\u003C\u002Ftd>\u003Ctd>January 2026\u003C\u002Ftd>\u003Ctd>1 trillion parameters, 32 billion active\u003C\u002Ftd>\u003Ctd>Added multimodal and agentic features\u003C\u002Ftd>\u003C\u002Ftr>\u003C\u002Ftbody>\u003C\u002Ftable>\u003Ch2>Moonshot AI built Kimi around long context first\u003C\u002Fh2>\u003Cp>\u003Ca href=\"https:\u002F\u002Fwww.moonshot.cn\" target=\"_blank\" rel=\"noopener\">Moonshot AI\u003C\u002Fa> was founded in March 2023, and Kimi arrived later that year as a chatbot built for very long inputs. That focus mattered because most assistants in 2023 still struggled once a conversation or document got too large.\u003C\u002Fp>\n\u003Cfigure class=\"my-6\">\u003Cimg src=\"https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1782231491199-wiwi.png\" alt=\"Kimi’s long-context push keeps getting bigger\" class=\"rounded-xl w-full\" loading=\"lazy\" \u002F>\u003C\u002Ffigure>\n\u003Cp>The original Kimi release in November 2023 supported 128,000 tokens of context, which made it one of the first public models to handle that scale. In practical terms, that meant users could paste long papers, codebases, or dense research notes without chopping them into tiny pieces.\u003C\u002Fp>\u003Cp>Moonshot pushed that idea further in March 2024 with a beta update that supported a 2 million character context window. Then in July 2024, the company opened public beta for context caching, a feature aimed at making repeated long prompts cheaper and faster to process.\u003C\u002Fp>\u003Cul>\u003Cli>October 2023: closed beta begins\u003C\u002Fli>\u003Cli>November 2023: public release with 128,000 tokens\u003C\u002Fli>\u003Cli>March 2024: 2 million character context beta\u003C\u002Fli>\u003Cli>July 2024: context caching enters public beta\u003C\u002Fli>\u003C\u002Ful>\u003Ch2>The product shifted from chat to agent behavior\u003C\u002Fh2>\u003Cp>By late 2024, Kimi was moving past plain question answering. On 11 October 2024, Moonshot AI launched \u003Ca href=\"https:\u002F\u002Fkimi.com\" target=\"_blank\" rel=\"noopener\">Kimi Explore Edition\u003C\u002Fa>, which added autonomous search features. The company later said monthly active users passed 36 million, a useful sign that the product had broken out of niche AI-tinkerer territory.\u003C\u002Fp>\u003Cp>That same year, Moonshot also began internal testing of video generation. The pattern is pretty clear: Kimi was no longer being positioned as a chat box that summarizes text. It was becoming a tool that searches, plans, drafts, and eventually creates multi-step outputs.\u003C\u002Fp>\u003Cblockquote>\u003Cp>“We believe the best model is the one that can think, use tools, and solve real problems.” — Yang Zhilin, Moonshot AI\u003C\u002Fp>\u003C\u002Fblockquote>\u003Cp>The quote above captures Moonshot’s direction well. Yang Zhilin has repeatedly framed Kimi as a system for reasoning and long-horizon work, not a novelty chatbot. That matters because the company’s releases keep adding \u003Ca href=\"\u002Ftag\u002Fagent\">agent\u003C\u002Fa>-like behavior instead of only chasing \u003Ca href=\"\u002Ftag\u002Fbenchmark\">benchmark\u003C\u002Fa> chatter.\u003C\u002Fp>\u003Ch2>Kimi K1.5, K2, and K2.5 show the pace of the model line\u003C\u002Fh2>\u003Cp>The release cadence from 2025 into 2026 is the most revealing part of Kimi’s story. On 20 January 2025, Moonshot AI released Kimi K1.5 and claimed it matched \u003Ca href=\"https:\u002F\u002Fopenai.com\" target=\"_blank\" rel=\"noopener\">OpenAI\u003C\u002Fa> o1 on mathematics, coding, and multimodal reasoning. In April 2025, the company followed with \u003Ca href=\"https:\u002F\u002Fgithub.com\u002FMoonshotAI\u002FKimi-VL\" target=\"_blank\" rel=\"noopener\">Kimi-VL\u003C\u002Fa>, a 16 billion parameter open-source mixture-of-experts model with 3 billion active parameters.\u003C\u002Fp>\n\u003Cfigure class=\"my-6\">\u003Cimg src=\"https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1782231483884-q5a0.png\" alt=\"Kimi’s long-context push keeps getting bigger\" class=\"rounded-xl w-full\" loading=\"lazy\" \u002F>\u003C\u002Ffigure>\n\u003Cp>June 2025 brought \u003Ca href=\"https:\u002F\u002Fgithub.com\u002FMoonshotAI\u002FKimi-Dev\" target=\"_blank\" rel=\"noopener\">Kimi-Dev\u003C\u002Fa>, a 72B coding model based on Qwen2.5-72B that hit state-of-the-art results among open-source models on \u003Ca href=\"\u002Ftag\u002Fswe-bench-verified\">SWE-bench Verified\u003C\u002Fa>. Moonshot also launched \u003Ca href=\"https:\u002F\u002Fmoonshotai.github.io\u002FKimi-Researcher\u002F\" target=\"_blank\" rel=\"noopener\">Kimi-Researcher\u003C\u002Fa>, an autonomous research agent available through the app and website.\u003C\u002Fp>\u003Cp>Then came the bigger jump. In July 2025, Moonshot AI released \u003Ca href=\"https:\u002F\u002Fmoonshotai.github.io\u002FKimi-K2\u002F\" target=\"_blank\" rel=\"noopener\">Kimi K2\u003C\u002Fa>, a 1 trillion parameter MoE model with 32 billion active parameters, open sourced under a modified MIT license. In September 2025, the updated \u003Ca href=\"https:\u002F\u002Fhuggingface.co\u002Fmoonshotai\u002FKimi-K2-Instruct-0905\" target=\"_blank\" rel=\"noopener\">Kimi-K2-Instruct-0905\u003C\u002Fa> expanded context from 128K to 256K tokens and improved coding performance. By January 2026, Kimi K2.5 added multimodal vision and language understanding plus instant and thinking modes.\u003C\u002Fp>\u003Cul>\u003Cli>Kimi K1.5: January 2025\u003C\u002Fli>\u003Cli>Kimi K2: July 2025, 1T parameters, 32B active\u003C\u002Fli>\u003Cli>Kimi-K2-Instruct-0905: September 2025, 256K context\u003C\u002Fli>\u003Cli>Kimi K2.5: January 2026, multimodal and agentic\u003C\u002Fli>\u003C\u002Ful>\u003Ch2>Kimi’s attention trick matters more than raw size\u003C\u002Fh2>\u003Cp>One of the more interesting details in the line is \u003Ca href=\"https:\u002F\u002Fwww.kimi.com\" target=\"_blank\" rel=\"noopener\">Kimi Linear\u003C\u002Fa>, released in October 2025. That model used Kimi Delta Attention, or KDA, which cuts memory use and speeds up generation when context windows get long. That is exactly the kind of engineering choice that matters once a model is expected to process huge documents or multi-step tasks.\u003C\u002Fp>\u003Cp>Here is the practical comparison: bigger models grab headlines, but attention efficiency decides whether long-context use is affordable. A model with 256K tokens that burns less memory can run more conversations, handle more documents, and keep latency lower than a model that simply throws more parameters at the problem.\u003C\u002Fp>\u003Cp>Kimi’s model family now covers several distinct use cases rather than one chatbot experience. The public line includes general chat, research, coding, and agentic task execution, which makes it closer to a product suite than a single assistant.\u003C\u002Fp>\u003Cul>\u003Cli>128K tokens in the first public Kimi release\u003C\u002Fli>\u003Cli>256K tokens in Kimi-K2-Instruct-0905\u003C\u002Fli>\u003Cli>1 million rows of input data for OK Computer\u003C\u002Fli>\u003Cli>36 million+ monthly active users for Kimi Explore Edition\u003C\u002Fli>\u003C\u002Ful>\u003Cp>That spread also hints at Moonshot’s strategy. Instead of trying to win on one benchmark or one interface, the company keeps widening the product surface: \u003Ca href=\"\u002Fnews\u002Frandomized-yarn-long-context-reasoning-en\">long context\u003C\u002Fa> for heavy reading, agent modes for task execution, and open-weight releases for developers who want to inspect or adapt the models.\u003C\u002Fp>\u003Ch2>What Kimi says about China’s AI race\u003C\u002Fh2>\u003Cp>Kimi’s story is also a useful snapshot of how fast Chinese AI labs are moving on product design. Moonshot AI is not just shipping a chatbot; it is iterating on model families, agent workflows, and licensing choices at a pace that keeps pressure on bigger Western labs.\u003C\u002Fp>\u003Cp>The open-weight releases matter here. A model like Kimi K2 under a modified MIT license gives developers and researchers a path to test, fine-tune, and compare without waiting for a closed \u003Ca href=\"\u002Ftag\u002Fapi\">API\u003C\u002Fa> to expose every feature. That makes the ecosystem around Kimi more active than a simple consumer app would be.\u003C\u002Fp>\u003Cp>For developers, the next question is less about whether Kimi can chat and more about where it is cheapest and most useful. If Moonshot keeps improving KDA, expanding context, and tightening agent reliability, Kimi could become the default choice for long-document work and coding-heavy workflows in teams that want open models with strong throughput.\u003C\u002Fp>\u003Cp>The real test will be whether Kimi K2.7 Code and the next research agent can keep that pace without turning the product into a pile of overlapping modes. If Moonshot can keep one clean interface while the models underneath get more capable, Kimi will stay interesting for reasons that go beyond model size.\u003C\u002Fp>","Moonshot AI’s Kimi chatbot keeps expanding context, agents, and model size, with Kimi K2.5 arriving in January 2026.","en.wikipedia.org","https:\u002F\u002Fen.wikipedia.org\u002Fwiki\u002FKimi_(chatbot)",null,"https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1782231491199-wiwi.png","model-release","en","8b0b6a07-b173-42ab-883a-77d720808276",[17,18,19,20,21],"Kimi","Moonshot AI","long context","mixture of experts","agentic AI",[23,24,25],"Kimi started as a long-context chatbot and evolved into a multi-model product line.","Moonshot AI has repeatedly increased context windows, from 128K tokens to 256K tokens.","Kimi K2.5 in January 2026 added multimodal and agentic capabilities on top of a 1T-parameter MoE model.",0,"2026-06-23T16:17:38.462613+00:00","2026-06-23T16:17:38.456+00:00","1bae1133-d241-4581-9332-fbf39690c319",{"tags":31,"relatedLang":38,"relatedPosts":42},[32,34,36],{"name":18,"slug":33},"moonshot-ai",{"name":19,"slug":35},"long-context",{"name":21,"slug":37},"agentic-ai",{"id":15,"slug":39,"title":40,"language":41},"kimi-long-context-models-moonshot-ai-zh","Kimi 的長上下文一路加大","zh",[43,49,55,61,67,73],{"id":44,"slug":45,"title":46,"cover_image":47,"image_url":47,"created_at":48,"category":13},"7fa092e5-5be7-41b6-bb60-2792c0a79fac","gpt-56-rumors-2m-context-coding-gains-en","GPT-5.6 rumors point to 2M context and coding gains","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1782248567152-pbak.png","2026-06-23T21:02:23.553973+00:00",{"id":50,"slug":51,"title":52,"cover_image":53,"image_url":53,"created_at":54,"category":13},"4a4096ae-b174-4db7-b327-3e1d736f838c","midjourney-medical-60-second-body-scan-claim-en","Midjourney Medical’s 60-Second Body Scan Claim","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1782182881225-rvy4.png","2026-06-23T02:47:38.350835+00:00",{"id":56,"slug":57,"title":58,"cover_image":59,"image_url":59,"created_at":60,"category":13},"54271426-cb3c-4580-b96d-04a260cae6a0","glm-5-2-open-source-1m-context-long-tasks-en","GLM-5.2开源：1M上下文冲刺长程任务","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1782065869449-dgqt.png","2026-06-21T18:17:26.463894+00:00",{"id":62,"slug":63,"title":64,"cover_image":65,"image_url":65,"created_at":66,"category":13},"232fd4fb-5c31-468c-a8c7-105097726845","apple-intelligence-ai-everyday-experiences-en","Apple pushes AI deeper into iPhone apps","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1782012783760-dl40.png","2026-06-21T03:32:34.781747+00:00",{"id":68,"slug":69,"title":70,"cover_image":71,"image_url":71,"created_at":72,"category":13},"35368396-f604-46b7-9aa3-35ea227c99da","google-gemini-35-live-translate-audio-model-en","Google launches Gemini 3.5 Live Translate audio model","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1781807575345-04yn.png","2026-06-18T18:32:29.328812+00:00",{"id":74,"slug":75,"title":76,"cover_image":77,"image_url":77,"created_at":78,"category":13},"d18e6176-7ba9-4460-8230-425e3aeaeb86","kimi-k27-code-highspeed-mode-skips-benchmarks-en","Kimi K2.7-Code Adds HighSpeed Mode, Skips Benchmarks","https:\u002F\u002Fxxdpdyhzhpamafnrdkyq.supabase.co\u002Fstorage\u002Fv1\u002Fobject\u002Fpublic\u002Fcovers\u002Finline-1781795890377-d0e8.png","2026-06-18T15:17:41.403224+00:00",[80,85,90,95,100,105,110,115,120,125],{"id":81,"slug":82,"title":83,"created_at":84},"d4cffde7-9b50-4cc7-bb68-8bc9e3b15477","nvidia-rubin-ai-supercomputer-en","NVIDIA Unveils Rubin: A Leap in AI Supercomputing","2026-03-25T16:24:35.155565+00:00",{"id":86,"slug":87,"title":88,"created_at":89},"eab919b9-fbac-4048-89fc-afad6749ccef","google-gemini-ai-innovations-2026-en","Google's AI Leap with Gemini Innovations in 2026","2026-03-25T16:27:18.841838+00:00",{"id":91,"slug":92,"title":93,"created_at":94},"5f5cfc67-3384-4816-a8f6-19e44d90113d","gap-google-gemini-ai-checkout-en","Gap Teams Up with Google Gemini for AI-Driven Checkout","2026-03-25T16:27:46.483272+00:00",{"id":96,"slug":97,"title":98,"created_at":99},"f6d04567-47f6-49ec-804c-52e61ab91225","ai-model-release-wave-march-2026-en","Navigating the AI Model Release Wave of March 2026","2026-03-25T16:28:45.409716+00:00",{"id":101,"slug":102,"title":103,"created_at":104},"895c150c-569e-4fdf-939d-dade785c990e","small-language-models-transform-ai-en","Small Language Models: Llama 3.2 and Phi-3 Transform AI","2026-03-25T16:30:26.688313+00:00",{"id":106,"slug":107,"title":108,"created_at":109},"38eb1d26-d961-4fd3-ae12-9c4089680f5f","midjourney-v8-alpha-features-pricing-en","Midjourney V8 Alpha: A Deep Dive into Its Features and Pricing","2026-03-26T01:25:36.387587+00:00",{"id":111,"slug":112,"title":113,"created_at":114},"bf36bb9e-3444-4fb8-ab19-0df6bc9d8271","rag-2026-indispensable-ai-bridge-en","RAG in 2026: The Indispensable AI Bridge","2026-03-26T01:28:34.472046+00:00",{"id":116,"slug":117,"title":118,"created_at":119},"60881d6d-2310-44ef-b1fb-7f98e9dd2f0e","xiaomi-mimo-trio-agents-robots-voice-en","Xiaomi’s MiMo trio targets agents, robots, and voice","2026-03-28T03:05:08.899895+00:00",{"id":121,"slug":122,"title":123,"created_at":124},"f063d8d1-41d1-4de4-8ebc-6c40511b9369","xiaomi-mimo-v2-pro-1t-moe-agents-en","Xiaomi MiMo-V2-Pro: 1T MoE Model for Agents","2026-03-28T03:06:19.238032+00:00",{"id":126,"slug":127,"title":128,"created_at":129},"a1379e9a-6785-4ff5-9b0a-8cff55f8264f","cursor-composer-2-started-from-kimi-en","Cursor’s Composer 2 started from Kimi","2026-03-28T03:11:59.132398+00:00"]