Model Releases

Latest AI model releases, benchmarks, and comparisons. Stay up to date with every new model launch from OpenAI, Anthropic, Google, Meta, and more.

Jun 29

Kimi K2.6 tops coding and agentic AI benchmarks

Moonshot AI’s Kimi K2.6 hits top marks in coding and agentic tasks, with a 262K context window and open-weight pricing at $0.74/$3.50 per 1M tokens.

Jun 29

Llama Legends 3.8.0 adds Season 3 heroes and raids

Llama Legends 3.8.0 adds 100 superhero cards, 12 achievements, four raid bosses, and the Atlas Ancient card.

Jun 29

oMLX 0.4.5.dev1 speeds up GLM-5.2 and MiniMax M3

oMLX 0.4.5.dev1 adds custom kernels for GLM-5.2 and MiniMax M3, plus cache fixes and better model profile exposure.

Jun 29

Grok 4.5 enters private beta at Tesla and SpaceX

xAI’s Grok 4.5 has entered private beta inside Tesla and SpaceX, its first internal rollout.

Jun 27

Google OpenRL brings RL fine-tuning to Kubernetes

Google’s OpenRL lets teams run LLM post-training and fine-tuning on their own Kubernetes clusters.

Jun 27

DiffusionGemma runs fast on NVIDIA RTX and DGX

Google DeepMind’s DiffusionGemma generates text in parallel, and NVIDIA says RTX and DGX hardware can run it up to 4x faster.

Jun 27

GLM-5.2 beats GPT-5.5 on coding tests

Z.ai’s GLM-5.2 beats GPT-5.5 on several coding benchmarks while claiming far lower cost.

Jun 27

OpenAI narrows GPT-5.6 rollout after U.S. request

OpenAI is limiting GPT-5.6 Sol, Terra and Luna to trusted partners before a wider release.

Jun 27

Ubuntu 26.10 Snapshot 2 adds GNOME 50 and kernel 7.0

Ubuntu 26.10 Snapshot 2 is out for testing with kernel 7.0, GNOME 50, and planned upgrades to kernel 7.2, GNOME 51, and Mesa 26.2.

Jun 27

Claude Fable 5 launches with 1M context, $10/$50 pricing

Anthropic adds Claude Fable 5 and limited-release Claude Mythos 5, both with 1M-token context, 128k output, and new refusal handling.

Jun 26

Google Pushes Gemini 3.5 Pro to July

Google pushed Gemini 3.5 Pro from June to July after early tester feedback and added pressure from OpenAI and Anthropic.

Jun 26

Gemini 3.5 Flash makes computer use a default, not a demo

Google is right to make computer use a native Gemini 3.5 Flash feature.

Jun 26

Xiaomi MiMo-V2.5-Pro: pricing, benchmarks, and limits

Xiaomi’s MiMo-V2.5-Pro pairs a 1M-token context with strong coding, agentic, and reasoning scores at mid-range pricing.

Jun 25

OpenAI’s Sora hardware targets enterprise video

OpenAI’s Sora enterprise hardware brings local AI video generation to studios, agencies, and firms that need speed and privacy.

Jun 24

GPT-5.6 rumors point to 2M context and coding gains

Rumors point to GPT-5.6 and GPT-5.6 Pro arriving June 25 with 2M context, stronger coding agents, and lower prices than rivals.

Jun 24

Kimi’s long-context push keeps getting bigger

Moonshot AI’s Kimi chatbot keeps expanding context, agents, and model size, with Kimi K2.5 arriving in January 2026.

Jun 23

Midjourney Medical’s 60-Second Body Scan Claim

Midjourney Medical’s concept scanner claims a 60-second whole-body ultrasound scan, but the clinical evidence and FDA path are still open.

Jun 22

GLM-5.2开源：1M上下文冲刺长程任务

智谱开源GLM-5.2，主打1M上下文、Coding和长程任务，API与主流推理框架同步支持。

Jun 21

Apple pushes AI deeper into iPhone apps

Apple’s 2026 Apple Intelligence update adds AI editing, Siri upgrades, Safari tools, and on-device privacy across its platforms.

Jun 19

Google launches Gemini 3.5 Live Translate audio model

Google unveiled Gemini 3.5 Live Translate, an audio model for live speech-to-speech translation.

Jun 18

Kimi K2.7-Code Adds HighSpeed Mode, Skips Benchmarks

Moonshot’s Kimi K2.7-Code adds a faster mode and lower token use, but only Moonshot’s own benchmarks back the claims.

Jun 18

Kimi K2.7: What Changed and How to Run It

Kimi K2.7 adds a fresh option for long-context, Chinese, and agentic coding workflows.

Jun 18

Linux Kernel 7.1 adds FRED, NTFS, and AMD fixes

Linux Kernel 7.1 lands with FRED on by default, a new NTFS driver, AMD power controls, and support for 12 new SoCs.

Jun 18

Fable 5 drew rare praise from top AI voices

Ethan Mollick and Andrej Karpathy praised Fable 5, putting the model under a bright spotlight.

Jun 18

Devin pricing in June 2026: plans, limits, tradeoffs

Devin starts at $20 and scales to a $500 Team plan, with enterprise pricing reserved for custom deals.

Jun 18

Self-host MiniMax M3 on GPU cloud

MiniMax M3 brings 229.9B MoE weights, 1M context, and multimodal output, but it needs serious GPU memory to run.

Jun 17

Apple’s Gemini-backed AI is still its own thing

Apple’s new Apple Intelligence uses Google-derived models, but Apple rebuilt them with its own weights, data, and guardrails.

Jun 17

Gemma 4 brings 256K context to open models

Google’s Gemma 4 adds text, image, and audio input, plus up to 256K context and five model sizes for local or server use.

Jun 17

Kimi K2.7 Code 该优先上 API 和 Kimi Code，而不是等生态成熟

Kimi K2.7 Code 应该优先通过 Kimi API 和 Kimi Code 上线使用。

Jun 16

Kingdom Hearts IV confirmed for Switch 2 launch

Square Enix confirmed Kingdom Hearts IV in a June Nintendo Direct and said the game will launch on Nintendo Switch 2.

You've reached the end