Model Releases
Latest AI model releases, benchmarks, and comparisons. Stay up to date with every new model launch from OpenAI, Anthropic, Google, Meta, and more.

Kimi K2.6 tops coding and agentic AI benchmarks
Moonshot AI’s Kimi K2.6 hits top marks in coding and agentic tasks, with a 262K context window and open-weight pricing at $0.74/$3.50 per 1M tokens.

Llama Legends 3.8.0 adds Season 3 heroes and raids
Llama Legends 3.8.0 adds 100 superhero cards, 12 achievements, four raid bosses, and the Atlas Ancient card.

oMLX 0.4.5.dev1 speeds up GLM-5.2 and MiniMax M3
oMLX 0.4.5.dev1 adds custom kernels for GLM-5.2 and MiniMax M3, plus cache fixes and better model profile exposure.

Grok 4.5 enters private beta at Tesla and SpaceX
xAI’s Grok 4.5 has entered private beta inside Tesla and SpaceX, its first internal rollout.

Google OpenRL brings RL fine-tuning to Kubernetes
Google’s OpenRL lets teams run LLM post-training and fine-tuning on their own Kubernetes clusters.

DiffusionGemma runs fast on NVIDIA RTX and DGX
Google DeepMind’s DiffusionGemma generates text in parallel, and NVIDIA says RTX and DGX hardware can run it up to 4x faster.

GLM-5.2 beats GPT-5.5 on coding tests
Z.ai’s GLM-5.2 beats GPT-5.5 on several coding benchmarks while claiming far lower cost.

OpenAI narrows GPT-5.6 rollout after U.S. request
OpenAI is limiting GPT-5.6 Sol, Terra and Luna to trusted partners before a wider release.

Ubuntu 26.10 Snapshot 2 adds GNOME 50 and kernel 7.0
Ubuntu 26.10 Snapshot 2 is out for testing with kernel 7.0, GNOME 50, and planned upgrades to kernel 7.2, GNOME 51, and Mesa 26.2.

Claude Fable 5 launches with 1M context, $10/$50 pricing
Anthropic adds Claude Fable 5 and limited-release Claude Mythos 5, both with 1M-token context, 128k output, and new refusal handling.

Google Pushes Gemini 3.5 Pro to July
Google pushed Gemini 3.5 Pro from June to July after early tester feedback and added pressure from OpenAI and Anthropic.

Gemini 3.5 Flash makes computer use a default, not a demo
Google is right to make computer use a native Gemini 3.5 Flash feature.

Xiaomi MiMo-V2.5-Pro: pricing, benchmarks, and limits
Xiaomi’s MiMo-V2.5-Pro pairs a 1M-token context with strong coding, agentic, and reasoning scores at mid-range pricing.

OpenAI’s Sora hardware targets enterprise video
OpenAI’s Sora enterprise hardware brings local AI video generation to studios, agencies, and firms that need speed and privacy.

GPT-5.6 rumors point to 2M context and coding gains
Rumors point to GPT-5.6 and GPT-5.6 Pro arriving June 25 with 2M context, stronger coding agents, and lower prices than rivals.

Kimi’s long-context push keeps getting bigger
Moonshot AI’s Kimi chatbot keeps expanding context, agents, and model size, with Kimi K2.5 arriving in January 2026.

Midjourney Medical’s 60-Second Body Scan Claim
Midjourney Medical’s concept scanner claims a 60-second whole-body ultrasound scan, but the clinical evidence and FDA path are still open.

GLM-5.2开源:1M上下文冲刺长程任务
智谱开源GLM-5.2,主打1M上下文、Coding和长程任务,API与主流推理框架同步支持。

Apple pushes AI deeper into iPhone apps
Apple’s 2026 Apple Intelligence update adds AI editing, Siri upgrades, Safari tools, and on-device privacy across its platforms.

Google launches Gemini 3.5 Live Translate audio model
Google unveiled Gemini 3.5 Live Translate, an audio model for live speech-to-speech translation.

Kimi K2.7-Code Adds HighSpeed Mode, Skips Benchmarks
Moonshot’s Kimi K2.7-Code adds a faster mode and lower token use, but only Moonshot’s own benchmarks back the claims.

Kimi K2.7: What Changed and How to Run It
Kimi K2.7 adds a fresh option for long-context, Chinese, and agentic coding workflows.

Linux Kernel 7.1 adds FRED, NTFS, and AMD fixes
Linux Kernel 7.1 lands with FRED on by default, a new NTFS driver, AMD power controls, and support for 12 new SoCs.

Fable 5 drew rare praise from top AI voices
Ethan Mollick and Andrej Karpathy praised Fable 5, putting the model under a bright spotlight.

Devin pricing in June 2026: plans, limits, tradeoffs
Devin starts at $20 and scales to a $500 Team plan, with enterprise pricing reserved for custom deals.

Self-host MiniMax M3 on GPU cloud
MiniMax M3 brings 229.9B MoE weights, 1M context, and multimodal output, but it needs serious GPU memory to run.

Apple’s Gemini-backed AI is still its own thing
Apple’s new Apple Intelligence uses Google-derived models, but Apple rebuilt them with its own weights, data, and guardrails.

Gemma 4 brings 256K context to open models
Google’s Gemma 4 adds text, image, and audio input, plus up to 256K context and five model sizes for local or server use.

Kimi K2.7 Code 该优先上 API 和 Kimi Code,而不是等生态成熟
Kimi K2.7 Code 应该优先通过 Kimi API 和 Kimi Code 上线使用。

Kingdom Hearts IV confirmed for Switch 2 launch
Square Enix confirmed Kingdom Hearts IV in a June Nintendo Direct and said the game will launch on Nintendo Switch 2.