Tag
MoE
MoE, or Mixture of Experts, is an architecture that activates only a subset of experts per token or task, balancing total parameter count, inference cost, and quality. It shows up in open coding models, long-context agents, and other systems built for efficient scaling.
2 articles

Tools & Apps/Jun 22
Mistral Models Guide Turns Picking Easier
A practical breakdown of Mistral’s model lineup, pricing, and when I’d pick each one over OpenAI.

Research/May 8
UniPool shares MoE experts across layers
UniPool replaces per-layer MoE experts with one shared pool, cutting redundancy and improving validation loss in five LLaMA-scale models.