Article: Context Engineering & Reuse Pattern Under the Hood of Claude Code • Dec 22, 2025
Collection: Switch-Transformers release. This release included various MoE (Mixture of Experts) models based on the T5 architecture. The base models use from 8 to 256 experts. • 9 items • Updated Mar 12