
GLM 4.5 Air is the lightweight variant of the GLM-4.5 flagship family, purpose-built for agent-focused applications. It retains the Mixture-of-Experts (MoE) architecture but with a more compact parameter size for efficiency. Like its larger counterpart, it supports hybrid inference modes—offering a “thinking mode” for advanced reasoning and tool use, and a “non-thinking mode” for fast, real-time interactions. Users can easily control reasoning behavior through a simple boolean toggle.
| Creator | zAI |
| Release Date | July, 2025 |
| License | MIT |
| Context Window | 131,072 |
| Image Input Support | No |
| Open Source (Weights) | Yes |
| Model Weights | Click here |
