What's Changed
- docs: update hermes by @ParthSareen in #15655
- mlx: apply repeat penalties in sampler by @dhiltgen in #15631
- mlx: fuse sigmoid router head in glm4_moe_lite by @jessegross in #15659
- launch: add kimi cli integration with installer flow by @ParthSareen in #15723
- server: apply format when think=false with thinking-capable parser by @ParthSareen in #15678
Full Changelog: v0.21.0...v0.21.1-rc0




