AI Navigate

Small models (Qwen 3.5 0.8B, Llama 3.2 1B, Gemma 3 1B) stuck in repetitive loops

Reddit r/LocalLLaMA / 3/20/2026

💬 OpinionTools & Practical UsageModels & Research

Key Points

  • Small models around 1B parameters tend to generate repetitive loops, especially at low temperature.
  • Increasing temperature above 1.0 provides some relief but does not fully solve the issue.
  • Adjusting generation controls like repetition_penalty, top_p, and top_k has been tried without a definitive fix.
  • Larger models (e.g., 3B+ from the same families) do not exhibit this looping behavior, suggesting a size-related limitation.
  • The post asks whether this is a known limitation of small models and what practical workarounds or parameter settings are effective.

I'm working with small models (~1B parameters) and frequently encounter issues where the output gets stuck in loops, repeatedly generating the same sentences or phrases. This happens especially consistent when temperature is set low (e.g., 0.1-0.3).

What I've tried:

  • Increasing temperature above 1.0 — helps somewhat but doesn't fully solve the issue
  • Setting repetition_penalty and other penalty parameters
  • Adjusting top_p and top_k

Larger models from the same families (e.g., 3B+) don't exhibit this problem.

Has anyone else experienced this? Is this a known limitation of smaller models, or are there effective workarounds I'm missing? Are there specific generation parameters that work better for small models?

submitted by /u/lionellee77
[link] [comments]