Do 2B models have practical use cases, or are they just toys for now?

Reddit r/LocalLLaMA / 3/28/2026

💬 Opinion · Signals & Early Trends · Tools & Practical Usage

Key Points

  • A Reddit user tests locally hosted 2B (billions of parameters) models on a smartphone and finds that responses often include substantial hallucinations even for relatively straightforward questions.
  • The user asks whether their setup is incorrect or whether this level of inaccuracy is expected behavior for small local models like Qwen 2.5/3.5 and Gemma.
  • The post frames the core issue as whether 2B models can deliver practical, reliable use cases or are currently mostly experimental “toys.”

I'm new to local hosting, and I just tried 2B models on my smartphone (qwen2.5/3.5, gemma).

I asked generic questions, like the top 3 cities of a small country. The answers go in the right general direction, but 80% of the reply is hallucination.

Am I doing something wrong, or is this expected?

submitted by /u/Civic_Hactivist_86