Did Google hide the best version of Gemma 4 e4b in Android? The extracted model beats Unsloth and everything else I've tried.

Reddit r/LocalLLaMA / 4/22/2026

💬 Opinion · Signals & Early Trends · Tools & Practical Usage

Key Points

  • A Reddit user reports that the Gemma 4 e4b model obtained via Google AI Edge Gallery on Android is about 3.6GB, slightly smaller than the Unsloth build they tried (gemma-4-E4B-it-UD-Q2_K_XL.gguf, about 3.7GB).
  • They claim the Android-extracted model, a litertlm-format image pulled via adb, acts “smarter” than every version they downloaded from the internet, while the litert-community/gemma-4-E4B-it-litert-lm variant was especially buggy and produced incoherent Russian text.
  • The user questions whether Google may have effectively “hidden” or provided a better-performing build of Gemma 4 e4b through Android/AI Edge Gallery.
  • The post is framed as a personal observation made with some uncertainty: the user asks whether others see the same behavior, or whether they got confused somewhere or are simply short on sleep.

Why does Gemma 4 e4b from Google AI Edge Gallery on Android weigh only 3.6 gigs, while the one from Unsloth (gemma-4-E4B-it-UD-Q2_K_XL.gguf) weighs 3.7? And for some reason the model image in litertlm format, extracted via adb from Google AI Edge Gallery on Android, acts smarter than all the versions I've downloaded from the internet and tried. The one from litert-community/gemma-4-E4B-it-litert-lm turned out to be especially buggy: it writes completely incoherent text in Russian. Does anyone else see this, or did I get confused somewhere, or am I hallucinating from lack of sleep?
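One way to rule out the "did I get confused somewhere" part is to compare the files themselves rather than the rounded sizes a file manager shows. The following is only a minimal sketch, not anything from the original post: the helper names `describe` and `compare` are made up for illustration, and any paths passed in are whatever the reader's own copies are called. It reports exact byte counts (in both decimal GB and binary GiB, since "3.6 gigs" can mean either depending on the tool) and a SHA-256 hash, so two litertlm files that look the same size can be checked for being the same build.

```python
import hashlib
from pathlib import Path


def describe(path, chunk_size=1 << 20):
    """Return (size_in_bytes, sha256_hex) for a file, reading it in 1 MiB chunks."""
    p = Path(path)
    digest = hashlib.sha256()
    with p.open("rb") as f:
        # Read in chunks so a multi-gigabyte model file never sits fully in memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return p.stat().st_size, digest.hexdigest()


def compare(path_a, path_b):
    """Compare two model files by exact size and content hash."""
    size_a, hash_a = describe(path_a)
    size_b, hash_b = describe(path_b)
    return {
        "size_a_bytes": size_a,
        "size_b_bytes": size_b,
        "size_a_gb": size_a / 1e9,       # decimal gigabytes
        "size_b_gb": size_b / 1e9,
        "size_a_gib": size_a / 2**30,    # binary gibibytes
        "size_b_gib": size_b / 2**30,
        "sha256_a": hash_a,
        "sha256_b": hash_b,
        "identical": hash_a == hash_b,   # True only if the bytes match exactly
    }
```

Calling `compare()` on the adb-extracted image and the litert-community download would show whether they are byte-for-byte the same file. A hash mismatch only says the files differ, not which one is better, and comparing a litertlm file against a GGUF will always mismatch because they are different container formats.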

submitted by /u/LawyerCompetitive478