We just shipped Gemma 4 support in Off Grid 🔥 - open-source mobile app, on-device inference, zero cloud. Android live, iOS coming soon.

Reddit r/LocalLLaMA / 4/9/2026


Key Points

  • Off Grid, an open-source offline-first AI mobile app, has shipped Gemma 4 support for Android today, with iOS planned for the near future.
  • The app runs Gemma 4 entirely on-device (no server, no Python/laptop), leveraging the phone’s NPU/CPU for local inference.
  • It brings Gemma 4’s 128K context window on-device, which makes long documents and code practical on mobile, and adds native vision: point the camera at an object and ask the model about it.
  • The release bundles additional capabilities including Whisper speech-to-text, Stable Diffusion image generation, and tool calling within a single app.
  • Cited throughput is roughly 15–30 tokens per second on Snapdragon 8 Gen 3 and Apple A17 Pro; the model is Apache 2.0 licensed and the app is MIT licensed.
We shipped Gemma 4 (E2B and E4B edge variants) in Off Grid today, our open-source, offline-first AI app for Android and iOS.

What makes this different from other local LLM setups:

→ No server, no Python, no laptop. Runs entirely on your phone's NPU/CPU.
→ Gemma 4's 128K context window, fully on-device: finally useful for long docs and code on mobile.
→ Native vision: point your camera at anything and ask Gemma 4 about it.
→ Whisper speech-to-text, Stable Diffusion image gen, tool calling: all in one app.
→ ~15–30 tok/s on Snapdragon 8 Gen 3 / Apple A17 Pro.
→ Apache 2.0 model, MIT app: genuinely open all the way down.

Gemma 4's E2B variant running in under 1.5GB RAM on a phone is honestly wild. The E4B with 128K context + vision is what we've been waiting for.

Android (live now): https://play.google.com/store/apps/details?id=ai.offgridmobile
iOS: coming soon
GitHub (MIT): https://github.com/alichherawalla/off-grid-mobile-ai

Would love to hear tok/s numbers people are seeing across different devices. Drop them below.
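To keep reported tok/s numbers comparable, here is a minimal sketch of the arithmetic, assuming you note how many tokens were generated and the wall-clock time the generation took. The function name and the sample figures are hypothetical, chosen for illustration; they are not measurements from Off Grid or any device.

```python
def tokens_per_second(generated_tokens: int, elapsed_seconds: float) -> float:
    """Decode throughput: generated tokens divided by wall-clock seconds."""
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return generated_tokens / elapsed_seconds


# Hypothetical example: 450 tokens generated in 20 seconds of wall-clock time.
rate = tokens_per_second(450, 20.0)
print(f"{rate:.1f} tok/s")
```

When posting a number, it helps to also mention the device, the variant (E2B or E4B), and the rough prompt length, since throughput can vary with all three.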
submitted by /u/CamusCave