On-device LLMs in 2026: Gemini Nano vs Apple Intelligence
On-device LLMs are finally usable for production features. Where Gemini Nano wins, where Apple Intelligence wins, and the Hungarian-language gap.
On-device LLMs are finally usable for production features. Where Gemini Nano wins, where Apple Intelligence wins, and the Hungarian-language gap.
Two years ago, running an LLM on-device was a science project. In 2026 it is a deploy target. Both Google and Apple ship first-party on-device models with public APIs, the RAM ceiling finally fits 2-4B parameter models, and power draw is defensible for features you run a few times a minute.
Both models underperform on Hungarian vs English. In our evals Gemini Nano gives ~85% acceptable-output rate on Hungarian summarisation, Apple Intelligence ~80%. For comparison, Claude 3.7 Haiku is ~97%. For Hungarian-heavy features we keep a cloud fallback for now.
Always design the UI for a cloud fallback from day one. The right on-device feature feels instant when the model is present and works anyway when it is not.

Founder, DField Solutions
I've shipped production products from fintech to creator-tooling · for startups and enterprises, from Budapest to San Francisco.
Let's talk about your project. 30 minutes, no strings.