Google's New Gemma 4 Models Bring Complex Reasoning Skills to Low-Power Devices Google LLC has launched its latest open-weight artificial intelligence models, Gemma 4, marking a significant advancement in the field of lightweight, high-performance AI. These models, built on the architectural foundation of Gemini 3, are designed to handle complex reasoning tasks and support autonomous AI agents running on low-power devices such as workstations and smartphones. The release positions Google as a key player in the growing market for edge computing and local AI applications. The Gemma 4 family includes four variants: Effective 2B, Effective 4B, a 26B Mixture of Experts (MoE) model, and a 31B Dense model. The smaller "Effective" models are tailored for edge use cases, such as Android smartphones and Raspberry Pi computers, while the 26B MoE model introduces an innovative approach by activating only 3.8 billion parameters during inference tasks. This optimization allows the model to maintain high performance without compromising the depth of knowledge typical of larger models. The 31B Dense variant currently ranks third in open models on the industry-standard Arena AI Text leaderboard, demonstrating its competitive edge. Google DeepMind researchers Clement Farabet and Olivier Lacombe highlighted the models' ability to deliver "more intelligence per parameter," enabling them to outperform their size class. This efficiency is critical for applications requiring real-time processing and minimal computational resources. The models are also engineered to support AI agents, with native capabilities for function calling and structured JavaScript Object Notation (JSON) outputs.#google_llc #google_deepmind #clement_farabet #olivier_lacombe #hugging_face
