Google · 2026-04

Gemma 4 (E2B)

A dense decoder architecture combining multi-query attention (MQA), QK-Norm, and sliding-window attention (SWA).

Gemma 4 (E2B) decoder block architecture. Attention: MQA with QK-Norm and sliding-window attention (SWA). Normalization: RMSNorm. FFN: SwiGLU. Position encoding: RoPE. Scale: 5.1B parameters, 128K context, 24 layers. Decoder type: dense.
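The attention recipe named above can be sketched minimally in NumPy. This is an illustrative assumption of how MQA, QK-Norm, and a sliding-window causal mask compose, not the actual Gemma 4 implementation: learned norm gains, RoPE rotation, and any logit softcapping are omitted, and all weight names and shapes are hypothetical.

```python
import numpy as np

def rms_norm(x, eps=1e-6):
    # RMSNorm: divide by the root mean square of the last axis (gain omitted)
    return x / np.sqrt(np.mean(x * x, axis=-1, keepdims=True) + eps)

def mqa_swa_attention(x, wq, wk, wv, n_heads, window):
    """MQA with QK-Norm and a sliding-window causal mask (sketch).

    x: (seq, d_model); wq: (d_model, n_heads*d_head); wk, wv: (d_model, d_head).
    MQA: every query head shares the single K/V head.
    """
    seq, _ = x.shape
    d_head = wk.shape[1]
    q = (x @ wq).reshape(seq, n_heads, d_head)
    k = x @ wk                      # one shared key head
    v = x @ wv                      # one shared value head
    # QK-Norm: RMS-normalize queries and keys before the dot product
    q = rms_norm(q)
    k = rms_norm(k)
    scores = np.einsum("qhd,kd->hqk", q, k) / np.sqrt(d_head)
    # causal sliding window: position i attends to [i - window + 1, i]
    i = np.arange(seq)[:, None]
    j = np.arange(seq)[None, :]
    mask = (j <= i) & (j > i - window)
    scores = np.where(mask[None], scores, -np.inf)
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    out = np.einsum("hqk,kd->qhd", weights, v)
    return out.reshape(seq, n_heads * d_head)
```

MQA keeps a single K/V head to shrink the KV cache, and the sliding window bounds cache growth with context length; both choices fit the "Efficient & Small" positioning of this card.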


Architecture Specifications

Parameters: 5.1B
Context Window: 128K
Decoder Type: Dense
Attention: MQA + QK-Norm + SWA
Vocabulary Size: 262K
Release Date: 2026-04
Category: Efficient & Small
Organization: Google
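The SwiGLU feed-forward network listed in the block specification can be sketched as follows. The weight names and dimensions are illustrative assumptions; the real model's hidden width, biases, and any further details are not given on this card.

```python
import numpy as np

def swiglu_ffn(x, w_gate, w_up, w_down):
    """SwiGLU FFN (sketch): a SiLU-gated linear unit.

    x: (seq, d_model); w_gate, w_up: (d_model, d_ff); w_down: (d_ff, d_model).
    """
    def silu(z):
        # SiLU (swish): z * sigmoid(z)
        return z / (1.0 + np.exp(-z))
    # gate path is passed through SiLU, then multiplied elementwise
    # with the up-projection before projecting back down
    return (silu(x @ w_gate) * (x @ w_up)) @ w_down
```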

Key Features

Effective 2.3B parameters · MQA efficiency · On-device