Tiny Aya
Cohere · 2026-02
A dense decoder architecture whose attention combines grouped-query attention (GQA), sliding-window attention (SWA), and no positional embeddings (NoPE).
Tiny Aya decoder block:
Attention: GQA with sliding-window attention (SWA) and no positional embeddings (NoPE)
Normalization: RMSNorm
FFN: SwiGLU
Scale: 3.35B parameters, 8,192-token context window, 24 layers
Decoder type: Dense
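To make the block concrete, here is a minimal PyTorch sketch. Only the mechanisms named above (GQA, SWA, NoPE, RMSNorm, SwiGLU) follow the spec; the hidden size (2048), head counts (16 query / 4 KV), the 4,096-token window, and the pre-norm residual placement are illustrative assumptions, not published dimensions.

```python
# Minimal sketch of a Tiny Aya-style decoder block (PyTorch).
# Dimensions below are assumptions for illustration; only the mechanisms
# (GQA + SWA + NoPE, RMSNorm, SwiGLU) come from the spec above.
import torch
import torch.nn as nn
import torch.nn.functional as F


class RMSNorm(nn.Module):
    def __init__(self, dim: int, eps: float = 1e-6):
        super().__init__()
        self.weight = nn.Parameter(torch.ones(dim))
        self.eps = eps

    def forward(self, x):
        # Root-mean-square normalization: no mean subtraction, no bias.
        return x * x.pow(2).mean(-1, keepdim=True).add(self.eps).rsqrt() * self.weight


class GQASlidingWindowAttention(nn.Module):
    """GQA with a sliding-window causal mask and NoPE: no rotary or absolute
    position encoding is applied; order information comes only from the mask."""

    def __init__(self, dim=2048, n_heads=16, n_kv_heads=4, window=4096):
        super().__init__()
        self.n_heads, self.n_kv_heads = n_heads, n_kv_heads
        self.head_dim = dim // n_heads
        self.window = window
        self.q_proj = nn.Linear(dim, n_heads * self.head_dim, bias=False)
        self.k_proj = nn.Linear(dim, n_kv_heads * self.head_dim, bias=False)
        self.v_proj = nn.Linear(dim, n_kv_heads * self.head_dim, bias=False)
        self.o_proj = nn.Linear(n_heads * self.head_dim, dim, bias=False)

    def forward(self, x):
        B, T, _ = x.shape
        q = self.q_proj(x).view(B, T, self.n_heads, self.head_dim).transpose(1, 2)
        k = self.k_proj(x).view(B, T, self.n_kv_heads, self.head_dim).transpose(1, 2)
        v = self.v_proj(x).view(B, T, self.n_kv_heads, self.head_dim).transpose(1, 2)
        # GQA: each group of query heads shares one KV head.
        k = k.repeat_interleave(self.n_heads // self.n_kv_heads, dim=1)
        v = v.repeat_interleave(self.n_heads // self.n_kv_heads, dim=1)
        # SWA: causal mask restricted to the most recent `window` positions.
        i = torch.arange(T, device=x.device)
        mask = (i[:, None] >= i[None, :]) & (i[:, None] - i[None, :] < self.window)
        out = F.scaled_dot_product_attention(q, k, v, attn_mask=mask)
        return self.o_proj(out.transpose(1, 2).reshape(B, T, -1))


class SwiGLU(nn.Module):
    def __init__(self, dim=2048, hidden=5632):
        super().__init__()
        self.gate = nn.Linear(dim, hidden, bias=False)
        self.up = nn.Linear(dim, hidden, bias=False)
        self.down = nn.Linear(hidden, dim, bias=False)

    def forward(self, x):
        # SwiGLU: SiLU-gated linear unit.
        return self.down(F.silu(self.gate(x)) * self.up(x))


class DecoderBlock(nn.Module):
    def __init__(self, dim=2048):
        super().__init__()
        self.attn_norm = RMSNorm(dim)
        self.attn = GQASlidingWindowAttention(dim)
        self.ffn_norm = RMSNorm(dim)
        self.ffn = SwiGLU(dim)

    def forward(self, x):
        x = x + self.attn(self.attn_norm(x))   # pre-norm residual attention
        return x + self.ffn(self.ffn_norm(x))  # pre-norm residual FFN
```

Given embeddings x = torch.randn(2, 128, 2048), DecoderBlock()(x) returns a tensor of the same shape; a stack of 24 such blocks plus embedding and output layers would form the dense decoder described here.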
Architecture Specifications
Parameters: 3.35B
Context Window: 8,192 tokens
Decoder Type: Dense
Attention: GQA + SWA + NoPE
Release Date: 2026-02
Category: Efficient & Small
Organization: Cohere
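The parameter total can be sanity-checked with back-of-envelope arithmetic. Every dimension below (hidden size, head counts, FFN width, vocabulary size) is an assumption chosen for illustration; only the 24 layers and the ~3.35B target come from the table. These guesses land near, not exactly at, 3.35B; a larger vocabulary or wider FFN would close the gap.

```python
# Back-of-envelope parameter count for a hypothetical 24-layer config.
# All dimensions are illustrative assumptions; only the layer count (24)
# and the ~3.35B target come from the spec table above.
d_model, n_layers = 3072, 24
n_heads, n_kv_heads, head_dim = 24, 8, 128
d_ffn, vocab = 8192, 256_000  # SwiGLU hidden size, tokenizer vocab (assumed)

attn = d_model * n_heads * head_dim            # Q projection
attn += 2 * d_model * n_kv_heads * head_dim    # K and V (GQA: fewer KV heads)
attn += n_heads * head_dim * d_model           # output projection
ffn = 3 * d_model * d_ffn                      # SwiGLU: gate, up, down
norms = 2 * d_model                            # two RMSNorms per block
embed = vocab * d_model                        # tied input/output embedding

total = n_layers * (attn + ffn + norms) + embed + d_model  # + final norm
print(f"{total / 1e9:.2f}B parameters")        # ~3.20B under these assumptions
```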
Key Features
No positional embeddings · Massively multilingual · Compact