StepFun · 2026-02

Step 3.5 Flash

A Mixture-of-Experts (MoE) decoder architecture combining grouped-query attention (GQA) with sliding-window attention (SWA).

Step 3.5 Flash decoder block architecture:

Attention: grouped-query attention (GQA) + sliding-window attention (SWA)
Normalization: RMSNorm
FFN: Mixture of Experts (11B active parameters)
Position encoding: RoPE
Scale: 196B total parameters, 262K context, 96 layers
Decoder type: MoE
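To make the block structure concrete, here is a minimal PyTorch sketch of one decoder layer in the style described above: pre-norm RMSNorm, GQA attention with a sliding causal window and RoPE, and a top-k routed MoE feed-forward. All dimensions, head counts, the window size, and the expert count/routing scheme are illustrative assumptions; StepFun has not published an implementation or these hyperparameters.

```python
# Illustrative sketch of a Step 3.5 Flash-style decoder block.
# Dimensions, head counts, window size, and expert counts are assumptions.
import math
import torch
import torch.nn as nn

class RMSNorm(nn.Module):
    def __init__(self, dim, eps=1e-6):
        super().__init__()
        self.weight = nn.Parameter(torch.ones(dim))
        self.eps = eps
    def forward(self, x):
        return self.weight * x * torch.rsqrt(x.pow(2).mean(-1, keepdim=True) + self.eps)

def rope(x, base=10000.0):
    # x: (batch, heads, seq, head_dim); rotate channel pairs by position-dependent angles
    _, _, t, d = x.shape
    half = d // 2
    freqs = base ** (-torch.arange(half, device=x.device).float() / half)
    angles = torch.arange(t, device=x.device).float()[:, None] * freqs[None, :]  # (t, half)
    cos, sin = angles.cos(), angles.sin()
    x1, x2 = x[..., :half], x[..., half:]
    return torch.cat([x1 * cos - x2 * sin, x1 * sin + x2 * cos], dim=-1)

class SlidingWindowGQA(nn.Module):
    def __init__(self, d_model, n_heads, n_kv_heads, window):
        super().__init__()
        self.n_heads, self.n_kv, self.window = n_heads, n_kv_heads, window
        self.hd = d_model // n_heads
        self.q = nn.Linear(d_model, n_heads * self.hd, bias=False)
        self.kv = nn.Linear(d_model, 2 * n_kv_heads * self.hd, bias=False)
        self.o = nn.Linear(n_heads * self.hd, d_model, bias=False)
    def forward(self, x):
        b, t, _ = x.shape
        q = self.q(x).view(b, t, self.n_heads, self.hd).transpose(1, 2)
        k, v = self.kv(x).view(b, t, 2, self.n_kv, self.hd).permute(2, 0, 3, 1, 4)
        q, k = rope(q), rope(k)
        # GQA: each group of query heads shares one KV head
        rep = self.n_heads // self.n_kv
        k, v = k.repeat_interleave(rep, dim=1), v.repeat_interleave(rep, dim=1)
        # SWA: causal mask restricted to a window of recent positions
        i = torch.arange(t, device=x.device)
        mask = (i[None, :] <= i[:, None]) & (i[:, None] - i[None, :] < self.window)
        att = (q @ k.transpose(-2, -1)) / math.sqrt(self.hd)
        att = att.masked_fill(~mask, float("-inf")).softmax(-1)
        return self.o((att @ v).transpose(1, 2).reshape(b, t, -1))

class MoEFFN(nn.Module):
    """Token-choice top-k routing over small FFN experts (illustrative)."""
    def __init__(self, d_model, d_ff, n_experts=8, top_k=2):
        super().__init__()
        self.router = nn.Linear(d_model, n_experts, bias=False)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_ff, bias=False), nn.SiLU(),
                          nn.Linear(d_ff, d_model, bias=False))
            for _ in range(n_experts))
        self.top_k = top_k
    def forward(self, x):
        b, t, d = x.shape
        flat = x.reshape(-1, d)
        weights, idx = self.router(flat).softmax(-1).topk(self.top_k, dim=-1)
        weights = weights / weights.sum(-1, keepdim=True)  # renormalize over chosen experts
        out = torch.zeros_like(flat)
        for slot in range(self.top_k):
            for e, expert in enumerate(self.experts):
                sel = idx[:, slot] == e
                if sel.any():
                    out[sel] += weights[sel, slot].unsqueeze(-1) * expert(flat[sel])
        return out.view(b, t, d)

class DecoderBlock(nn.Module):
    def __init__(self, d_model=1024, n_heads=16, n_kv_heads=4, window=512):
        super().__init__()
        self.n1, self.n2 = RMSNorm(d_model), RMSNorm(d_model)
        self.attn = SlidingWindowGQA(d_model, n_heads, n_kv_heads, window)
        self.ffn = MoEFFN(d_model, 4 * d_model)
    def forward(self, x):
        x = x + self.attn(self.n1(x))    # pre-norm residual attention
        return x + self.ffn(self.n2(x))  # pre-norm residual MoE FFN
```

Stacking 96 such blocks at much larger width would match the published depth; the 11B-active / 196B-total split arises because each token activates only a few experts per layer while all experts contribute to the total parameter count.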


Architecture Specifications

Parameters: 11B active / 196B total
Context Window: 262K tokens
Decoder Type: MoE
Attention: GQA + SWA
Active Parameters: 11B
Release Date: 2026-02
Category: Mixture of Experts
Organization: StepFun
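For reference, the published figures above can be gathered into a single configuration object. This is a sketch: the class and field names are my own, and the 262,144-token value assumes "262K" denotes 2**18 tokens; the values themselves come from the spec sheet.

```python
# Published Step 3.5 Flash figures collected into one config object (field names assumed).
from dataclasses import dataclass

@dataclass(frozen=True)
class Step35FlashConfig:
    total_params: str = "196B"        # total parameter count
    active_params: str = "11B"        # parameters active per token
    context_window: int = 262_144     # "262K" context, assumed to mean 2**18 tokens
    n_layers: int = 96                # decoder layers
    attention: str = "GQA + SWA"      # grouped-query + sliding-window attention
    normalization: str = "RMSNorm"
    position_encoding: str = "RoPE"
    ffn: str = "MoE"                  # mixture-of-experts feed-forward
    release: str = "2026-02"
    organization: str = "StepFun"
```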

Key Features

Fast inference · MoE · SWA · 11B active parameters