About this model
Qwen 3.5 9B is a compact model from Alibaba's Qwen team, released in early 2026 as part of the Qwen 3.5 small-model series. Per the catalog, it offers vision support, covers 201 languages, and supports a reasoning ("thinking") mode as well as function calling. Its defining feature is a Gated DeltaNet hybrid attention architecture that pairs linear attention with periodic full-attention layers, giving it a 262K-token native context window that can be extended toward roughly 1M tokens.
Compared with its same-family predecessors, the model targets efficiency over scale. The Qwen model card describes the hybrid-attention design as a way to make long-context serving cheaper than the quadratic attention used in earlier dense Qwen generations. The older Qwen3 VL 235B vision line, by contrast, used the prior architecture and a far larger parameter count.
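To make the cost argument concrete, here is a rough sketch of how attention work scales when most layers are linear and only some are full attention. It is a back-of-the-envelope model, not the model's actual implementation: NUM_LAYERS, FULL_ATTENTION_EVERY, and all function names are hypothetical values chosen for illustration, and the cost unit ignores constant factors, feed-forward layers, and memory traffic.

```python
# Illustrative only: NUM_LAYERS and FULL_ATTENTION_EVERY are hypothetical
# values picked for this sketch, not the model's published configuration.
NUM_LAYERS = 32
FULL_ATTENTION_EVERY = 4  # assumed: every 4th layer uses full attention


def layer_kinds(num_layers: int, full_every: int) -> list[str]:
    """Label each layer 'full' (periodic full attention) or 'linear'."""
    return [
        "full" if (i + 1) % full_every == 0 else "linear"
        for i in range(num_layers)
    ]


def relative_attention_cost(seq_len: int, kinds: list[str]) -> int:
    """Crude cost units: n per linear layer, n * n per full-attention layer."""
    return sum(seq_len * seq_len if kind == "full" else seq_len for kind in kinds)


def hybrid_vs_dense_ratio(
    seq_len: int,
    num_layers: int = NUM_LAYERS,
    full_every: int = FULL_ATTENTION_EVERY,
) -> float:
    """Attention cost of the hybrid stack divided by an all-full-attention stack."""
    hybrid = relative_attention_cost(seq_len, layer_kinds(num_layers, full_every))
    dense = relative_attention_cost(seq_len, ["full"] * num_layers)
    return hybrid / dense


if __name__ == "__main__":
    for n in (4_096, 262_144, 1_000_000):
        print(f"{n:>9} tokens: hybrid/dense = {hybrid_vs_dense_ratio(n):.4f}")
```

Under these assumptions the ratio approaches the fraction of full-attention layers (1 / FULL_ATTENTION_EVERY) as the context grows, because the linear layers contribute a vanishing share of the total. Real savings depend on kernel efficiency and the actual layer layout.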
Within the 3.5 generation it sits below larger siblings such as Qwen 3.5 35B A3B and the flagship Qwen 3.5 397B, while sharing the same Gated DeltaNet foundation. The later Qwen 3.6 27B continues this hybrid-attention lineage.
The weights are openly available on Hugging Face under the Apache 2.0 license. No benchmark figures from a top evaluator, whether independent or vendor-official, were available, so no performance numbers are reported here.
This About section was AI-generated (Claude Opus 4.8) from public sources, with no human editing. It may contain inaccuracies; verify critical details against the listed data sources.
Data sources: Venice API · HuggingFace · Wikipedia