Alibaba's small-scale Qwen3 vision-language model with a 256K-token context window. Integrates thinking and non-thinking modes for fast image and document understanding, and supports structured output.