One API for every major LLM provider — routing, spend, logs, and controls in one place.

New York

130 E 59th St, 17th floor

New York, NY 10022

Wilmington

1201 N. Market Street, Suite 200

Wilmington, DE 19801

LLM Gateway

LLM Gateway
Request Routing
Usage Monitoring
Spend Management
Data Security
Access Controls

Teams

AI Engineering
Engineering Leadership
Finance & Operations
Security & Compliance

Integrations

All Integrations
Migration Guides

Platform

Pricing
Model Fortress
Enterprise
Documentation
Status

Legal

Privacy Policy
Terms of Service
Data Processing Addendum
Acceptable Use Policy

Features

Universal API Keys
Spend Tracking
Token Allocation
Usage Analytics
Request Logs
Alerts
Data Redaction
Zero Data Retention
Audit Logs

LLM Gateway

LLM Gateway
Request Routing
Usage Monitoring
Spend Management
Data Security
Access Controls

Teams

AI Engineering
Engineering Leadership
Finance & Operations
Security & Compliance

Integrations

All Integrations
Migration Guides

Platform

Pricing
Model Fortress
Enterprise
Documentation
Status

Legal

Privacy Policy
Terms of Service
Data Processing Addendum
Acceptable Use Policy

Features

Universal API Keys
Spend Tracking
Token Allocation
Usage Analytics
Request Logs
Alerts
Data Redaction
Zero Data Retention
Audit Logs

Offices

New York

130 E 59th St, 17th floor

New York, NY 10022

Wilmington

1201 N. Market Street, Suite 200

Wilmington, DE 19801

© 2026 Concentrate AI. All rights reserved.

Models DocsRequest a Demo

Back to Model Fortress

Qwen3 VL Flash

Alibaba Cloud

alibaba/qwen3-vl-flash

Released Oct 15, 2025

Providers

1

Context

256K

Input

$0.02/M

Output

$0.21/M

Aliases:alibaba-qwen3-vl-flashqwen3vl-flash

Alibaba's small-scale Qwen3 vision-language model with a 256K-token context window. Integrates thinking and non-thinking modes for fast image and document understanding, and supports structured output.

Alibaba's small-scale Qwen3 vision-language model with a 256K-token context window. Integrates thinking and non-thinking modes for fast image and document understanding, and supports structured output.

Alibaba Cloud

alibaba/qwen3-vl-flash

256K context32K max out

Input Price

$0.02/M tokens

Output Price

$0.21/M tokens

Cache read: $0.01/M