Qwen API Pricing 2026 — From $0.02/1M tokens

Why deploy Qwen through AI Security Gateway?

Automatic PII Redaction

Every Qwen request is scanned for 30+ PII entity types — SSNs, credit cards, emails, API keys, and more — before it reaches any provider.

Smart Cost Routing

Qwen is available across 3 providers. Our Smart Router picks the cheapest one per-request. 25% managed markup / 0% on Pro BYOK.

Native SDK or OpenAI Compatible

Use the AISG SDK (pip install aisg) for typed metadata and error handling, or change two lines in your OpenAI SDK. Both work.

Full Observability

Per-request logging of token counts, latency, DLP violations, and cost. Never wonder what your AI spend is again.

Qwen Strengths

Best-in-class Chinese and Asian language support
Competitive English performance in Qwen 2.5 series
Open-weights for transparency and fine-tuning
Strong coding variants (Qwen-Coder)
Multiple sizes from 0.5B to 72B for flexible deployment

Available Qwen Models (40)

Qwen QwQ-32B

oah/qwen3-235b-a22b-instruct-2507-tput

Open Source

Deploy Qwen QwQ-32B with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai

ReasoningInput: $1.20/MOutput: $1.20/M

Qwen 2 (1.5B)

oah/qwen2-1.5b

Open Source

Deploy Qwen 2 (1.5B) with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai

Input: $0.02/MOutput: $0.02/M

Qwen 2 (72B)

oah/qwen2

Open Source

Deploy Qwen 2 (72B) with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai

Input: $0.90/MOutput: $0.90/M

Qwen2-VL (72B) Instruct

oah/qwen2-vl

Open Source

Deploy Qwen2-VL (72B) Instruct with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai

Input: $1.20/MOutput: $1.20/M

Qwen2.5 1.5B

oah/qwen2.5-1.5b

Open Source

Deploy Qwen2.5 1.5B with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai

Input: Free/MOutput: Free/M

Qwen2.5 14B

oah/qwen2.5

Open Source

Deploy Qwen2.5 14B with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.aiDeepInfra

Input: $0.12/MOutput: $0.30/M

Qwen 2.5 Coder 32B Instruct

oah/qwen2.5-coder

Open Source

Deploy Qwen 2.5 Coder 32B Instruct with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai

Input: $0.80/MOutput: $0.80/M

Qwen2.5-VL (72B) Instruct

oah/qwen2.5-vl

Open Source

Deploy Qwen2.5-VL (72B) Instruct with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai

Input: $1.95/MOutput: $8.00/M

Qwen3 0.6B

oah/qwen3-0.6b

Open Source

Deploy Qwen3 0.6B with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai

ReasoningInput: Free/MOutput: Free/M

Qwen3 0.6B Base

oah/qwen3-0.6b-base

Open Source

Deploy Qwen3 0.6B Base with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai

ReasoningInput: Free/MOutput: Free/M

Qwen3 1.7B

oah/qwen3-1.7b

Open Source

Deploy Qwen3 1.7B with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai

ReasoningInput: Free/MOutput: Free/M

Qwen3 1.7B Base

oah/qwen3-1.7b-base

Open Source

Deploy Qwen3 1.7B Base with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai

ReasoningInput: Free/MOutput: Free/M

Qwen3 14B

oah/qwen3

Open Source

Deploy Qwen3 14B with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.aiGroqDeepInfra

ReasoningInput: Free/MOutput: Free/M

Qwen3 14B Base

oah/qwen3-14b-base

Open Source

Deploy Qwen3 14B Base with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai

ReasoningInput: Free/MOutput: Free/M

Qwen3 235B A22b Instruct 2507 Fp8

oah/qwen3-235b-a22b-instruct-2507

Open Source

Deploy Qwen3 235B A22b Instruct 2507 Fp8 with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai

ReasoningInput: Free/MOutput: Free/M

Qwen3 30B A3b Base

oah/qwen3-30b-a3b-base

Open Source

Deploy Qwen3 30B A3b Base with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai

ReasoningInput: Free/MOutput: Free/M

Qwen3 30B A3B Instruct 2507 Lora

oah/qwen3-30b-a3b-instruct-2507-lora

Open Source

Deploy Qwen3 30B A3B Instruct 2507 Lora with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai

ReasoningInput: Free/MOutput: Free/M

Qwen3 4B Base

oah/qwen3-4b-base

Open Source

Deploy Qwen3 4B Base with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai

ReasoningInput: Free/MOutput: Free/M

Qwen3 8B Base

oah/qwen3-8b-base

Open Source

Deploy Qwen3 8B Base with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai

ReasoningInput: Free/MOutput: Free/M

Qwen3 8B Lora

oah/qwen3-8b-lora

Open Source

Deploy Qwen3 8B Lora with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai

ReasoningInput: Free/MOutput: Free/M

Qwen3 Coder 30B A3b Instruct

oah/qwen3-coder

Open Source

Deploy Qwen3 Coder 30B A3b Instruct with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.aiDeepInfra

ReasoningInput: $0.29/MOutput: $1.20/M

Qwen3 Coder Next Fp8

oah/qwen3-coder-next

Open Source

Deploy Qwen3 Coder Next Fp8 with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai

ReasoningInput: $0.50/MOutput: $1.20/M

Qwen3 Next 80B A3b Instruct

oah/qwen3-next

Open Source

Deploy Qwen3 Next 80B A3b Instruct with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.aiDeepInfra

ReasoningInput: $0.14/MOutput: $1.40/M

Qwen3 Next 80B A3b Thinking

oah/qwen3-next-80b-a3b-thinking

Open Source

Deploy Qwen3 Next 80B A3b Thinking with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai

ReasoningInput: $0.15/MOutput: $1.50/M

Qwen3-VL-235B-A22B-Instruct-FP8

oah/qwen3-vl

Open Source

Deploy Qwen3-VL-235B-A22B-Instruct-FP8 with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.aiDeepInfra

ReasoningInput: $0.18/MOutput: $0.68/M

Qwen3.5 122B A10b Fp8

oah/qwen3.5

Open Source

Deploy Qwen3.5 122B A10b Fp8 with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.aiDeepInfra

ReasoningInput: $0.17/MOutput: $0.25/M

Qwen3.5 2B Lora

oah/qwen3.5-2b-lora

Open Source

Deploy Qwen3.5 2B Lora with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai

ReasoningInput: Free/MOutput: Free/M

Qwen3.5 35B A3B Lora

oah/qwen3.5-35b-a3b-lora

Open Source

Deploy Qwen3.5 35B A3B Lora with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai

ReasoningInput: Free/MOutput: Free/M

Qwen3.6 35B A3b Fp8

oah/qwen3.6

Open Source

Deploy Qwen3.6 35B A3b Fp8 with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.aiDeepInfra

ReasoningInput: Free/MOutput: Free/M

Qwen3.6 35B A3B Lora

oah/qwen3.6-35b-a3b-lora

Open Source

Deploy Qwen3.6 35B A3B Lora with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai

ReasoningInput: Free/MOutput: Free/M

Qwen3.6 Plus

oah/qwen3.6-plus

Open Source

Deploy Qwen3.6 Plus with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai

ReasoningInput: $0.50/MOutput: $3.00/M

Qwen3.7 Max

oah/qwen3.7-max

Open Source

Deploy Qwen3.7 Max with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.aiDeepInfra

ReasoningInput: $1.25/MOutput: $3.75/M

Qwen3.7 Plus

oah/qwen3.7-plus

Open Source

Deploy Qwen3.7 Plus with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai

ReasoningInput: $0.32/MOutput: $1.28/M

Arize AI Qwen 2 1.5B Instruct

oah/qwen-2-1.5b

Open Source

Deploy Arize AI Qwen 2 1.5B Instruct with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai

Input: $0.10/MOutput: $0.10/M

Cogito V1 Preview Qwen 14B

oah/cogito-v1-preview-qwen

Open Source

Deploy Cogito V1 Preview Qwen 14B with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Together.ai

Input: Free/MOutput: Free/M

Qwen/Qwen3-235B-A22B-Thinking-2507

oah/qwen3-235b-a22b-thinking

Open Source

Deploy Qwen/Qwen3-235B-A22B-Thinking-2507 with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

DeepInfra

ReasoningInput: $0.30/MOutput: $2.90/M

Qwen/Qwen3-Max

oah/qwen3-max

Open Source

Deploy Qwen/Qwen3-Max with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

DeepInfra

ReasoningInput: Free/MOutput: Free/M

Qwen/Qwen3-Max-Thinking

oah/qwen3-max-thinking

Open Source

Deploy Qwen/Qwen3-Max-Thinking with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

DeepInfra

ReasoningInput: Free/MOutput: Free/M

Qwen/Qwen3-TTS

oah/qwen3-tts

Open Source

Deploy Qwen/Qwen3-TTS with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

DeepInfra

ReasoningInput: Free/MOutput: Free/M

Qwen/Qwen3-TTS-VoiceDesign

oah/qwen3-tts-voicedesign

Open Source

Deploy Qwen/Qwen3-TTS-VoiceDesign with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

DeepInfra

ReasoningInput: Free/MOutput: Free/M

Qwen Pricing Comparison (per 1M tokens, USD)

Input / Output pricing by provider. Managed Mode adds a 25% managed markup. Pro BYOK = 0% markup.

Model	Params	Context	Vision	Together.ai	DeepInfra	Groq
Qwen QwQ-32B `oah/qwen3-235b-a22b-instruct-2507-tput`	—	131K	No	$1.20/$1.20	—	—
Qwen 2 (1.5B) `oah/qwen2-1.5b`	—	33K	No	$0.02/$0.02	—	—
Qwen 2 (72B) `oah/qwen2`	—	33K	No	$0.90/$0.90	—	—
Qwen2-VL (72B) Instruct `oah/qwen2-vl`	—	33K	No	$1.20/$1.20	—	—
Qwen2.5 1.5B `oah/qwen2.5-1.5b`	—	131K	No	—	—	—
Qwen2.5 14B `oah/qwen2.5`	—	131K	No	$0.30/$0.30	$0.12/$0.39	—
Qwen 2.5 Coder 32B Instruct `oah/qwen2.5-coder`	—	16K	No	$0.80/$0.80	—	—
Qwen2.5-VL (72B) Instruct `oah/qwen2.5-vl`	—	33K	No	$1.95/$8.00	—	—
Qwen3 0.6B `oah/qwen3-0.6b`	—	41K	No	—	—	—
Qwen3 0.6B Base `oah/qwen3-0.6b-base`	—	33K	No	—	—	—
Qwen3 1.7B `oah/qwen3-1.7b`	—	41K	No	—	—	—
Qwen3 1.7B Base `oah/qwen3-1.7b-base`	—	33K	No	—	—	—
Qwen3 14B `oah/qwen3`	—	2K	No	Free/Free	$0.10/$0.28	$0.29/$0.39
Qwen3 14B Base `oah/qwen3-14b-base`	—	33K	No	—	—	—
Qwen3 235B A22b Instruct 2507 Fp8 `oah/qwen3-235b-a22b-instruct-2507`	—	262K	No	—	—	—
Qwen3 30B A3b Base `oah/qwen3-30b-a3b-base`	—	33K	No	—	—	—
Qwen3 30B A3B Instruct 2507 Lora `oah/qwen3-30b-a3b-instruct-2507-lora`	—	262K	No	—	—	—
Qwen3 4B Base `oah/qwen3-4b-base`	—	33K	No	—	—	—
Qwen3 8B Base `oah/qwen3-8b-base`	—	33K	No	—	—	—
Qwen3 8B Lora `oah/qwen3-8b-lora`	—	41K	No	—	—	—
Qwen3 Coder 30B A3b Instruct `oah/qwen3-coder`	—	262K	No	$2.00/$2.00	$0.29/$1.20	—
Qwen3 Coder Next Fp8 `oah/qwen3-coder-next`	—	262K	No	$0.50/$1.20	—	—
Qwen3 Next 80B A3b Instruct `oah/qwen3-next`	—	262K	No	$0.15/$1.50	$0.14/$1.40	—
Qwen3 Next 80B A3b Thinking `oah/qwen3-next-80b-a3b-thinking`	—	262K	No	$0.15/$1.50	—	—
Qwen3-VL-235B-A22B-Instruct-FP8 `oah/qwen3-vl`	—	262K	No	$0.18/$0.68	—	—
Qwen3.5 122B A10b Fp8 `oah/qwen3.5`	—	262K	No	$0.17/$0.25	—	—
Qwen3.5 2B Lora `oah/qwen3.5-2b-lora`	—	262K	No	—	—	—
Qwen3.5 35B A3B Lora `oah/qwen3.5-35b-a3b-lora`	—	262K	No	—	—	—
Qwen3.6 35B A3b Fp8 `oah/qwen3.6`	—	262K	No	—	—	—
Qwen3.6 35B A3B Lora `oah/qwen3.6-35b-a3b-lora`	—	262K	No	—	—	—
Qwen3.6 Plus `oah/qwen3.6-plus`	—	1.0M	No	$0.50/$3.00	—	—
Qwen3.7 Max `oah/qwen3.7-max`	—	1.0M	No	$1.25/$3.75	—	—
Qwen3.7 Plus `oah/qwen3.7-plus`	—	1.0M	No	$0.32/$1.28	—	—
Arize AI Qwen 2 1.5B Instruct `oah/qwen-2-1.5b`	—	33K	No	$0.10/$0.10	—	—
Cogito V1 Preview Qwen 14B `oah/cogito-v1-preview-qwen`	—	131K	No	—	—	—
Qwen/Qwen3-235B-A22B-Thinking-2507 `oah/qwen3-235b-a22b-thinking`	—	—	No	—	$0.30/$2.90	—
Qwen/Qwen3-Max `oah/qwen3-max`	—	—	No	—	—	—
Qwen/Qwen3-Max-Thinking `oah/qwen3-max-thinking`	—	—	No	—	—	—
Qwen/Qwen3-TTS `oah/qwen3-tts`	—	—	No	—	—	—
Qwen/Qwen3-TTS-VoiceDesign `oah/qwen3-tts-voicedesign`	—	—	No	—	—	—

Qwen Direct vs AI Security Gateway

What you get at each pricing tier. Hub adds security, governance, and multi-provider routing on top of raw API access.

Mode	What You Pay	PII Redaction	Budget Caps	Routing	Audit Trail
Direct to Alibaba Cloud	Provider pricing only	None	None	Manual	None
Hub — Managed Mode	Provider + 25% markup	30+ PII types	Per-key hard caps	Smart Router	Full compliance log
Hub — Pro BYOK ($29/mo)	Direct to provider (0% markup)	30+ PII types	Per-key hard caps	Smart Router	Full compliance log

Popular Use Cases

Chinese/Asian language applications

Multilingual content generation and translation

Budget-friendly open-source deployments

Fine-tuning base models for domain-specific tasks

Quick Integration

# pip install aisg
from aisg import AISG

client = AISG(api_key="your_hub_api_key")

response = client.chat.create(
    model="oah/qwen3-235b-a22b-instruct-2507-tput",
    messages=[{"role": "user", "content": "Hello!"}],
)

print(response.content)
print(response.aisg_metadata.pii_detected)
print(response.aisg_metadata.cost_usd)

Use any virtual model name from the pricing table above (prefixed with oah/). Also works with the standard OpenAI SDK — just change base_url. Every request is PII-scanned before reaching Alibaba Cloud (Open Source).

Frequently Asked Questions

What is the Qwen API pricing?

Qwen API pricing varies by model size and provider. In Managed Mode, we add a 25% markup. With Pro BYOK, pay the provider directly at 0% markup. See the pricing table above for current rates.

What is the Qwen 2.5 cost?

Qwen 2.5 cost depends on the parameter count (0.5B to 72B) and provider. Smaller variants are extremely affordable. Check the pricing comparison table above.

Is Qwen good for English tasks?

Yes. Qwen 2.5 72B is competitive with Llama 3.3 70B on English benchmarks. For Chinese and multilingual tasks, Qwen is often the best open-source choice.

Previous Versions

These models have been retired by the provider. Migrate to a current variant above.

Qwen3 235B A22B Instruct 2507 FP8 Throughputtogether

Qwen3 235B A22B Thinking 2507 FP8together

Qwen3.5 0.8B Loratogether

Qwen3.5 122B A10B Loratogether

Qwen3.5 27B Loratogether

Qwen3.5 2B Loratogether

Qwen3.5 35B A3B Base Loratogether

Qwen3.5 397B A17B Loratogether

Qwen3.5 4B Loratogether

Qwen3.5 9B Loratogether

Qwen3.6 27B Loratogether

Qwen3.6 35B A3B Loratogether

Qwen/Qwen-Image-Editdeepinfra

Qwen/Qwen-Image-Edit-Maxdeepinfra

Qwen/Qwen2.5-VL-32B-Instructdeepinfra

Qwen/Qwen3-Embedding-0.6B-batchdeepinfra

Qwen/Qwen3-Embedding-4B-batchdeepinfra

Qwen/Qwen3-Embedding-8B-batchdeepinfra

Qwen/Qwen3.5-0.8Bdeepinfra

Deploy Qwen with Enterprise-Grade Security

Get started with 1,000,000 free credits. Every Qwen request is PII-scanned, cost-optimized, and fully logged — zero configuration.

Get 1,000,000 Free Credits Free PII Leak Checker

Not ready yet? Get notified about Qwen updates:

Explore Other Model Families

🦙Llama

Meta's open-weights Llama family is the most widely deployed open-source LLM series. Compare Llama API pricing across Gr…

🧠GPT

OpenAI's GPT family powers the majority of commercial AI applications. Compare GPT-4 API cost and OpenAI API pricing acr…

💎Gemini

Google's Gemini family offers powerful multimodal capabilities with large context windows. Compare Gemini API pricing an…

🤖Claude

Anthropic's Claude family is built with safety and reliability at its core. Compare Claude API pricing and Claude Sonnet…

🔍DeepSeek

DeepSeek has rapidly risen as a leading open-source model family, known for exceptional coding performance and cost effi…

← View all 11 model families

Model registry last updated: . Pricing shown is the lowest available rate across providers (per 1M tokens, USD). Actual pricing depends on provider and plan.

🏮Qwen Models

Why deploy Qwen through AI Security Gateway?

Automatic PII Redaction

Smart Cost Routing

Native SDK or OpenAI Compatible

Full Observability

Qwen Strengths

Available Qwen Models (40)

Qwen QwQ-32B

Qwen 2 (1.5B)

Qwen 2 (72B)

Qwen2-VL (72B) Instruct

Qwen2.5 1.5B

Qwen2.5 14B

Qwen 2.5 Coder 32B Instruct

Qwen2.5-VL (72B) Instruct

Qwen3 0.6B

Qwen3 0.6B Base

Qwen3 1.7B

Qwen3 1.7B Base

Qwen3 14B

Qwen3 14B Base

Qwen3 235B A22b Instruct 2507 Fp8

Qwen3 30B A3b Base

Qwen3 30B A3B Instruct 2507 Lora

Qwen3 4B Base

Qwen3 8B Base

Qwen3 8B Lora

Qwen3 Coder 30B A3b Instruct

Qwen3 Coder Next Fp8

Qwen3 Next 80B A3b Instruct

Qwen3 Next 80B A3b Thinking

Qwen3-VL-235B-A22B-Instruct-FP8

Qwen3.5 122B A10b Fp8

Qwen3.5 2B Lora

Qwen3.5 35B A3B Lora

Qwen3.6 35B A3b Fp8

Qwen3.6 35B A3B Lora

Qwen3.6 Plus

Qwen3.7 Max

Qwen3.7 Plus

Arize AI Qwen 2 1.5B Instruct

Cogito V1 Preview Qwen 14B

Qwen/Qwen3-235B-A22B-Thinking-2507

Qwen/Qwen3-Max

Qwen/Qwen3-Max-Thinking

Qwen/Qwen3-TTS

Qwen/Qwen3-TTS-VoiceDesign

Qwen Pricing Comparison (per 1M tokens, USD)

Qwen Direct vs AI Security Gateway

Popular Use Cases

Quick Integration

Frequently Asked Questions

Previous Versions

Deploy Qwen with Enterprise-Grade Security

Explore Other Model Families