Qwen QwQ-32B
oah/qwen3-235b-a22b-thinking: Deploy Qwen QwQ-32B with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
by Alibaba Cloud (Open Source)
Alibaba's Qwen family offers strong multilingual performance, with a particular edge in Chinese and other Asian languages. Qwen 2.5 is competitive on English benchmarks while keeping that multilingual strength. Compare Qwen API pricing and Qwen 2.5 cost across providers.
Every Qwen request is scanned for 28+ PII entity types — SSNs, credit cards, emails, API keys, and more — before it reaches any provider.
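To make the pre-provider scan concrete, here is a minimal sketch of pattern-based redaction for three of the entity types named above (emails, SSNs, credit cards). It is an illustration only, not the Hub's scanner: the patterns, the `redact` function, and the `[LABEL]` placeholder format are all assumptions, and a real scanner covering 28+ entity types would need far more than regular expressions.

```python
import re

# Hypothetical, deliberately simple patterns; not the Hub's actual rules.
PATTERNS = {
    "EMAIL": re.compile(r"\b[\w.+-]+@[\w-]+(?:\.[\w-]+)+\b"),
    "SSN": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
    # 13 to 19 digits, optionally separated by single spaces or hyphens.
    "CREDIT_CARD": re.compile(r"\b\d(?:[ -]?\d){12,18}\b"),
}


def luhn_ok(digits: str) -> bool:
    """Return True if the digit string passes the Luhn checksum."""
    total = 0
    for i, ch in enumerate(reversed(digits)):
        d = int(ch)
        if i % 2 == 1:  # double every second digit, counting from the right
            d *= 2
            if d > 9:
                d -= 9
        total += d
    return total % 10 == 0


def redact(text: str):
    """Replace matches with [LABEL] and return (redacted_text, labels_found)."""
    found = []

    def substitute(label):
        def _sub(match):
            if label == "CREDIT_CARD":
                digits = re.sub(r"\D", "", match.group())
                if not luhn_ok(digits):
                    return match.group()  # long number, but not a valid card
            found.append(label)
            return f"[{label}]"
        return _sub

    for label, pattern in PATTERNS.items():
        text = pattern.sub(substitute(label), text)
    return text, found


prompt = "Email jane.doe@example.com, SSN 123-45-6789, card 4111 1111 1111 1111."
print(redact(prompt))
```

The Luhn check is there to cut false positives: a 16-digit order number that fails the checksum is left untouched.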
Qwen is available across 2 providers. Our Smart Router picks the cheapest one for each request. Managed Mode adds a 25% markup; Pro BYOK adds 0%.
Use the AISG SDK (pip install aisg) for typed metadata and error handling, or change two lines in your OpenAI SDK. Both work.
Per-request logging of token counts, latency, DLP violations, and cost. Never wonder what your AI spend is again.
oah/qwen3-235b-a22b-thinking: Deploy Qwen QwQ-32B with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen2-1.5b: Deploy Qwen 2 (1.5B) with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen2: Deploy Qwen 2 (72B) with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen2-vl: Deploy Qwen2-VL (72B) Instruct with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen2.5-1.5b: Deploy Qwen2.5 1.5B with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen2.5: Deploy Qwen2.5 14B with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen2.5-coder: Deploy Qwen 2.5 Coder 32B Instruct with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen2.5-vl: Deploy Qwen2.5-VL (72B) Instruct with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-0.6b: Deploy Qwen3 0.6B with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-0.6b-base: Deploy Qwen3 0.6B Base with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-1.7b: Deploy Qwen3 1.7B with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-1.7b-base: Deploy Qwen3 1.7B Base with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3: Deploy Qwen3 14B with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-14b-base: Deploy Qwen3 14B Base with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-235b-a22b-instruct-2507: Deploy Qwen3 235B A22B Instruct 2507 FP8 with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-235b-a22b-instruct-2507-tput: Deploy Qwen3 235B A22B Instruct 2507 FP8 Throughput with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-30b-a3b-base: Deploy Qwen3 30B A3B Base with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-30b-a3b-instruct-2507-lora: Deploy Qwen3 30B A3B Instruct 2507 LoRA with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-4b-base: Deploy Qwen3 4B Base with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-8b-base: Deploy Qwen3 8B Base with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-8b-lora: Deploy Qwen3 8B LoRA with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-coder: Deploy Qwen3 Coder 30B A3B Instruct with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-coder-next: Deploy Qwen3 Coder Next FP8 with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-next: Deploy Qwen3 Next 80B A3B Instruct with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-next-80b-a3b-thinking: Deploy Qwen3 Next 80B A3B Thinking with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-vl: Deploy Qwen3-VL-235B-A22B-Instruct-FP8 with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3.5: Deploy Qwen3.5 122B A10B FP8 with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3.6: Deploy Qwen3.6 35B A3B FP8 with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3.6-plus: Deploy Qwen3.6 Plus with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3.7-max: Deploy Qwen3.7 Max with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen-2-1.5b: Deploy Arize AI Qwen 2 1.5B Instruct with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/cogito-v1-preview-qwen: Deploy Cogito V1 Preview Qwen 14B with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen-image-edit-max: Deploy Qwen/Qwen-Image-Max with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-max: Deploy Qwen/Qwen3-Max with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-max-thinking: Deploy Qwen/Qwen3-Max-Thinking with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3-tts-voicedesign: Deploy Qwen/Qwen3-TTS-VoiceDesign with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
oah/qwen3.5-0.8b: Deploy Qwen/Qwen3.5-0.8B with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.
Input/output pricing by provider, in USD per 1M tokens. Managed Mode adds a 25% markup; Pro BYOK has 0% markup.
| Model | Model ID | Params | Context | Vision | Together.ai | DeepInfra |
|---|---|---|---|---|---|---|
| Qwen QwQ-32B | oah/qwen3-235b-a22b-thinking | — | 131K | No | $1.20/$1.20 | $0.30/$2.90 |
| Qwen 2 (1.5B) | oah/qwen2-1.5b | — | 33K | No | $0.02/$0.02 | — |
| Qwen 2 (72B) | oah/qwen2 | — | 33K | No | $0.90/$0.90 | — |
| Qwen2-VL (72B) Instruct | oah/qwen2-vl | — | 33K | No | $1.20/$1.20 | — |
| Qwen2.5 1.5B | oah/qwen2.5-1.5b | — | 131K | No | — | — |
| Qwen2.5 14B | oah/qwen2.5 | — | 131K | No | $0.30/$0.30 | $0.12/$0.39 |
| Qwen 2.5 Coder 32B Instruct | oah/qwen2.5-coder | — | 16K | No | $0.80/$0.80 | — |
| Qwen2.5-VL (72B) Instruct | oah/qwen2.5-vl | — | 33K | No | $1.95/$8.00 | — |
| Qwen3 0.6B | oah/qwen3-0.6b | — | 41K | No | — | — |
| Qwen3 0.6B Base | oah/qwen3-0.6b-base | — | 33K | No | — | — |
| Qwen3 1.7B | oah/qwen3-1.7b | — | 41K | No | — | — |
| Qwen3 1.7B Base | oah/qwen3-1.7b-base | — | 33K | No | — | — |
| Qwen3 14B | oah/qwen3 | — | 2K | No | Free/Free | $0.10/$0.28 |
| Qwen3 14B Base | oah/qwen3-14b-base | — | 33K | No | — | — |
| Qwen3 235B A22B Instruct 2507 FP8 | oah/qwen3-235b-a22b-instruct-2507 | — | 262K | No | — | — |
| Qwen3 235B A22B Instruct 2507 FP8 Throughput | oah/qwen3-235b-a22b-instruct-2507-tput | — | 262K | No | $0.20/$0.60 | — |
| Qwen3 30B A3B Base | oah/qwen3-30b-a3b-base | — | 33K | No | — | — |
| Qwen3 30B A3B Instruct 2507 LoRA | oah/qwen3-30b-a3b-instruct-2507-lora | — | 262K | No | — | — |
| Qwen3 4B Base | oah/qwen3-4b-base | — | 33K | No | — | — |
| Qwen3 8B Base | oah/qwen3-8b-base | — | 33K | No | — | — |
| Qwen3 8B LoRA | oah/qwen3-8b-lora | — | 41K | No | — | — |
| Qwen3 Coder 30B A3B Instruct | oah/qwen3-coder | — | 262K | No | $2.00/$2.00 | $0.29/$1.20 |
| Qwen3 Coder Next FP8 | oah/qwen3-coder-next | — | 262K | No | $0.50/$1.20 | — |
| Qwen3 Next 80B A3B Instruct | oah/qwen3-next | — | 262K | No | $0.15/$1.50 | $0.14/$1.40 |
| Qwen3 Next 80B A3B Thinking | oah/qwen3-next-80b-a3b-thinking | — | 262K | No | $0.15/$1.50 | — |
| Qwen3-VL-235B-A22B-Instruct-FP8 | oah/qwen3-vl | — | 262K | No | $0.18/$0.68 | — |
| Qwen3.5 122B A10B FP8 | oah/qwen3.5 | — | 262K | No | $0.17/$0.25 | — |
| Qwen3.6 35B A3B FP8 | oah/qwen3.6 | — | 262K | No | — | — |
| Qwen3.6 Plus | oah/qwen3.6-plus | — | 1.0M | No | $0.50/$3.00 | — |
| Qwen3.7 Max | oah/qwen3.7-max | — | 1.0M | No | $1.25/$3.75 | — |
| Arize AI Qwen 2 1.5B Instruct | oah/qwen-2-1.5b | — | 33K | No | $0.10/$0.10 | — |
| Cogito V1 Preview Qwen 14B | oah/cogito-v1-preview-qwen | — | 131K | No | — | — |
| Qwen/Qwen-Image-Max | oah/qwen-image-edit-max | — | — | No | — | — |
| Qwen/Qwen3-Max | oah/qwen3-max | — | — | No | — | — |
| Qwen/Qwen3-Max-Thinking | oah/qwen3-max-thinking | — | — | No | — | — |
| Qwen/Qwen3-TTS-VoiceDesign | oah/qwen3-tts-voicedesign | — | — | No | — | — |
| Qwen/Qwen3.5-0.8B | oah/qwen3.5-0.8b | — | — | No | — | — |
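As a worked example of reading the table, the sketch below estimates what one request would cost at each provider and which is cheaper, using three rows copied from the pricing above. It assumes prices are USD per 1M tokens and that the Managed Mode markup is applied multiplicatively to the provider cost. The function names are illustrative, and the real Smart Router may weigh factors this sketch ignores.

```python
MANAGED_MARKUP = 0.25  # Managed Mode markup stated on this page

# (input USD per 1M tokens, output USD per 1M tokens), copied from the table.
PRICES = {
    "oah/qwen2.5": {"Together.ai": (0.30, 0.30), "DeepInfra": (0.12, 0.39)},
    "oah/qwen3-next": {"Together.ai": (0.15, 1.50), "DeepInfra": (0.14, 1.40)},
    "oah/qwen3-coder": {"Together.ai": (2.00, 2.00), "DeepInfra": (0.29, 1.20)},
}


def provider_cost(rates, input_tokens, output_tokens):
    """Raw provider cost in USD for one request."""
    input_rate, output_rate = rates
    return (input_tokens * input_rate + output_tokens * output_rate) / 1_000_000


def cheapest(model, input_tokens, output_tokens, managed=True):
    """Return (provider, cost_usd) for the cheapest provider of this request."""
    costs = {
        provider: provider_cost(rates, input_tokens, output_tokens)
        for provider, rates in PRICES[model].items()
    }
    provider = min(costs, key=costs.get)
    cost = costs[provider]
    if managed:
        cost *= 1 + MANAGED_MARKUP  # assumed: markup scales the provider cost
    return provider, cost


# Input-heavy request: 10,000 input tokens, 2,000 output tokens.
print(cheapest("oah/qwen2.5", 10_000, 2_000))
# Output-heavy request: 1,000 input tokens, 10,000 output tokens.
print(cheapest("oah/qwen2.5", 1_000, 10_000))
```

For `oah/qwen2.5` the cheaper provider depends on the token mix: DeepInfra wins the input-heavy request, while Together.ai wins the output-heavy one. That is why the choice is made per request.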
What you get at each pricing tier. Hub adds security, governance, and multi-provider routing on top of raw API access.
| Mode | What You Pay | PII Redaction | Budget Caps | Routing | Audit Trail |
|---|---|---|---|---|---|
| Direct to Alibaba Cloud | Provider pricing only | None | None | Manual | None |
| Hub — Managed Mode | Provider + 25% markup | 28+ PII types | Per-key hard caps | Smart Router | Full compliance log |
| Hub — Pro BYOK ($29/mo) | Direct to provider (0% markup) | 28+ PII types | Per-key hard caps | Smart Router | Full compliance log |
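The two Hub tiers in the table can be compared with simple arithmetic. The sketch below assumes Managed Mode costs provider spend plus 25%, Pro BYOK costs provider spend plus the flat $29/month, and free credits are ignored. The function names are illustrative.

```python
MANAGED_MARKUP = 0.25         # Managed Mode: provider price + 25%
PRO_BYOK_MONTHLY_FEE = 29.00  # Pro BYOK: flat fee, 0% markup


def managed_total(provider_spend):
    """Monthly total in USD under Managed Mode."""
    return provider_spend * (1 + MANAGED_MARKUP)


def byok_total(provider_spend):
    """Monthly total in USD under Pro BYOK (provider bill plus flat fee)."""
    return provider_spend + PRO_BYOK_MONTHLY_FEE


def break_even_spend():
    """Monthly provider spend at which both tiers cost the same."""
    return PRO_BYOK_MONTHLY_FEE / MANAGED_MARKUP


for spend in (50, break_even_spend(), 400):
    print(f"spend=${spend:.2f}  managed=${managed_total(spend):.2f}  "
          f"byok=${byok_total(spend):.2f}")
```

Under these assumptions the tiers cost the same at $116 of monthly provider spend ($145 total). Below that, Managed Mode is cheaper; above it, Pro BYOK is.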
Chinese/Asian language applications
Multilingual content generation and translation
Budget-friendly open-source deployments
Fine-tuning base models for domain-specific tasks
```python
# pip install aisg
from aisg import AISG

# Authenticate with your Hub API key.
client = AISG(api_key="your_hub_api_key")

response = client.chat.create(
    model="oah/qwen3-235b-a22b-thinking",
    messages=[{"role": "user", "content": "Hello!"}],
)

print(response.content)                     # model reply
print(response.aisg_metadata.pii_detected)  # PII scan result for this request
print(response.aisg_metadata.cost_usd)      # cost of this request in USD
```

Use any virtual model name from the pricing table above (prefixed with oah/). This also works with the standard OpenAI SDK: just change base_url. Every request is PII-scanned before it reaches the provider.
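For readers taking the OpenAI SDK route, the sketch below shows the request an OpenAI-compatible client would send once base_url points at the Hub. It builds the HTTP request with the standard library and does not send it. The base URL `https://hub.example.com/v1` is a placeholder rather than the real endpoint, and the `/chat/completions` path and Bearer header are assumed from the OpenAI chat-completions convention that SDK compatibility implies.

```python
import json
import urllib.request

# Hypothetical placeholder; substitute the base URL from your Hub dashboard.
HUB_BASE_URL = "https://hub.example.com/v1"


def build_chat_request(api_key, model, messages, base_url=HUB_BASE_URL):
    """Build (but do not send) an OpenAI-style chat completion request."""
    payload = {"model": model, "messages": messages}
    return urllib.request.Request(
        url=f"{base_url.rstrip('/')}/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


req = build_chat_request(
    api_key="your_hub_api_key",
    model="oah/qwen3-235b-a22b-thinking",
    messages=[{"role": "user", "content": "Hello!"}],
)
print(req.get_method(), req.full_url)
# urllib.request.urlopen(req) would send it; omitted so the sketch runs offline.
```

With the OpenAI Python SDK, the equivalent change is passing `base_url` and `api_key` to the `OpenAI(...)` constructor and using an `oah/` model name.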
These models have been retired by the provider. Migrate to a current variant above.
Get started with 1,000,000 free credits. Every Qwen request is PII-scanned, cost-optimized, and fully logged — zero configuration.
Model registry last updated: . Pricing shown is the lowest available rate across providers (per 1M tokens, USD). Actual pricing depends on provider and plan.