All Models
4 Models · 1 Providers · PII Redacted

💎Gemini Models

by Google (Closed Source)

Google's Gemini family offers powerful multimodal capabilities with large context windows. Compare Gemini API pricing and Google AI API pricing across model tiers. Gemini 2.5 Pro leads with 1M+ token context and native multimodal understanding, while Gemini 2.0 Flash provides fast, cost-effective inference.

From $0.10/M tokens
1 providers
28+ PII entities redacted

Why deploy Gemini through AI Security Gateway?

Automatic PII Redaction

Every Gemini request is scanned for 28+ PII entity types — SSNs, credit cards, emails, API keys, and more — before it reaches any provider.

Smart Cost Routing

Gemini is available across 1 providers. Our Smart Router picks the cheapest one per-request. 30% managed markup / 0% on Pro BYOK.

Zero Code Changes

Change two lines in your OpenAI SDK — base_url and api_key — and every request flows through AISG. Full backward compatibility.

Full Observability

Per-request logging of token counts, latency, DLP violations, and cost. Never wonder what your AI spend is again.

Gemini Strengths

  • Massive context windows (1M+ tokens on Pro variants)
  • Native multimodal: text, image, audio, and video understanding
  • Gemini 2.0 Flash: fast and cost-competitive
  • Strong performance on knowledge-intensive and multilingual tasks
  • Tight integration with Google Cloud ecosystem

Available Gemini Models (4)

Gemini 2.5 Pro

oah/gemini-2.5-pro
Closed — Premium

Deploy Gemini 2.5 Pro with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Google Gemini
Input: $1.25/MOutput: $10.00/M

Gemini 2.5 Flash

oah/gemini-2.5-flash
Closed — Mini

Deploy Gemini 2.5 Flash with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Google Gemini
Input: $0.30/MOutput: $2.50/M

Gemini 2.5 Flash Lite

oah/gemini-2.5-flash-lite
Closed — Mini

Deploy Gemini 2.5 Flash Lite with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Google Gemini
Input: $0.10/MOutput: $0.40/M

Gemini 2.0 Flash

oah/gemini-2.0-flash
Closed — Mini

Deploy Gemini 2.0 Flash with built-in PII redaction and Hub governance. Available on Managed Credits and BYOK.

Google Gemini
Input: $0.10/MOutput: $0.40/M

Gemini Pricing Comparison (per 1M tokens, USD)

Input / Output pricing by provider. Managed Mode adds a 30% managed markup. Pro BYOK = 0% markup.

ModelParamsContextVisionGoogle Gemini
Gemini 2.5 Pro
oah/gemini-2.5-pro
No
$1.25/$10.00
Gemini 2.5 Flash
oah/gemini-2.5-flash
No
$0.30/$2.50
Gemini 2.5 Flash Lite
oah/gemini-2.5-flash-lite
No
$0.10/$0.40
Gemini 2.0 Flash
oah/gemini-2.0-flash
No
$0.10/$0.40

Gemini Direct vs AI Security Gateway

What you get at each pricing tier. Hub adds security, governance, and multi-provider routing on top of raw API access.

ModeWhat You PayPII RedactionBudget CapsRoutingAudit Trail
Direct to GoogleProvider pricing onlyNoneNoneManualNone
Hub — Managed ModeProvider + 30% markup28+ PII typesPer-key hard capsSmart RouterFull compliance log
Hub — Pro BYOK ($29/mo)Direct to provider (0% markup)28+ PII typesPer-key hard capsSmart RouterFull compliance log

Popular Use Cases

1

Long-document analysis and RAG over large corpora

2

Multimodal applications processing images and documents

3

Multilingual customer support and translation

4

High-throughput batch processing (Flash variant)

Integration — 2 Lines

from openai import OpenAI

client = OpenAI(
    base_url="https://api.aisecuritygateway.ai/v1",
    api_key="your_hub_api_key"
)

# Use any virtual model name from the pricing table above
response = client.chat.completions.create(
    model="oah/gemini-2.5-pro",
    messages=[{"role": "user", "content": "Hello!"}]
)

Use any virtual model name from the pricing table above (prefixed with oah/). Works with the standard OpenAI SDK. Every request is PII-scanned before reaching Google (Closed Source).

Frequently Asked Questions

What is the Gemini API pricing on AI Security Gateway?
Gemini API pricing depends on the model variant. In Managed Mode, we add a 30% markup on Google's published API pricing. With Pro BYOK, use your own Google AI key at 0% markup. See the live pricing table above.
What is Google AI API pricing in 2026?
Google AI API pricing varies by Gemini model tier. Gemini 2.0 Flash is the most cost-effective, while Gemini 2.5 Pro is the premium option with 1M+ token context. The pricing table above reflects current rates.
What is the Gemini 2 API cost?
Gemini 2.0 Flash and Gemini 2.5 Pro have different pricing tiers. Check the comparison table above for per-token input/output costs across providers.
Can I use Gemini with PII redaction?
Yes. Our AI Firewall scans all Gemini-bound prompts for 28+ PII types before they reach Google's servers.
Does Gemini support vision through AISG?
Yes. Vision-capable Gemini models work through AISG, including OCR-based DLP scanning for images.

Deploy Gemini with Enterprise-Grade Security

Get started with 1,000,000 free credits. Every Gemini request is PII-scanned, cost-optimized, and fully logged — zero configuration.

Not ready yet? Get notified about Gemini updates:

Explore Other Model Families

Model registry last updated: 2026-04-18T17:41:46.389Z. Pricing shown is the lowest available rate across providers (per 1M tokens, USD). Actual pricing depends on provider and plan.