Simple, Transparent Pricing

Cloud-hosted SaaS or on-premises: same platform, your terms. Start free with up to 5 seats, scale when you're ready.

Choose Your Deployment

Cloud-hosted SaaS or on-premises: same platform, your terms.

Per seat per month, with requests pooled across your team. Start free, scale when ready.

Annual license based on cluster size. Unlimited users and requests within cluster capacity.

Per seat per year – save 20% vs. monthly. Requests pool across your team.

🎁 Summer discount: get 30% off any annual plan, valid until June 30, 2026 🎁
* Summer discount is not combinable with the annual billing discount.
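
Because the two discounts cannot be combined, it can help to compare them side by side. The sketch below prices a 10-seat Starter team for one year under each discount. It assumes that either discount is taken off the monthly list price multiplied by twelve, which this page does not spell out; `annual_cost` is a hypothetical helper, not part of any product API.

```python
STARTER_PER_SEAT_MONTHLY = 12.00  # Starter list price per seat per month
ANNUAL_DISCOUNT = 0.20            # annual billing discount
SUMMER_DISCOUNT = 0.30            # summer promotion (not combinable with the above)


def annual_cost(seats: int, discount: float) -> float:
    """Yearly cost for `seats` seats with exactly one discount applied.

    Assumption: the discount applies to monthly list price x 12.
    """
    list_price = seats * STARTER_PER_SEAT_MONTHLY * 12
    return round(list_price * (1 - discount), 2)


seats = 10
with_annual = annual_cost(seats, ANNUAL_DISCOUNT)
with_summer = annual_cost(seats, SUMMER_DISCOUNT)
print(with_annual, with_summer)  # 1152.0 1008.0
```

Under these assumptions the summer discount is the better of the two while it is valid, since only one of them can apply.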

Monthly | Annually (−20%)

Free

$0
Up to 5 seats
  • ✓ AI Gateway with smart routing
    ✓ Basic guardrails
    ✓ Cost dashboard
    ✓ 5 MCP connectors
    ✓ 7-day log retention
    ✓ Community support
Sign Up Free

Starter

$12/seat/mo
Min 5 seats
  • ✓ Everything in Free, plus:
    ✓ Advanced filtering & PII detection
    ✓ Basic tracing
    ✓ 10 MCP connectors
    ✓ Role-based access control
    ✓ 30-day log retention
    ✓ Email support · 99.5% SLA
Try Free

Enterprise

Custom
Volume pricing · Unlimited requests
  • ✓ Everything in Business, plus:
    ✓ Unlimited seats & requests
    ✓ SSO/SAML (enterprise providers)
    ✓ Unlimited MCP connectors
    ✓ Custom retention & audit export
    ✓ Dedicated success manager
    ✓ White-glove onboarding · 99.99% SLA
Talk to Sales

Small Cluster

$6,500/year
Up to 5 nodes · 40 vCPUs
  • ✓ Full AI Gateway + smart routing
    ✓ Full analytics & tracing
    ✓ All 20+ MCP connectors
    ✓ Agent builder (up to 10 agents)
    ✓ Host up to 3 local models
    ✓ SSO / SAML / LDAP
    ✓ Quarterly updates · Business hours support
Request Quote

Large Cluster

$85K+/year
21+ nodes · Custom vCPUs · Unlimited
  • ✓ Everything in Medium, plus:
    ✓ Unlimited agents & local models
    ✓ Custom MCP connectors
    ✓ Continuous updates
    ✓ 24/7 dedicated support + engineer
    ✓ White-glove onboarding (4 weeks)
    ✓ Included professional services hours
Talk to Sales

Requests pool across all seats. Overages: $1.00/1K (Starter), $0.60/1K (Business). Model costs are passed through at provider rates.
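
To illustrate how the overage rates above translate into a charge, here is a minimal sketch. The size of the pooled request allowance is not stated on this page, so `pooled_allowance` is a hypothetical parameter, and the function assumes overages are billed pro rata per request rather than per started block of 1K.

```python
# Overage rates per 1,000 requests, as listed above.
OVERAGE_PER_1K = {"Starter": 1.00, "Business": 0.60}


def overage_charge(plan: str, requests_used: int, pooled_allowance: int) -> float:
    """Overage charge for requests beyond the team's pooled allowance.

    Assumptions: `pooled_allowance` is a placeholder for the plan's
    included quota, and billing is pro rata per request.
    """
    extra = max(0, requests_used - pooled_allowance)
    return round(extra / 1000 * OVERAGE_PER_1K[plan], 2)


# Example with a made-up allowance of 50,000 pooled requests.
starter_fee = overage_charge("Starter", 62_500, 50_000)
business_fee = overage_charge("Business", 62_500, 50_000)
print(starter_fee, business_fee)  # 12.5 7.5
```

Model costs are billed separately at provider rates and are not included in this figure.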

Compare Plans

Full Feature Comparison

See exactly what's included in each plan

| Feature | Free | Starter | Business | Enterprise |
| --- | --- | --- | --- | --- |
| Seats & Usage | | | | |
| Seats | Up to 5 | 5+ | | Unlimited |
| Overage rate | - | $1.00/1K | | Custom |
| AI Gateway | | | | |
| Smart routing (LLMs + MCP) | ✓ | ✓ | | ✓ |
| Prompt optimization | System prompt | System prompt per provider | | Full set |
| Model output arbitrage | - | - | | ✓ |
| Retrieval Augmented Generation | - | - | | ✓ |
| Security & Guardrails | | | | |
| Basic guardrails | ✓ | ✓ | | ✓ |
| Advanced filtering & PII detection | - | ✓ | | ✓ |
| DLP & policy enforcement | - | - | | ✓ |
| Analytics & Tracing | | | | |
| Cost dashboard | ✓ | ✓ | | ✓ |
| Basic tracing | - | ✓ | | ✓ |
| Full tracing & debugging suite | - | - | | ✓ |
| Department-level billing | - | - | | ✓ |
| Log retention | 7 days | 30 days | | Custom |
| Audit export | - | - | | ✓ |
| Agents & MCP Connectors | | | | |
| MCP connectors | 5 | 10 | | Unlimited |
| Agent builder | - | - | | ✓ |
| RAG system for Agents | - | - | | ✓ |
| Access & Support | | | | |
| Role-based access control | - | ✓ | | ✓ |
| SSO / SAML | ✓ | ✓ | | Enterprise providers |
| Uptime SLA | - | 99.5% | | 99.99% |
| Support | Community | Email | | Dedicated CSM |

| Feature | Small Cluster | Medium Cluster | Large Cluster |
| --- | --- | --- | --- |
| Infrastructure | | | |
| Nodes | Up to 5 | | 21+ |
| vCPUs | 40 | | Custom |
| GPU support | - | | ✓ |
| Air-gapped installation | - | | ✓ |
| Users & requests | Unlimited | | Unlimited |
| AI Gateway | | | |
| Smart routing (LLMs + MCP) | ✓ | | ✓ |
| Prompt optimization & arbitrage | ✓ | | ✓ |
| Local models hosted | Up to 3 | | Unlimited |
| GPU-accelerated inference | - | | ✓ |
| Security & Guardrails | | | |
| Full guardrails & PII detection | ✓ | | ✓ |
| DLP & policy enforcement | ✓ | | ✓ |
| SSO / SAML / LDAP | ✓ | | ✓ |
| Analytics & Tracing | | | |
| Full analytics & tracing | ✓ | | ✓ |
| Department-level billing | ✓ | | ✓ |
| Log retention | Unlimited (local) | | Unlimited (local) |
| Audit export | ✓ | | ✓ |
| Agents & MCP Connectors | | | |
| MCP connectors | All 20+ | | All + custom |
| Agent builder | ✓ | | ✓ |
| Max agents | 10 | | Unlimited |
| Custom MCP connectors | - | | ✓ |
| Corporate data training | ✓ | | ✓ |
| Deployment & Support | | | |
| Update frequency | Quarterly | | Continuous |
| Support | Business hours | | 24/7 dedicated + engineer |
| Onboarding | Self-serve | | White-glove (4 weeks) |
| Professional services | - | | Included hours |
| Eval license | 30 days | | 30 days |