Pricing & Plans

Official WatchLLM documentation for pricing & plans.

Pricing & Plans#

WatchLLM offers flexible plans to match your usage, from hobby projects to enterprise deployments.

Plans Overview#

Feature Free Starter Pro
Price $0/mo $29/mo $49/mo
Monthly Requests 50,000 250,000 1,000,000
Rate Limit 10 rpm 50 rpm 200 rpm
Data Retention 7 days 30 days 90 days
Semantic Caching
Analytics Dashboard Basic Full Full
BYOK Support
A/B Testing
Priority Support
Custom Domains

Free Plan#

Perfect for trying out WatchLLM and small personal projects.

  • 50,000 requests/month — enough for prototyping and development
  • 10 requests/minute rate limit
  • 7-day data retention — usage logs available for 1 week
  • No credit card required
  • After quota is reached, requests switch to cache-only mode (no new upstream calls)

Starter Plan — $29/month#

Ideal for small teams and production applications.

  • 250,000 requests/month with overage at $0.50/1,000 requests
  • 50 requests/minute rate limit
  • 30-day data retention for analytics
  • A/B testing — compare models and providers
  • Overage cap: Up to 200,000 additional requests per month

Pro Plan — $49/month#

Best for high-traffic applications and teams that need advanced features.

  • 1,000,000 requests/month with overage at $0.40/1,000 requests
  • 200 requests/minute rate limit
  • 90-day data retention for deep analytics
  • Priority support — faster response times
  • Custom domains — use your own domain for the proxy
  • Overage cap: Up to 750,000 additional requests per month

Enterprise#

For organizations with custom requirements:

  • Unlimited requests with negotiated pricing
  • Custom rate limits tailored to your traffic patterns
  • Unlimited data retention
  • Dedicated support with SLA guarantees
  • Self-hosting support with deployment assistance
  • HIPAA compliance available
  • SSO / SAML integration

Contact us for Enterprise pricing.

Annual Billing#

Save 20% with annual billing:

Plan Monthly Annual (per month) Annual Total
Starter $29/mo $23.20/mo $278.40/yr
Pro $49/mo $39.20/mo $470.40/yr

How Billing Works#

Subscription Billing#

  • Billed at the start of each billing cycle (monthly or annually)
  • Processed securely through Stripe
  • Cancel anytime — your plan remains active until the end of the billing period
  • Downgrade takes effect at the next billing cycle

Overage Billing#

  • Overages are calculated at the end of each month
  • Charged separately from your subscription
  • Detailed usage breakdown available in the dashboard
  • Overage caps prevent unexpected bills

Comparing Plans#

Which Plan Should I Choose?#

  • Free: You're evaluating WatchLLM or building a side project
  • Starter: You're running a production app with moderate traffic
  • Pro: You need high throughput, longer analytics retention, or priority support
  • Enterprise: You need custom limits, SLAs, or compliance certifications

Cost Savings Example#

A typical application making 100,000 OpenAI API calls/month at an average cost of $0.03/call:

Scenario Monthly Cost With WatchLLM (40% cache hit)
Without WatchLLM $3,000
With Starter Plan $1,829 ($1,800 API + $29 WatchLLM)
Savings $1,171/month (39%)

FAQ#

Can I switch plans at any time? Yes. Upgrades take effect immediately. Downgrades take effect at the next billing cycle.

Is there a free trial for paid plans? The Free plan serves as a trial. You can evaluate all core features before upgrading.

Do cached responses count toward my quota? Yes, all requests count toward your monthly quota, including cache hits.

What happens when I hit my quota? Free plan: cache-only mode. Paid plans: overage billing kicks in (up to the overage cap).

Can I get a refund? Contact us within 14 days of your first payment for a full refund.

© 2026 WatchLLM. All rights reserved.