Pricing & Plans#
WatchLLM offers flexible plans to match your usage, from hobby projects to enterprise deployments.
Plans Overview#
| Feature | Free | Starter | Pro |
|---|---|---|---|
| Price | $0/mo | $29/mo | $49/mo |
| Monthly Requests | 50,000 | 250,000 | 1,000,000 |
| Rate Limit | 10 rpm | 50 rpm | 200 rpm |
| Data Retention | 7 days | 30 days | 90 days |
| Semantic Caching | ✅ | ✅ | ✅ |
| Analytics Dashboard | Basic | Full | Full |
| BYOK Support | ✅ | ✅ | ✅ |
| A/B Testing | — | ✅ | ✅ |
| Priority Support | — | — | ✅ |
| Custom Domains | — | — | ✅ |
Free Plan#
Perfect for trying out WatchLLM and small personal projects.
- 50,000 requests/month — enough for prototyping and development
- 10 requests/minute rate limit
- 7-day data retention — usage logs available for 1 week
- No credit card required
- After quota is reached, requests switch to cache-only mode (no new upstream calls)
Starter Plan — $29/month#
Ideal for small teams and production applications.
- 250,000 requests/month with overage at $0.50/1,000 requests
- 50 requests/minute rate limit
- 30-day data retention for analytics
- A/B testing — compare models and providers
- Overage cap: Up to 200,000 additional requests per month
Pro Plan — $49/month#
Best for high-traffic applications and teams that need advanced features.
- 1,000,000 requests/month with overage at $0.40/1,000 requests
- 200 requests/minute rate limit
- 90-day data retention for deep analytics
- Priority support — faster response times
- Custom domains — use your own domain for the proxy
- Overage cap: Up to 750,000 additional requests per month
Enterprise#
For organizations with custom requirements:
- Unlimited requests with negotiated pricing
- Custom rate limits tailored to your traffic patterns
- Unlimited data retention
- Dedicated support with SLA guarantees
- Self-hosting support with deployment assistance
- HIPAA compliance available
- SSO / SAML integration
Contact us for Enterprise pricing.
Annual Billing#
Save 20% with annual billing:
| Plan | Monthly | Annual (per month) | Annual Total |
|---|---|---|---|
| Starter | $29/mo | $23.20/mo | $278.40/yr |
| Pro | $49/mo | $39.20/mo | $470.40/yr |
How Billing Works#
Subscription Billing#
- Billed at the start of each billing cycle (monthly or annually)
- Processed securely through Stripe
- Cancel anytime — your plan remains active until the end of the billing period
- Downgrade takes effect at the next billing cycle
Overage Billing#
- Overages are calculated at the end of each month
- Charged separately from your subscription
- Detailed usage breakdown available in the dashboard
- Overage caps prevent unexpected bills
Comparing Plans#
Which Plan Should I Choose?#
- Free: You're evaluating WatchLLM or building a side project
- Starter: You're running a production app with moderate traffic
- Pro: You need high throughput, longer analytics retention, or priority support
- Enterprise: You need custom limits, SLAs, or compliance certifications
Cost Savings Example#
A typical application making 100,000 OpenAI API calls/month at an average cost of $0.03/call:
| Scenario | Monthly Cost | With WatchLLM (40% cache hit) |
|---|---|---|
| Without WatchLLM | $3,000 | — |
| With Starter Plan | — | $1,829 ($1,800 API + $29 WatchLLM) |
| Savings | — | $1,171/month (39%) |
FAQ#
Can I switch plans at any time? Yes. Upgrades take effect immediately. Downgrades take effect at the next billing cycle.
Is there a free trial for paid plans? The Free plan serves as a trial. You can evaluate all core features before upgrading.
Do cached responses count toward my quota? Yes, all requests count toward your monthly quota, including cache hits.
What happens when I hit my quota? Free plan: cache-only mode. Paid plans: overage billing kicks in (up to the overage cap).
Can I get a refund? Contact us within 14 days of your first payment for a full refund.