Build faster.
Save more.
Welcome to WatchLLM documentation. Here you'll find everything you need to implement semantic caching at the edge.
Join Discord
Get help from the community
Security Policy
How we handle your data
Getting Started
Everything you need to get WatchLLM up and running in your project.
Guides & Concepts
Deep dive into how WatchLLM handles semantic caching and analytics.
Python SDK
Complete reference for the WatchLLM Python SDK with auto-instrumentation.
Node.js SDK
Complete reference for the WatchLLM Node.js/TypeScript SDK.
Self-Hosting Guide
Enterprise deployment guide for self-hosted WatchLLM infrastructure.
Architecture
Understanding the edge proxy system design.
Analytics Guide
Mastering cost savings and performance metrics.
Code Examples
Boilerplate for JS, Python, and cURL.
API & Reference
Technical specifications and error resolution guides.
SDKs
Complete SDK documentation for Node.js and Python integrations.
Semantic A/B Testing
Compare performance and cost between different LLM providers in real time. Automatically route requests to the most efficient variant.
View Example Implementation
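To make the routing idea concrete, here is a minimal sketch of how an A/B router could pick the cheaper of two LLM provider variants using an epsilon-greedy policy. This is an illustration only, not the WatchLLM API: the `ABRouter` class, its method names, and the variant labels are all hypothetical.

```python
import random


class ABRouter:
    """Hypothetical epsilon-greedy router over LLM provider variants.

    Tracks average cost per call for each variant and routes most
    traffic to the cheapest one, while occasionally exploring others.
    """

    def __init__(self, variants, epsilon=0.1):
        self.epsilon = epsilon
        self.stats = {v: {"calls": 0, "total_cost": 0.0} for v in variants}

    def pick(self):
        # Try each variant at least once before comparing averages.
        untried = [v for v, s in self.stats.items() if s["calls"] == 0]
        if untried:
            return untried[0]
        # Explore a random variant with probability epsilon...
        if random.random() < self.epsilon:
            return random.choice(list(self.stats))
        # ...otherwise exploit the variant with the lowest average cost.
        return min(
            self.stats,
            key=lambda v: self.stats[v]["total_cost"] / self.stats[v]["calls"],
        )

    def record(self, variant, cost):
        # Feed observed per-request cost back into the running stats.
        s = self.stats[variant]
        s["calls"] += 1
        s["total_cost"] += cost


# Hypothetical usage with two made-up variant labels:
router = ABRouter(["variant-a", "variant-b"], epsilon=0.0)
router.record("variant-a", 0.010)
router.record("variant-b", 0.001)
print(router.pick())
```

With `epsilon=0.0` the router always exploits, so after recording one call per variant it routes to the cheaper `variant-b`. A production router would also weigh latency and error rates, not cost alone.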