A CLI proxy that routes Warp terminal AI requests to a local LLM instead of the cloud
Warp terminal has 1,378 upvotes on a single GitHub issue asking for local LLM support (Ollama, LMStudio, vLLM). The vendor acknowledged it in an April 2025 livestream as 'on the roadmap' with no ETA, then locked the thread in August 2025 citing +1 spam with 900 reactions. The core blocker is business: Warp's monetization depends on routing AI through their cloud. Developers on production servers, security-conscious teams, and air-gapped environments cannot use Warp's AI features. Two tiny orphan repos (open-warp, 3 stars; harp, 1 star) exist but are unshipped hacks with no install path. The clean product is a CLI shim that intercepts Warp's local AI API socket and proxies it to any OpenAI-compatible local server, with a config file for model routing and fallback.
A CLI proxy that intercepts Warp terminal AI requests and routes them to a local LLM without cloud exposure
1.6k โฒScore Breakdown
Social Proof 2 sources
Gap Assessment
Vendor deferred indefinitely for business reasons. Only two orphan repos (3 and 1 star) attempt this. No credible installable product exists. Clean third-party CLI shim slot.