A CLI proxy that routes Warp terminal AI requests to a local LLM instead of the cloud

A single GitHub issue asking Warp terminal to support local LLMs (Ollama, LMStudio, vLLM) has 1,378 upvotes. The vendor described the request as 'on the roadmap' in an April 2025 livestream, gave no ETA, then locked the thread in August 2025, citing +1 spam on a thread with 900 reactions. The core blocker is commercial: Warp's monetization depends on routing AI through their cloud. As a result, developers on production servers, security-conscious teams, and users in air-gapped environments cannot use Warp's AI features. Two tiny orphan repos (open-warp, 3 stars; harp, 1 star) exist, but both are unshipped hacks with no install path. The clean product is a CLI shim that intercepts Warp's local AI API socket and proxies it to any OpenAI-compatible local server, with a config file for model routing and fallback.
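The config-driven model routing and fallback described above could look something like the following minimal Python sketch. Everything in it is hypothetical: the `ROUTES` layout, the route name, the model names, and the `pick_backend` helper are illustrative assumptions, not the format of any existing tool. The ports are the usual defaults for Ollama (11434) and LM Studio (1234) and may differ on a given machine.

```python
from typing import Callable, Optional

# Hypothetical routing table: each route lists backends in priority order.
# Base URLs assume default local ports (Ollama 11434, LM Studio 1234).
ROUTES = {
    "default": [
        {"base_url": "http://127.0.0.1:11434/v1", "model": "llama3"},
        {"base_url": "http://127.0.0.1:1234/v1", "model": "local-model"},
    ],
}

def pick_backend(routes: dict, route: str,
                 is_up: Callable[[str], bool]) -> Optional[dict]:
    """Return the first reachable backend for a route, or None if all are down.

    Unknown route names fall back to the "default" route.
    """
    for backend in routes.get(route, routes.get("default", [])):
        if is_up(backend["base_url"]):
            return backend
    return None
```

The health check is passed in as a callable so the selection logic can be exercised without a running server; a real shim would probably probe each backend over HTTP.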

Product Idea from this Signal

A CLI proxy that intercepts Warp terminal AI requests and routes them to a local LLM without cloud exposure

1.6k ▲

Warp terminal routes all AI inference through its own cloud servers. Even after adding a custom inference endpoint, Warp rejects localhost URLs and requires users to expose their local Ollama or LMStudio instance to the public internet via ngrok or Cloudflare Tunnel. This means every AI-assisted terminal session leaks shell context, command history, and file paths to Warp's servers and a public tunnel endpoint, with no opt-out. A local socket-level proxy intercepts Warp's AI requests before they leave the machine, rewrites the destination to a local model endpoint, and returns the response in the same format Warp expects. Zero cloud exposure, zero tunneling, full Warp UX intact. The proxy runs as a background daemon, requires no Warp modification, and works with any OpenAI-compatible local backend (Ollama, LMStudio, llama.cpp, vLLM). Targets privacy-conscious developers, enterprise teams with data-handling constraints, and air-gapped environments where routing terminal context to a third-party cloud is not acceptable.
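The "rewrite the destination" step can be sketched in Python as below. This is a sketch under stated assumptions, not a description of Warp's actual traffic: the source does not specify Warp's request format, so the cloud URL is a placeholder, and `LOCAL_BASE`, `DROP_HEADERS`, `rewrite_destination`, and `forward_headers` are hypothetical names. A working proxy would also need to translate request and response bodies into whatever format Warp expects.

```python
from urllib.parse import urlsplit, urlunsplit

# Assumption: an OpenAI-compatible server on Ollama's default local port.
LOCAL_BASE = "http://127.0.0.1:11434"

# Illustrative list of headers that should not reach the local backend.
DROP_HEADERS = {"authorization", "host", "cookie"}

def rewrite_destination(url: str, local_base: str = LOCAL_BASE) -> str:
    """Swap scheme and host for the local backend; keep path and query."""
    src, dst = urlsplit(url), urlsplit(local_base)
    return urlunsplit((dst.scheme, dst.netloc, src.path, src.query, ""))

def forward_headers(headers: dict) -> dict:
    """Drop cloud credentials and the original Host before forwarding."""
    return {k: v for k, v in headers.items() if k.lower() not in DROP_HEADERS}
```

Stripping `Authorization` and `Cookie` before forwarding keeps cloud credentials out of local server logs, which fits the zero-exposure goal stated above.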

Tags: dev-tools, privacy, local-ai, cli, warp, ollama, proxy, terminal

Competitive · 110 leads

Score Breakdown

GitHub: 1,517

HN: 110

Social Proof (2 sources)

GitHub: Make Warp work with Local Language Models (like Ollama models)

hmdz105 · 2/26/2024 · 1,517

HN: Warp sends a terminal session to LLM without user consent

ykurtov · 8/19/2025 · 110

Gap Assessment

Wide Open: No dedicated solution exists

Vendor deferred indefinitely for business reasons. Only two orphan repos (3 and 1 star) attempt this. No credible installable product exists. Clean third-party CLI shim slot.

Virality Score

1,627

across 0 platforms

Details

Signal: issue

Ecosystem: dev_tool_cli

Sources: 2

Platforms: 0

Updated: unknown

Trend: → stable
