An SDK that preserves voice-agent session state across mid-call interrupts and cross-agent handoffs
Real-time voice agents built on OpenAI Realtime, Pipecat, LiveKit, Vapi, Retell, or Bland lose all in-flight state the moment a user barges in while a tool call is executing or when a call is routed to a second agent. The audio pipeline cancels or replays; the tool result is orphaned or replayed out of order; the new agent starts cold. Developers currently stitch together their own checkpoint-and-replay wrappers, which are fragile, untested at scale, and re-built from scratch for every framework. This SDK provides a framework-agnostic middleware layer that checkpoints tool-call state before and during execution, reconciles barge-in events with in-flight tool results, serializes full conversational context for cross-agent handoffs, and recovers dropped or stale audio sessions from the last clean checkpoint. It ships as a drop-in adapter for every major voice-agent framework and exposes a recovery-event observability stream so teams can measure and tune recovery quality in production.
Demand Breakdown
Social Proof 2 sources
Gap Assessment
6 tools exist (Pipecat (Daily), LiveKit Agents, Vapi, Retell AI, Bland AI, Inworld Realtime API) but gaps remain: No checkpoint of tool-call state before cancellation; no replay or recovery primitive; cross-agent handoff is user-implemented.; No mid-tool-call state serialization; no cross-agent session transfer primitive; recovery from dropped calls starts from zero..
Features8 agent-ready prompts
Competitive LandscapeFREE
| Product | Does | Missing |
|---|---|---|
| Pipecat (Daily) | Open-source Python framework for real-time voice agents; handles barge-in by canceling in-flight pipeline tasks when VAD fires. | No checkpoint of tool-call state before cancellation; no replay or recovery primitive; cross-agent handoff is user-implemented. |
| LiveKit Agents | Real-time audio/video infra with an agent SDK; good turn detection and interrupt handling at the WebRTC layer. | No mid-tool-call state serialization; no cross-agent session transfer primitive; recovery from dropped calls starts from zero. |
| Vapi | Managed voice-agent API with phone/web support; barge-in cancels current speech output. | No SDK-level hook to snapshot tool-call progress; no structured handoff payload for agent transfer; session crash on mid-tool interrupt. |
| Retell AI | Managed voice agent platform with low-latency TTS/STT and barge-in. | No tool-call checkpoint or resumption; no cross-agent state package; interrupt during tool call drops result. |
| Bland AI | Voice agent automation for phone calls; handles interruptions at the conversation level. | No developer-facing state checkpoint API; no mid-tool-call recovery; no cross-agent handoff state spec. |
| Inworld Realtime API | Realtime API with an opinionated mid-tool interrupt policy (idempotent reads complete, mutations cancel on contradiction). | Policy is opinionated and internal; no developer SDK to configure checkpoint/resume behavior; no cross-agent handoff primitive. |
Leads106BUILDER
Sign in to unlock full access.