clawsmith.com/idea/detect-guidance-injection-openclaw-skills-runtime-behavioral-sandbox

Idea, Competitive, CLI, OPEN-SOURCE, SECURITY, Live

A runtime behavioral sandbox that detects guidance injection attacks in OpenClaw skills by observing what agents actually do instead of scanning what skills say

Existing OpenClaw skill scanners rely on static analysis and LLM-based content scanning to flag malicious skills before installation. The Trojan's Whisper paper (March 2026) proved that 94% of guidance injection attacks evade both approaches, because the malicious payload is disguised as routine operational guidance rather than explicit instructions. Meanwhile, 12% of ClawHub's skill registry has been compromised at some point in 2026. The gap is clear. Instead of scanning skill text, this product spins up an isolated OpenClaw instance, installs the skill, runs a battery of natural user prompts, and observes what the agent actually does. Credential access, file writes outside the sandbox, network exfiltration, and privilege escalation attempts are all flagged as behavioral anomalies, regardless of how the skill's guidance file describes them.
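As a rough illustration of flagging behavior rather than text, the sketch below maps observed sandbox events to the four anomaly categories named above. Everything in it is an assumption made for illustration: the `Event` record, the sandbox root path, the credential path hints, the host allowlist, and the list of privileged commands are hypothetical and do not come from OpenClaw or any existing tool.

```python
import posixpath
from dataclasses import dataclass
from pathlib import PurePosixPath


# Hypothetical event record; a real sandbox would populate these from
# syscall, file-access, and network monitoring.
@dataclass(frozen=True)
class Event:
    kind: str    # "file_read", "file_write", "net_connect", or "exec"
    target: str  # absolute path, "host:port", or command name


# All values below are illustrative assumptions, not OpenClaw defaults.
SANDBOX_ROOT = PurePosixPath("/sandbox/workdir")
CREDENTIAL_HINTS = (".ssh", ".aws", ".env", "credentials")
ALLOWED_HOSTS = frozenset({"localhost"})
PRIVILEGED_COMMANDS = frozenset({"sudo", "su", "chmod", "chown"})


def _normalized(path: str) -> PurePosixPath:
    # Collapse ".." segments so traversal cannot fake an in-sandbox path.
    return PurePosixPath(posixpath.normpath(path))


def _inside_sandbox(path: str) -> bool:
    p = _normalized(path)
    return p == SANDBOX_ROOT or SANDBOX_ROOT in p.parents


def flag_event(event: Event) -> list[str]:
    """Return the anomaly labels triggered by a single observed event."""
    flags = []
    if event.kind in ("file_read", "file_write"):
        parts = _normalized(event.target).parts
        if any(hint in parts for hint in CREDENTIAL_HINTS):
            flags.append("credential_access")
    if event.kind == "file_write" and not _inside_sandbox(event.target):
        flags.append("write_outside_sandbox")
    if event.kind == "net_connect":
        host = event.target.rsplit(":", 1)[0]
        if host not in ALLOWED_HOSTS:
            flags.append("network_egress")
    if event.kind == "exec" and event.target in PRIVILEGED_COMMANDS:
        flags.append("privilege_escalation")
    return flags
```

The rules look only at what the agent did, so a skill whose guidance file frames a credential read as "routine backup" would still be flagged. Fixed rules like these would likely be a first pass; scoring against known-good behavior is a separate step.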

Demand Breakdown

4,457

2,340

Social Proof (4 sources)

Snyk ToxicSkills: 36% of ClawHub skills have prompt injection (2026-05-01, 2,450 RD)

20% of ClawHub skills are malicious (2026-01-15, 2,340 HN)

ClawHavoc: Malicious ClawHub Skills (2026-01-15, 1,180 HN)

A GitHub Issue Title Compromised 4,000 Developer Machines (2026-03-01, 827)

Gap Assessment

Competitive: Multiple tools exist but differentiation opportunities remain

Four tools exist (VirusTotal Integration (built-in), Cisco DefenseClaw, SkillFortify, SecureClaw), but gaps remain. VirusTotal Integration only catches known malware signatures and is completely blind to guidance injection, which uses natural language rather than malware binaries; Trojan's Whisper proved a 94% evasion rate. Cisco DefenseClaw uses static and LLM-based scanning, the exact category that Trojan's Whisper proved 94% evasion against, and offers no runtime behavioral analysis.

Features (3 agent-ready prompts)

Isolated Docker sandbox that installs one OpenClaw skill at a time and runs 50+ natural user prompts while monitoring all syscalls, file access, and network requests


Behavioral anomaly classifier that compares skill-under-test actions against a baseline of 100 known-good skills to score deviation


Pre-install gate that blocks ClawHub skill installation when behavioral risk score exceeds configurable threshold

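The classifier and pre-install gate described in the features could be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the product's method: the per-run feature names (`net_connects`, `files_written`), the use of a per-feature z-score, the standard-deviation floor of 1.0, and the default threshold of 3.0 are all hypothetical choices. The baseline is assumed to be non-empty.

```python
from statistics import mean, pstdev


def deviation_score(baseline: list[dict[str, float]],
                    observed: dict[str, float]) -> float:
    """Largest per-feature z-score of `observed` relative to baseline runs.

    Only activity above the baseline mean raises the score; a skill that
    does less than the known-good skills scores 0.0.
    """
    worst = 0.0
    for feature, value in observed.items():
        # A feature missing from a baseline run counts as zero activity.
        history = [run.get(feature, 0.0) for run in baseline]
        mu = mean(history)
        # Floor the spread (assumed value) so near-constant baseline
        # features neither divide by zero nor blow up the score.
        sigma = max(pstdev(history), 1.0)
        worst = max(worst, (value - mu) / sigma)
    return worst


def allow_install(score: float, threshold: float = 3.0) -> bool:
    """Pre-install gate: permit installation only at or below the threshold."""
    return score <= threshold


# Usage: three known-good runs stand in for the 100-skill baseline.
baseline = [
    {"net_connects": 2, "files_written": 5},
    {"net_connects": 4, "files_written": 7},
    {"net_connects": 3, "files_written": 6},
]
benign = deviation_score(baseline, {"net_connects": 3, "files_written": 6})
suspect = deviation_score(baseline, {"net_connects": 40, "files_written": 6})
print(allow_install(benign), allow_install(suspect))
```

Taking the maximum across features, rather than an average, means a single extreme behavior such as a burst of outbound connections is enough to block installation. The threshold is exposed as a parameter to reflect the configurable gate.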

Competitive Landscape (FREE)

Product	Does	Missing
VirusTotal Integration (built-in)	Scans skill files against VirusTotal malware database on ClawHub upload	Only catches known malware signatures. Completely blind to guidance injection which uses natural language, not malware binaries. 94% evasion rate proven by Trojan's Whisper.
Cisco DefenseClaw	Open-source agent security governance framework with static skill scanning	Uses static and LLM-based scanning. Falls into the exact category that Trojan's Whisper proved 94% evasion against. No runtime behavioral analysis.
SkillFortify	Formal verification for AI agent skills using mathematical proofs of behavior	Formal verification works on well-defined properties but cannot model the emergent behaviors that guidance injection produces through context manipulation across prompts.
SecureClaw	Maps OpenClaw security posture to OWASP Agent Security Initiative standards	Compliance mapping tool, not a detection tool. Tells you what risks exist but does not actively block malicious skills at install time.

Aggregate Score

17,567

0 leads found

Details

Type: Product Idea

Competitors: 4

Features: 3

Issues: 4

Leads: 0

Source Signals


7.9K	ClawHavoc Campaign: 824+ Malicious ClawHub Skills, 12% of Marketplace Is Malware
4.8K	Snyk ToxicSkills: 36% of ClawHub Skills Have Prompt Injection, 1,467 Malicious Payloads
4K	OpenClaw Memory Poisoning: SOUL.md Injection Enables Time-Shifted Logic Bomb Attacks
827	Clinejection: AI Prompt Injection via GitHub Issue Title Installs OpenClaw on 4,000 Developer Machines
0	Trojan's Whisper: Guidance Injection Attack Evades 94% of OpenClaw Scanners

Related Ideas


0	A CLI tool that scans a running OpenClaw instance for every known CVE, exposed endpoint, malicious skill, and token scope violation, then outputs a prioritized remediation checklist
0	A CLI tool that scans a running OpenClaw instance, scores its security posture, maps plugin dependencies to alternative platforms, and outputs a stay-or-migrate recommendation with effort estimates
0	An open-source policy engine that enforces per-tool, per-user, and per-context execution rules on OpenClaw agents before any action fires