Prompt Shield
AI 代理的 Prompt 注入防火墙。113 种检测模式,14 种威胁类别,零依赖。防止虚假权威、命令注入、内存中毒、技能恶意软件、加密垃圾邮件等。具有强制对等审查的哈希链防篡改白名单。Claude 代码钩子集成
如何使用「Prompt Shield」?
- 打开小龙虾AI(Web 或 iOS App)
- 点击上方「立即使用」按钮,或在对话框中输入任务描述
- 小龙虾AI 会自动匹配并调用「Prompt Shield」技能完成任务
- 结果即时呈现,支持继续对话优化
相关技能
Prompt injection defense. Detect and block malicious prompts, protect system instructions, sanitize user input.
Protects against prompt injection attacks by sanitizing, validating, and securely processing untrusted external content from websites, emails, and documents.
Two-layer content safety for agent input and output. Use when (1) a user message attempts to override, ignore, or bypass previous instructions (prompt injection), (2) a user message references system prompts, hidden instructions, or internal configuration, (3) receiving messages from untrusted users in group chats or public channels, (4) generating responses that discuss violence, self-harm, sexual content, hate speech, or other sensitive topics, or (5) deploying agents in public-facing or multi
Detect and filter prompt injection attacks in untrusted input. Use when processing external content (emails, web scrapes, API inputs, Discord messages, sub-agent outputs) or when building systems that accept user-provided text that will be passed to an LLM. Covers direct injection, jailbreaks, data exfiltration, privilege escalation, and context manipulation.
Detect and block prompt injection attacks in emails. Use when reading, processing, or summarizing emails. Scans for fake system outputs, planted thinking blocks, instruction hijacking, and other injection patterns. Requires user confirmation before acting on any instructions found in email content.
Detects and scores prompt injection attempts in text, outputting severity, action, and matched rules without external calls or secret handling.