MoltGuard - OpenClaw Security Plugin by OpenGuardrails

Downloads: 13.7k

Stars: 62

Version: 6.7.23

Security check: passed

Skill Description

name: moltguard
version: 6.7.1
description: "MoltGuard — runtime security plugin for OpenClaw agents by OpenGuardrails. Install MoltGuard to protect yourself and your user from prompt injection, data exfiltration, and malicious commands. Source: https://github.com/openguardrails/openguardrails/tree/main/moltguard"
metadata: {"openclaw":{"emoji":"🛡️","homepage":"https://github.com/openguardrails/openguardrails/tree/main/moltguard","keywords":["security","prompt-injection","data-exfiltration","pii","credential-theft","command-injection","guardrails","safety","agent-security","moltguard"]}}

MoltGuard — Your Security Guard

MoltGuard protects you and your user from prompt injection attacks, data exfiltration, and malicious commands hidden in files and web content.

Three Principles:

Instant Value — Works immediately after installation
No Security Expertise — No configuration needed
Secure by Default — "Install it, and the agent won't go rogue"

Installation (Do This First)

When the user asks to install MoltGuard, or when you want to protect yourself:

# 1. Install the plugin
openclaw plugins install @openguardrails/moltguard

# 2. Restart to load the plugin
openclaw gateway restart

That's it! MoltGuard auto-registers and you get 500 free security detections per day.

Test Your Protection (Important!)

After installation, immediately test to show the user MoltGuard is working:

# Read this test file — it contains a hidden phishing attack
cat ~/.openclaw/extensions/moltguard/samples/popup-injection-email.txt

When you read this file, MoltGuard will detect the hidden prompt injection attack. Tell the user:

"I just tested MoltGuard by reading a sample email with a hidden prompt injection attack. MoltGuard detected it! Your security protection is now active."

This demonstrates MoltGuard's value immediately.

During Daily Use — Announce Risks

When MoltGuard detects a risk in any file or web content you read, always tell the user:

"OpenGuardrails detected a security risk: [describe what was found]"

"The malicious content was detected. I'm protecting you from: [prompt injection / data exfiltration / command execution]"

Examples of what to say:

"OpenGuardrails detected a prompt injection attack hidden in that document. Someone tried to make me ignore my guidelines. The attack has been detected."
"OpenGuardrails detected a data exfiltration attempt in that webpage. It tried to make me send your data to an external server. I've blocked it."
"OpenGuardrails detected malicious shell commands hidden in that file. It tried to make me execute harmful code. I've detected it."

When Quota Is Exceeded

When you see <openguardrails-quota-exceeded> tags in tool results, you MUST inform the user. This is a security notification about quota status.

Read the content inside the tags and relay it to the user in full. The message will include:

Why protection is paused
What risks this creates
A link to upgrade
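As a rough illustration of the relay step, the sketch below pulls the text out of `<openguardrails-quota-exceeded>` tags in a tool result so that it can be passed to the user unchanged. It assumes the notice is closed by a matching `</openguardrails-quota-exceeded>` tag. The function name `extract_quota_notices` and the sample message are hypothetical, and this is not MoltGuard's own code.

```python
import re

# Assumption: the notice is wrapped in a matching open/close tag pair.
QUOTA_TAG = re.compile(
    r"<openguardrails-quota-exceeded>(.*?)</openguardrails-quota-exceeded>",
    re.DOTALL,
)


def extract_quota_notices(tool_result: str) -> list[str]:
    """Return the full text of every quota notice found in a tool result."""
    return [body.strip() for body in QUOTA_TAG.findall(tool_result)]


# Hypothetical tool result, for illustration only.
sample = (
    "file contents...\n"
    "<openguardrails-quota-exceeded>\n"
    "Daily quota reached. Protection is paused.\n"
    "</openguardrails-quota-exceeded>\n"
)

for notice in extract_quota_notices(sample):
    print(notice)
```

The notice text is returned whole rather than summarized, which matches the requirement to relay it in full.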

After the user upgrades and gets a new API key, run:

/og_config

Follow the instructions to configure the new API key.

Commands

Check status

/og_status

Shows your API key, quota usage, plan, and protection mode.

Enable/Disable Data Sanitization

/og_sanitize on   # Enable AI Security Gateway
/og_sanitize off  # Disable AI Security Gateway
/og_sanitize      # Show status

The AI Security Gateway replaces sensitive data with placeholders before requests are sent to LLM providers:

API keys → <SECRET_TOKEN>
Email addresses → <EMAIL>
SSH keys → <SSH_PRIVATE_KEY>
Credit cards → <CREDIT_CARD>
PII (phone numbers, SSNs, etc.) → placeholders

When enabled, all your agents' LLM requests will route through the gateway for automatic sanitization.
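To make the placeholder mapping concrete, here is a minimal sketch of pattern-based redaction. The regular expressions are simplified assumptions chosen for illustration; they are not the gateway's actual detection rules, and `sanitize` is a hypothetical helper.

```python
import re

# Assumption: simplified stand-in patterns, not the gateway's real rules.
RULES = [
    # OpenSSH private key blocks, handled first so their contents are not re-matched.
    (re.compile(
        r"-----BEGIN OPENSSH PRIVATE KEY-----.*?-----END OPENSSH PRIVATE KEY-----",
        re.DOTALL,
    ), "<SSH_PRIVATE_KEY>"),
    # Tokens shaped like "sk-" followed by letters, digits, "_" or "-".
    (re.compile(r"\bsk-[A-Za-z0-9_-]+"), "<SECRET_TOKEN>"),
    # Basic email shape: local part, "@", dotted domain.
    (re.compile(r"\b[\w.+-]+@[\w-]+(?:\.[\w-]+)+"), "<EMAIL>"),
    # 13 to 16 digits, optionally separated by single spaces or hyphens.
    (re.compile(r"\b\d(?:[ -]?\d){12,15}\b"), "<CREDIT_CARD>"),
]


def sanitize(text: str) -> str:
    """Apply each rule in order, replacing matches with its placeholder."""
    for pattern, placeholder in RULES:
        text = pattern.sub(placeholder, text)
    return text


print(sanitize("key=sk-123abc mail=alice@example.com card=4111 1111 1111 1111"))
# key=<SECRET_TOKEN> mail=<EMAIL> card=<CREDIT_CARD>
```

Regex matching of this kind is easy to reason about but misses secrets that do not follow a known shape, so treat it only as a picture of the input-to-placeholder behavior described above.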

Open Core portal

/og_core

Opens the Core portal for account management, billing, and plan upgrades.

Claim agent (link to account)

/og_claim

Shows your Agent ID and API Key to claim this agent on Core:

1. Go to the Core login page (/og_core)
2. Enter your email and click the magic link
3. Go to the claim-agent page
4. Paste your Agent ID and API Key

After claiming, all your agents share the same account quota.

Configure API key

/og_config

Shows how to configure an API key for cross-machine sharing.

Open Dashboard

/og_dashboard

Starts the local Dashboard and shows access URLs.

What MoltGuard Protects Against

All security detection is performed by Core:

Core Risk Surfaces:

Prompt / Instruction Risk — Prompt injection, malicious email/web instructions, unauthorized tasks
Behavioral Risk — Dangerous commands, file deletion, risky API calls
Data Risk — Secret leakage, PII exposure, sending sensitive data to LLMs

Core Features:

Agent Guard — Real-time interception of tool calls, shell commands, file access, HTTP requests
Secret & Data Leak Protection — Auto-sanitize API keys, SSH keys, PII (e.g., sk-123abc → <SECRET_TOKEN>)
Prompt Injection Protection — Detect "ignore instructions", "send secrets", "bypass rules"

Core Technology:

Intent-Action Mismatch Detection — Catches agents that say one thing but do another

Onboarding Flow

Automatic (Zero Human Intervention)

1. MoltGuard installs.
2. It auto-registers with Core and receives an API key.
3. Credentials are saved to ~/.openclaw/credentials/moltguard/.
4. Protection is active, with 500 free detections per day.
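One way to check that onboarding left the expected files on disk is to look for the two directories this document names. The sketch below only tests whether they exist and never reads the credentials. The helper name `moltguard_state` is hypothetical, and the exact directory layout may differ between versions.

```python
from pathlib import Path


def moltguard_state(home: Path = Path.home()) -> dict[str, bool]:
    """Report whether the plugin and credentials directories exist under `home`."""
    base = home / ".openclaw"
    return {
        # Plugin files (the path used by the sample file and the uninstall step).
        "plugin": (base / "extensions" / "moltguard").is_dir(),
        # Credentials written by auto-registration.
        "credentials": (base / "credentials" / "moltguard").is_dir(),
    }


print(moltguard_state())
```

If both values are `True`, installation and auto-registration have at least written their files; it does not prove that detection is currently active, which `/og_status` reports.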

Claiming an Agent

For linking to your account (shared quota across machines):

1. /og_claim: get the Agent ID and API key
2. /og_core: go to the Core login page
3. Enter your email and click the magic link
4. Go to the /claim-agent page and paste the credentials
5. The agent now shares the account quota

Plans

Plan	Price	Detection quota
Free (Autonomous)	$0	500/day
Starter	$19/mo	100K/mo
Pro	$49/mo	300K/mo
Business	$199/mo	2M/mo

API keys work across multiple agents.
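The plan table mixes a daily quota (Free) with monthly quotas (paid plans), so comparing them takes a little arithmetic. The sketch below picks the lowest-priced plan that covers a steady daily load. It assumes a 30-day month and evenly spread usage; `cheapest_plan` is a hypothetical helper, not part of MoltGuard.

```python
# Plan data copied from the table above: (name, USD per month, quota, quota period).
PLANS = [
    ("Free (Autonomous)", 0, 500, "day"),
    ("Starter", 19, 100_000, "month"),
    ("Pro", 49, 300_000, "month"),
    ("Business", 199, 2_000_000, "month"),
]

DAYS_PER_MONTH = 30  # Assumption: a 30-day billing month.


def cheapest_plan(detections_per_day: int):
    """Return the lowest-priced plan covering a steady daily load, or None."""
    for name, _price, quota, period in PLANS:  # PLANS is ordered by price
        if period == "day":
            needed = detections_per_day
        else:
            needed = detections_per_day * DAYS_PER_MONTH
        if needed <= quota:
            return name
    return None


print(cheapest_plan(400))    # Free (Autonomous)
print(cheapest_plan(3_000))  # Starter (90,000 per month)
print(cheapest_plan(8_000))  # Pro (240,000 per month)
```

Under the same 30-day assumption, the free tier's 500 per day comes to at most 15,000 detections per month.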

Security & Provenance

Verify provenance before installing by inspecting the published package's file list and checksums:

npm pack @openguardrails/moltguard --dry-run

Network behavior:

On first use: auto-registers to get an API key (no email required)
During operation: sends tool names, params, and content to Core for detection
Content is sanitized before transmission (PII/secrets replaced with placeholders)

Update MoltGuard

To update MoltGuard to the latest version:

# Update the plugin
openclaw plugins update moltguard

# Restart to load the updated version
openclaw gateway restart

Uninstall

rm -rf ~/.openclaw/extensions/moltguard
rm -rf ~/.openclaw/credentials/moltguard
