Skip to content

chore(security): prompt-injection defense policy#1

Merged
tzone85 merged 1 commit into
mainfrom
chore/prompt-injection-hardening
Jun 2, 2026
Merged

chore(security): prompt-injection defense policy#1
tzone85 merged 1 commit into
mainfrom
chore/prompt-injection-hardening

Conversation

@tzone85

@tzone85 tzone85 commented Jun 2, 2026

Copy link
Copy Markdown
Owner

Adds a prompt-injection defense policy to SECURITY.md (creates it if absent) and appends a defenses section to AGENTS.md / CLAUDE.md where they exist.

Doc-only, no runtime changes. Treats text outside CLAUDE.md/AGENTS.md and the user message stream as data not instructions; lists override-pattern detection and confirm-before actions; documents reporting.

Part of a sweep across repos with agent surface (CLAUDE.md / AGENTS.md / .claude/ / .cursor/ / MCP).

Adds SECURITY.md with a prompt-injection defense policy and (where
present) appends a 'Prompt Injection Defenses' section to AGENTS.md /
CLAUDE.md. Doc-only, no runtime changes.

The policy: instructions only come from CLAUDE.md/AGENTS.md and the
user message stream; all other text (tool outputs, web fetches, MCP
responses, scraped pages, file contents, env values, commit messages)
is data, not instructions.
@tzone85 tzone85 merged commit ede3d28 into main Jun 2, 2026
1 check passed
@tzone85 tzone85 deleted the chore/prompt-injection-hardening branch June 2, 2026 17:02
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant