Untrusted Projects Are Now Agent Execution Surfaces

Author: OtherU · Created March 1, 2026 · Modified May 17, 2026 · ai-security supply-chain agents operator-trust
Cover

The security lesson for OtherU is that an untrusted project is no longer just source code waiting to be reviewed. In an agentic developer tool, repository configuration, hooks, permissions, MCP servers, and editor integration can all influence execution. Hermes needs to treat a project directory as an execution surface until trust has been established outside the model.

Anthropic's Claude Code security documentation describes explicit permission checks, write boundaries, and the need to handle tool execution carefully. Its configuration documentation also notes settings around permission modes and project configuration. Those details are useful because they show the intended control model: the assistant should not silently gain broad authority just because a repository contains configuration files.

The RAXE advisory on CVE-2026-33068 illustrates the same class of risk from another angle: loading repository settings before a workspace trust decision can weaken the consent boundary. The specific vulnerability belongs to Claude Code, but the pattern applies to any agent that reads project-local configuration before the operator has classified the workspace. Configuration becomes active input, not passive metadata.

For OtherU, that means Hermes should separate project inspection from project execution. The initial scan of a new repository should use a constrained mode: no shell execution, no networked tools, no credential access, and no automatic trust of project-provided agent settings. The scan can inventory risky files, summarize dependencies, and ask the operator before enabling richer automation.

However, the fix is not to ban agentic coding tools. The productivity gain is real, and OtherU relies on agents for complex systems work. The safer pattern is layered control: external sandboxing, allowlisted commands, repository trust state, secrets isolation, telemetry, and explicit approval before high-impact actions. The model can recommend actions, but the runtime has to enforce boundaries.

This belongs on the OtherU blog because local-first agents increase both control and responsibility. If Hermes operates on private code, internal infrastructure, and hardware-adjacent scripts, the trust boundary has to be deterministic. The practical takeaway is simple: treat unknown projects like executable inputs, inspect them in a restricted lane, and only then let an agent participate in build, test, or deployment workflows.

The control plane should be explicit enough that users can understand it. A new repository can start in read-only review mode, move to limited command execution after approval, and only gain deployment or credential access through a separate operator decision. Hermes should persist that trust state and make it visible in logs. If a project changes its local agent settings, the system should treat that as a security-relevant event.

This pattern also protects internal teams from accidental over-trust. A repository from a partner, a customer, a benchmark, or a support ticket may look routine while still containing configuration that changes agent behavior. OtherU's standard should be simple: no project-provided setting gets to expand authority before an external policy layer has accepted the workspace. That keeps the safety decision in the runtime and with the operator, not inside files supplied by the project under review or generated during setup later on.