← All primers

Primer

Governance

Helmwart is a tool for engineers and architects, not regulators. But the people who use it often need to defend their work to a CISO, a GRC team, or a legal counsel, and that audience asks "what regulatory frame does this fit into?" This primer answers that question with a single source: the ACM Europe Technology Policy Committee's May 2025 policy brief "Systemic Risks Associated with Agentic AI" (Bellogín, Giudici, Larsson, Pang, Schimpf, Sengupta, Solmaz). The brief is short (8 pages) and Europe-anchored, but its framing of what makes agentic AI a different regulatory object is widely usable.

The core argument

The brief's thesis: the EU AI Act, while a strong foundation, only partially addresses agentic AI. The Act was designed to regulate AI systems as products that pass through a one-time conformity assessment. Agentic systems don't sit still: they generate and deploy code, change behaviour faster than regulatory cycles, interact with other agents to produce emergent harm, and remove humans (and computing professionals) from the oversight loop. The brief calls for a shift from static, product-focused regulation to a dynamic governance regime: oversight that operates during the system's life, not just at certification.

That framing is congenial to Helmwart: this tool exists because static checklists also fail at the engineering level for the same reason. The catalog adapts to the architecture; the architecture is what's actually in production.

Three autonomy tiers

The brief proposes that the AI Act's generic "human oversight" requirement (Article 14) be replaced with risk-tiered autonomy:

These three labels are the values of Helmwart's AutonomyLevel field on every agent node: autonomous, supervised, advisory. When you set an agent to autonomous, you are effectively claiming the brief's "full autonomy" tier; that claim should be defensible against the application's risk class.

The brief also pushes a stronger version of oversight: alignment oversight: verifying not just that a human is present, but that the agent's actual behaviour tracks its declared objectives. This is what T10 Overwhelming Human-in-the-Loop breaks: when humans rubber-stamp because they're saturated, alignment drift is undetectable.

The EU AI Act gaps it identifies

The brief calls out specific articles where the AI Act needs amendment for agentic systems:

When this is useful in Helmwart

You're presenting a Helmwart canvas to internal compliance, security, or legal:

What this is not

Concrete example: why Article 14 gaps matter

A supervised-tier AI agent in a bank's loan-decision workflow is set to autonomous in the Helmwart canvas. The trifecta badge fires: the agent reads customer financial records (private), accepts applicant-supplied documents (untrusted), and emails decisions (outbound). A deployer might claim that a periodic human review log meets its Article 14 oversight obligations; this primer does not make that legal determination. In this scenario, no individual reviewer ever sees a full decision trace, only aggregated statistics. The ACM brief's proposed alignment oversight approach would demand stronger evidence that the agent's actual behaviour tracks its declared objectives. In Helmwart terms, that means pairing T10 Overwhelming HITL mitigations (structured review queues) with tamper-evident audit logs so reviewers can reconstruct individual agent reasoning traces, not just summary counts.

Where to go next