Playwright — agentic threat model

9.2AIVSS 9.2 · Critical

The Playwright plugin introduces high agentic risk due to its capability to execute arbitrary browser automation, form submissions, and DOM interactions, which can be abused for client-side attacks or unauthorized data exfiltration if the orchestrating agent is compromised.

OWASP AIVSS score rationale

AIVSS = (CVSS_Base + AARS) × Mitigation_Factor, where AARS = (10 − CVSS_Base) × (Factor_Sum / 10) × ThM

CVSS base 8.2AARS uplift 1.02Factor sum 5.4/10Threat ×1.05Mitigation ×1.0

Autonomy of Action		0.80
Goal-Driven Planning		0.70
Self-Modification		0.10
Dynamic Tool Use		0.90
Persistent Memory		0.20
Contextual Awareness		0.80
Dynamic Identity		0.50
Multi-Agent Interactions		0.30
Non-Determinism		0.60
Opacity & Reflexivity		0.50

Scored with the canonical OWASP AIVSS formula (AIVSS calculator reference); agentic risk factors estimated from the agent’s described capabilities.

MAESTRO 7-layer threat model

Per-layer threats for this agent. Layers tagged “not certain from listing” are general, caveated commentary where the public description didn’t pin that layer.

L1 · Foundation Models⚠ not certain from listing

Not certain from the listing — The underlying foundation model is Claude (via Claude Code), which is susceptible to prompt injection attacks that could force the model to generate malicious browser automation scripts or navigate to untrusted sites.

L2 · Data Operations⚠ not certain from listing

Not certain from the listing — The data operations layer is not detailed, but the agent processes live DOM structures, screenshots, and web page content, creating risks of indirect prompt injection from untrusted web pages parsed during execution.

L3 · Agent Frameworks✓ mapped

The agent framework integrates Microsoft's Playwright MCP server into Claude Code. This creates a high risk of tool misuse, where an attacker can manipulate the agent into executing unauthorized clicks, filling forms with malicious payloads, or exfiltrating sensitive session data via screenshots.

L4 · Deployment & Infrastructure⚠ not certain from listing

Not certain from the listing — The deployment context is Claude Code (typically local developer environments). If the browser instance is not strictly sandboxed, it could allow local file access, SSRF via the browser, or lateral movement within the developer's local network.

L5 · Evaluation & Observability⚠ not certain from listing

Not certain from the listing — There are no mentioned logging, auditing, or guardrail mechanisms to monitor the browser actions, screenshots taken, or forms submitted by the agent to detect anomalous or malicious behavior.

L6 · Security & Compliance (cross-cutting)⚠ not certain from listing

Not certain from the listing — No security policies, authentication controls, or authorization boundaries are specified to restrict which domains the browser can navigate to or what credentials it can input into forms.

L7 · Agent Ecosystem✓ mapped

The agent operates as a plugin within the Claude Code ecosystem. Compromise of the orchestrating agent or upstream MCP server could lead to cascading failures, allowing unauthorized browser control to be leveraged by other connected tools or agents.

MAESTRO — the 7-layer agentic threat-modeling framework (Cloud Security Alliance / Ken Huang).