Codex CLI — agentic threat model

5.5AIVSS 5.5 · Medium

Codex CLI presents a high-impact but well-mitigated risk profile; while its 'Full Auto' mode and code execution capabilities could lead to severe local compromise, its sandboxed, network-disabled environment and granular approval modes significantly reduce the active threat vector.

OWASP AIVSS score rationale

AIVSS = (CVSS_Base + AARS) × Mitigation_Factor, where AARS = (10 − CVSS_Base) × (Factor_Sum / 10) × ThM

CVSS base 8.4AARS uplift 0.74Factor sum 4.6/10Threat ×1.0Mitigation ×0.6

Autonomy of Action		0.80
Goal-Driven Planning		0.70
Self-Modification		0.20
Dynamic Tool Use		0.80
Persistent Memory		0.20
Contextual Awareness		0.70
Dynamic Identity		0.10
Multi-Agent Interactions		0.00
Non-Determinism		0.60
Opacity & Reflexivity		0.50

Scored with the canonical OWASP AIVSS formula (AIVSS calculator reference); agentic risk factors estimated from the agent’s described capabilities.

MAESTRO 7-layer threat model

Per-layer threats for this agent. Layers tagged “not certain from listing” are general, caveated commentary where the public description didn’t pin that layer.

L1 · Foundation Models✓ mapped

Utilizes OpenAI models (such as o4-mini) for code generation and reasoning. Primary threats include prompt injection, adversarial inputs (especially via multimodal screenshots/diagrams), and model-reprogramming to generate malicious code.

L2 · Data Operations✓ mapped

Operates on local source code and accepts multimodal inputs (text, screenshots, diagrams). The primary threat is data exposure or poisoning if malicious files/diagrams are introduced into the local workspace, though data remains local.

L3 · Agent Frameworks✓ mapped

Orchestrates tasks across three approval modes (Suggest, Auto Edit, Full Auto). In Full Auto mode, the risk of tool misuse (unintended file modification or execution of destructive commands) is highly elevated.

L4 · Deployment & Infrastructure✓ mapped

Runs in a local terminal environment. Mitigates network-based threats by executing within a sandboxed, network-disabled environment, though sandbox escape remains a critical threat vector to the host system.

L5 · Evaluation & Observability⚠ not certain from listing

Not certain from the listing — there is no explicit mention of logging, telemetry, guardrails, or observability tools to monitor the agent's actions or detect anomalous behavior during execution.

L6 · Security & Compliance (cross-cutting)✓ mapped

Provides strong security controls through local-only execution, a network-disabled sandbox, and user-configurable approval gates (Suggest, Auto Edit, Full Auto) acting as a human-in-the-loop policy.

L7 · Agent Ecosystem⚠ not certain from listing

Not certain from the listing — the agent appears designed for standalone local developer use and does not indicate any multi-agent collaboration or marketplace integration.

MAESTRO — the 7-layer agentic threat-modeling framework (Cloud Security Alliance / Ken Huang).