GPT Image 2 AI Photo Editor — agentic threat model

5.6AIVSS 5.6 · Medium

GPT Image 2 is a low-risk, human-in-the-loop conversational image editor with minimal autonomy, where the primary security risks are model-level prompt injections, generation of inappropriate content, and intellectual property concerns rather than systemic infrastructure compromise.

OWASP AIVSS score rationale

AIVSS = (CVSS_Base + AARS) × Mitigation_Factor, where AARS = (10 − CVSS_Base) × (Factor_Sum / 10) × ThM

CVSS base 4.3AARS uplift 1.25Factor sum 2.2/10Threat ×1.0Mitigation ×1.0

Autonomy of Action		0.10
Goal-Driven Planning		0.10
Self-Modification		0.00
Dynamic Tool Use		0.20
Persistent Memory		0.10
Contextual Awareness		0.20
Dynamic Identity		0.00
Multi-Agent Interactions		0.00
Non-Determinism		0.80
Opacity & Reflexivity		0.70

Scored with the canonical OWASP AIVSS formula (AIVSS calculator reference); agentic risk factors estimated from the agent’s described capabilities.

MAESTRO 7-layer threat model

Per-layer threats for this agent. Layers tagged “not certain from listing” are general, caveated commentary where the public description didn’t pin that layer.

L1 · Foundation Models✓ mapped

Uses underlying generative models for text-to-image synthesis and conversational interaction. Primary threats include adversarial prompt injections to bypass safety filters, generating copyrighted or brand-damaging material, and model reprogramming.

L2 · Data Operations⚠ not certain from listing

Not certain from the listing — No details are provided regarding training data, vector stores, or RAG. General threats include training data poisoning, copyright infringement from training sets, and lack of data lineage for generated assets.

L3 · Agent Frameworks⚠ not certain from listing

Not certain from the listing — The orchestration framework for translating conversational prompts into image editing commands is unspecified. General threats involve prompt injection manipulating the tool-calling mechanism used to modify images.

L4 · Deployment & Infrastructure⚠ not certain from listing

Not certain from the listing — Hosting and infrastructure details are absent. General threats include GPU resource exhaustion from denial-of-service attacks and potential remote code execution if user-uploaded images exploit vulnerabilities in image processing libraries.

L5 · Evaluation & Observability⚠ not certain from listing

Not certain from the listing — No monitoring, logging, or input/output guardrails are described. General threats include blind spots in detecting the generation of deepfakes, NSFW content, or policy-violating imagery.

L6 · Security & Compliance (cross-cutting)⚠ not certain from listing

Not certain from the listing — No compliance certifications (e.g., SOC2, GDPR) or identity management controls are mentioned. General threats include lack of audit trails for generated content and potential violations of intellectual property regulations.

L7 · Agent Ecosystem✓ mapped

The agent operates as a standalone horizontal tool with no described multi-agent or marketplace integrations, making ecosystem-level threats like cascading agent-to-agent failures minimal.

MAESTRO — the 7-layer agentic threat-modeling framework (Cloud Security Alliance / Ken Huang).