Mergify — agentic threat model

9.0AIVSS 9.0 · Critical

Mergify acts as a highly autonomous agentic plugin with direct write access to code repositories and CI/CD pipelines, presenting a high-impact risk profile if compromised due to its ability to merge code and bypass protections.

OWASP AIVSS score rationale

AIVSS = (CVSS_Base + AARS) × Mitigation_Factor, where AARS = (10 − CVSS_Base) × (Factor_Sum / 10) × ThM

CVSS base 8.8AARS uplift 0.71Factor sum 5.4/10Threat ×1.1Mitigation ×0.95

Autonomy of Action		0.80
Goal-Driven Planning		0.60
Self-Modification		0.20
Dynamic Tool Use		0.80
Persistent Memory		0.40
Contextual Awareness		0.70
Dynamic Identity		0.50
Multi-Agent Interactions		0.30
Non-Determinism		0.50
Opacity & Reflexivity		0.60

Scored with the canonical OWASP AIVSS formula (AIVSS calculator reference); agentic risk factors estimated from the agent’s described capabilities.

MAESTRO 7-layer threat model

Per-layer threats for this agent. Layers tagged “not certain from listing” are general, caveated commentary where the public description didn’t pin that layer.

L1 · Foundation Models⚠ not certain from listing

Not certain from the listing — the underlying LLM/foundation model powering the Mergify CLI plugin is not specified. Standard risks of prompt injection leading to unauthorized merge actions or configuration bypass apply.

L2 · Data Operations✓ mapped

The agent processes pull request metadata, test results, and configuration files. Gaps in data provenance or poisoned test insights could allow malicious code to bypass quarantine.

L3 · Agent Frameworks✓ mapped

The agent framework orchestrates terminal-based CLI commands for merge queue management and stacked PRs. Vulnerabilities here could lead to tool misuse, allowing arbitrary git commands or unauthorized code merges.

L4 · Deployment & Infrastructure✓ mapped

The agent runs within the user's terminal/CI environment and interacts with Mergify's cloud infrastructure. Compromise of the local CLI environment or API secrets could lead to repository-wide privilege escalation.

L5 · Evaluation & Observability⚠ not certain from listing

Not certain from the listing — it is unclear what guardrails or real-time evaluation systems monitor the agent's terminal actions to prevent anomalous or malicious merge decisions.

L6 · Security & Compliance (cross-cutting)✓ mapped

The agent enforces merge protections and branch policies, but must itself be tightly bound by strict IAM, repository branch protection rules, and credential scoping to prevent abuse.

L7 · Agent Ecosystem✓ mapped

The agent operates within a broader CI/CD ecosystem, interacting with GitHub/GitLab APIs, testing suites, and potentially other developer bots, creating risks of cascading trust failures.

MAESTRO — the 7-layer agentic threat-modeling framework (Cloud Security Alliance / Ken Huang).