buildkite/buildkite-mcp-server — agentic threat model

9.7AIVSS 9.7 · Critical

This agent exposes highly sensitive CI/CD control planes (pipeline mutation, build triggering, and failure diagnosis) over the Buildkite API, presenting a critical risk of supply chain compromise or unauthorized code execution if compromised.

OWASP AIVSS score rationale

AIVSS = (CVSS_Base + AARS) × Mitigation_Factor, where AARS = (10 − CVSS_Base) × (Factor_Sum / 10) × ThM

CVSS base 9.3AARS uplift 0.42Factor sum 5.4/10Threat ×1.1Mitigation ×1.0

Autonomy of Action		0.80
Goal-Driven Planning		0.70
Self-Modification		0.10
Dynamic Tool Use		0.90
Persistent Memory		0.20
Contextual Awareness		0.60
Dynamic Identity		0.50
Multi-Agent Interactions		0.30
Non-Determinism		0.70
Opacity & Reflexivity		0.60

Scored with the canonical OWASP AIVSS formula (AIVSS calculator reference); agentic risk factors estimated from the agent’s described capabilities.

MAESTRO 7-layer threat model

Per-layer threats for this agent. Layers tagged “not certain from listing” are general, caveated commentary where the public description didn’t pin that layer.

L1 · Foundation Models⚠ not certain from listing

Not certain from the listing — The underlying LLM is not specified, but the model is highly vulnerable to indirect prompt injection via build logs, pipeline configurations, or commit messages, leading to unauthorized tool execution.

L2 · Data Operations⚠ not certain from listing

Not certain from the listing — Data operations are primarily transactional over the Buildkite API, but ingestion of untrusted build logs and error outputs for 'failure diagnosis' presents a data poisoning and injection risk.

L3 · Agent Frameworks✓ mapped

The MCP server framework exposes powerful tools for pipeline creation, build triggering, and job monitoring. Insecure tool integration or lack of strict input validation on these tools could allow arbitrary command injection into CI/CD pipelines.

L4 · Deployment & Infrastructure✓ mapped

The agent relies on Buildkite API tokens. Compromise of the hosting environment or secrets storage directly exposes the entire Buildkite organization to lateral movement, unauthorized runner access, and infrastructure compromise.

L5 · Evaluation & Observability⚠ not certain from listing

Not certain from the listing — There is no mention of built-in guardrails, execution monitoring, or human-in-the-loop (HITL) approval steps before executing destructive actions like triggering builds or mutating pipelines.

L6 · Security & Compliance (cross-cutting)✓ mapped

Security relies entirely on the external Buildkite API token permissions. The agent itself lacks fine-grained access control (TBAC/RBAC) to restrict which users or prompts can trigger high-privilege pipeline mutations.

L7 · Agent Ecosystem✓ mapped

As an MCP server, this agent is designed to be called by other orchestrators or agents. In a multi-agent ecosystem, a compromised upstream agent could abuse this agent's tools to inject malicious code into production pipelines.

MAESTRO — the 7-layer agentic threat-modeling framework (Cloud Security Alliance / Ken Huang).