Hebbia AI — agentic threat model

AIVSS 7.4 · High

Hebbia AI presents a high-consequence risk profile due to its integration with sensitive financial and legal enterprise data, balanced by its emphasis on transparency, citations, and enterprise-grade security controls.

OWASP AIVSS score rationale

AIVSS = (CVSS_Base + AARS) × Mitigation_Factor, where AARS = (10 − CVSS_Base) × (Factor_Sum / 10) × ThM and ThM is the threat multiplier.

CVSS base 8.5 · AARS uplift 0.75 · Factor sum 5.0/10 · Threat ×1.0 · Mitigation ×0.8

Autonomy of Action		0.70
Goal-Driven Planning		0.80
Self-Modification		0.10
Dynamic Tool Use		0.50
Persistent Memory		0.60
Contextual Awareness		0.80
Dynamic Identity		0.20
Multi-Agent Interactions		0.50
Non-Determinism		0.50
Opacity & Reflexivity		0.30

Scored with the canonical OWASP AIVSS formula (AIVSS calculator reference); agentic risk factors estimated from the agent’s described capabilities.
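
The arithmetic behind the headline score can be re-checked with a short sketch. It plugs the listed values into the formula above. The function names (`aars`, `aivss`, `severity_band`) are hypothetical helpers, not part of any OWASP tooling, and the severity bands are an assumption borrowed from the CVSS v3.x qualitative scale, since the listing does not state which thresholds it uses.

```python
# Agentic risk factors exactly as listed above (each scored 0.0 to 1.0).
FACTORS = {
    "Autonomy of Action": 0.70,
    "Goal-Driven Planning": 0.80,
    "Self-Modification": 0.10,
    "Dynamic Tool Use": 0.50,
    "Persistent Memory": 0.60,
    "Contextual Awareness": 0.80,
    "Dynamic Identity": 0.20,
    "Multi-Agent Interactions": 0.50,
    "Non-Determinism": 0.50,
    "Opacity & Reflexivity": 0.30,
}


def aars(cvss_base: float, factor_sum: float, threat_multiplier: float) -> float:
    """AARS = (10 - CVSS_Base) * (Factor_Sum / 10) * ThM."""
    return (10 - cvss_base) * (factor_sum / 10) * threat_multiplier


def aivss(cvss_base: float, factor_sum: float,
          threat_multiplier: float, mitigation_factor: float) -> float:
    """AIVSS = (CVSS_Base + AARS) * Mitigation_Factor."""
    uplift = aars(cvss_base, factor_sum, threat_multiplier)
    return (cvss_base + uplift) * mitigation_factor


def severity_band(score: float) -> str:
    """Assumption: same qualitative bands as CVSS v3.x."""
    if score >= 9.0:
        return "Critical"
    if score >= 7.0:
        return "High"
    if score >= 4.0:
        return "Medium"
    if score > 0.0:
        return "Low"
    return "None"


factor_sum = sum(FACTORS.values())
uplift = aars(8.5, factor_sum, 1.0)
score = aivss(8.5, factor_sum, 1.0, 0.8)

print(f"Factor sum: {factor_sum:.1f}/10")
print(f"AARS uplift: {uplift:.2f}")
print(f"AIVSS: {score:.1f} ({severity_band(round(score, 1))})")
```

With the listed inputs this reproduces the factor sum of 5.0, the AARS uplift of 0.75, and the final score of 7.4. Rounding is applied only at display time so that floating-point noise in the factor sum does not leak into the reported figures.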

MAESTRO 7-layer threat model

Per-layer threats for this agent. Layers tagged “not certain from listing” carry general, caveated commentary because the public description did not pin down that layer.

L1 · Foundation Models · ⚠ not certain from listing

Not certain from the listing: Hebbia likely uses proprietary or third-party frontier foundation models to support its advertised 'infinite context window'. Primary threats include prompt injection that could hijack multi-step workflows or produce misaligned outputs in sensitive financial and legal contexts.

L2 · Data Operations · ✓ mapped

Ingests massive structured and unstructured datasets across multiple formats. Key threats include data exfiltration of highly confidential enterprise documents, knowledge-base poisoning, and unauthorized access across tenant boundaries.

L3 · Agent Frameworks · ✓ mapped

Orchestrates complex, multi-step workflows and end-to-end tasks via its 'Matrix' interface. Threats include insecure tool integration, workflow bypass, and manipulation of the agent's planning logic during execution.

L4 · Deployment & Infrastructure · ⚠ not certain from listing

Not certain from the listing: likely deployed as a secure SaaS platform with enterprise-grade hosting. Threats include container escape, insecure API endpoints, and insufficient sandboxing during heavy document parsing and workflow execution.

L5 · Evaluation & Observability · ✓ mapped

Provides strong observability through 'total transparency' and a spreadsheet-like interface delivering answers with direct citations. This mitigates opacity, but risks of evaluation gaming or undetected drift in automated workflows remain.

L6 · Security & Compliance (cross-cutting) · ✓ mapped

Explicitly claims 'enterprise-grade security' tailored for finance, law, and Fortune 500 companies. This suggests that access controls, audit logging, and compliance frameworks are in place, though the listing does not name specific certifications.

L7 · Agent Ecosystem · ✓ mapped

Allows users to build and run multiple AI agents to complete tasks. Threats include cascading failures across interdependent workflows and potential unauthorized cross-agent data sharing within the enterprise tenant.

MAESTRO — the 7-layer agentic threat-modeling framework (Cloud Security Alliance / Ken Huang).