Teenage-AGI — agentic threat model
Teenage-AGI presents a moderate-to-high risk profile primarily driven by its infinite persistent memory via Pinecone, making it highly susceptible to long-term memory poisoning and indirect prompt injection that can persistently alter the agent's behavior across sessions.
OWASP AIVSS score rationale
| Autonomy of Action | 0.40 | |
| Goal-Driven Planning | 0.70 | |
| Self-Modification | 0.50 | |
| Dynamic Tool Use | 0.20 | |
| Persistent Memory | 1.00 | |
| Contextual Awareness | 0.80 | |
| Dynamic Identity | 0.10 | |
| Multi-Agent Interactions | 0.20 | |
| Non-Determinism | 0.70 | |
| Opacity & Reflexivity | 0.60 |
Scored with the canonical OWASP AIVSS formula (AIVSS calculator reference); agentic risk factors estimated from the agent’s described capabilities.
MAESTRO 7-layer threat model
Per-layer threats for this agent. Layers tagged “not certain from listing” are general, caveated commentary where the public description didn’t pin that layer.
Utilizes GPT-4 as its foundation model. Primary threats include prompt injection that can hijack the agent's 'thinking' phase, and model alignment risks where the model generates inappropriate or harmful reasoning steps.
Integrates with Pinecone for vector storage to achieve infinite memory recall. This introduces severe risks of memory/knowledge-base poisoning, where malicious inputs are permanently stored and retrieved later, as well as data exfiltration of sensitive personal history stored in the vector database.
Based on a custom Python framework inspired by BabyAGI and Generative Agents. Vulnerable to memory poisoning and state manipulation, where corrupted memory recall disrupts the agent's planning and reasoning loops.
Not certain from the listing — likely run locally as a Python script. Main threats include insecure local storage of OpenAI and Pinecone API keys, and the lack of a sandboxed execution environment for the Python runtime.
Not certain from the listing — no observability, logging, or guardrail mechanisms are mentioned. This creates a significant blind spot, making it difficult to detect when the agent's memory or reasoning has been compromised.
Not certain from the listing — being an open-source hobby/technology project, it lacks enterprise security controls, access management, or compliance alignments (such as GDPR for the 'infinite' personal data stored).
Not certain from the listing — although inspired by interactive simulacra papers, there is no explicit multi-agent coordination or ecosystem integration described, meaning agent-to-agent trust abuse is currently a theoretical risk.
MAESTRO — the 7-layer agentic threat-modeling framework (Cloud Security Alliance / Ken Huang).