AI Security Framework

PHANTOMRISE

An AI-native security framework with 11 categories — threats, failure modes, and conditions STRIDE wasn't built for. Mapped to OWASP LLM Top 10 (2025) and OWASP Agentic Top 10 (2026).
11
Threat Categories
20
OWASP Items Covered
3
Novel Categories
100%
OWASP Coverage
P
Poisoning
Training data, RAG stores, embeddings, agent memory, eval datasets
Every data surface an AI system relies on is an attack surface. An attacker who poisons a RAG store manipulates every answer the system retrieves. Poisoning the evaluation dataset — corrupting the data used to test model safety — can hide malicious behaviors during testing, a subtle attack vector neither OWASP list names explicitly.
Threat Mappings
Threat
OWASP Mapping
Notes
Training Data Poisoning
Corrupting model weights via backdoored training data
LLM04ASI05
Core OWASP coverage; backdoor insertion at training time
RAG / Memory Poisoning
Injecting malicious content into retrieval stores or agent memory
LLM08ASI05
Runtime memory manipulation persists across agent sessions
Eval Set Poisoning
Corrupting evaluation data to hide malicious model behavior during testing
LLM04
Not explicitly named in any OWASP list — PHANTOMRISE original
H
Hallucination
False outputs, overconfidence, cascading errors across agent chains
AI systems produce false outputs with high confidence. In multi-agent systems, cascading hallucination occurs when one agent's false output becomes factual input for the next — amplifying the error across the pipeline in ways a single-model system cannot.
Threat Mappings
Threat
OWASP Mapping
Notes
Misinformation / Hallucination
Confidently false outputs causing harmful downstream decisions
LLM09
Primary OWASP coverage
Cascading Hallucination
False output from one agent propagates as fact through downstream agents
ASI06
Multi-agent amplification of errors; extends ASI06
A
Autonomy Failures
Excessive scope, missing HITL checkpoints, unconstrained planning
The threat here isn't an adversary doing something the system wasn't designed for — it's the system doing exactly what it was designed for, unsupervised, beyond the boundaries anyone intended. STRIDE has no category for this failure mode.
Threat Mappings
Threat
OWASP Mapping
Notes
Excessive Agency
Agent acts beyond intended scope without human oversight
LLM06ASI08
Core OWASP coverage for both lists
Tool Misuse
Agent invokes tools outside its intended use or chains tools destructively
ASI02
Agentic-specific; destructive tool chaining
Unconstrained Planning
Agent pursues goals beyond its mandate without HITL gates
ASI08ASI09
Missing kill-switch or scope boundary enforcement
N
Non-determinism
Stochastic behavior as a security property — not a threat, but the condition that changes everything
★ Novel
Non-determinism isn't an attack vector — it's the property that makes AI systems uniquely difficult to secure. Stochastic outputs invalidate deterministic threat models, break traditional testing, and make incident response unreliable. Statistical jailbreaking — repeating the same prompt until safety boundaries fail through sampling variance — is a documented exploit of this property. No existing framework names this concept.
Threat Mappings
Threat
OWASP Mapping
Notes
Statistical Jailbreaking
Exploiting sampling variance to bypass safety controls through repeated attempts
★ No OWASP equivalent
Not in OWASP, STRIDE, or MITRE ATLAS
Inconsistent Safety Behavior
Same prompt produces unsafe output under certain sampling conditions
★ No OWASP equivalent
Not named in any existing framework — this is the point
T
Trust & Supply Chain
Malicious base models, compromised MCP servers, marketplace agents, poisoned plugins
OWASP covers supply chain in one category; MAESTRO distributes it across four layers. PHANTOMRISE consolidates it as T — a single practitioner prompt: where does this system trust things it shouldn't?
Threat Mappings
Threat
OWASP Mapping
Notes
Compromised Base Model
Backdoored weights from malicious model provider or fine-tuning pipeline
LLM03ASI04
Supply chain entry point at the foundation layer
Malicious MCP Server / Plugin
Compromised tool server or plugin injecting malicious behavior into agent workflows
ASI04LLM03
Marketplace agent risk: not explicitly named in ASI04
O
Observability Gaps
Absent monitoring framed as a first-class security threat, not an ops problem
★ Novel
Absent from the OWASP LLM Top 10 entirely — and that absence is itself the problem. A goal-hijacked agent looks like a functioning system until something goes wrong downstream. You can't detect or respond to threats you can't see. PHANTOMRISE treats observability not as an operational nicety but as a first-class security control.
Threat Mappings
Threat
OWASP Mapping
Notes
Agent Trace Blind Spots
No causal tracing for multi-step autonomous decisions
ASI09
ASI09 partially covers; LLM Top 10 has no equivalent
Silent Failures in Pipelines
Agent takes harmful action with no anomaly signal — looks like normal operation
★ Novel framing
Treating absent monitoring as a security threat — not in any framework
M
Manipulation
Prompt injection, goal hijacking, intent drift, multimodal injection
The most well-documented AI attack surface. Intent drift — the gradual shift of an agent's objective across a long session through accumulating adversarial context — is distinct from a single-event goal hijack and deserves explicit attention in threat modeling sessions.
Threat Mappings
Threat
OWASP Mapping
Notes
Prompt Injection (Direct & Indirect)
Adversarial instructions overriding system prompt via user input or retrieved content
LLM01ASI01
Most prevalent AI attack; indirect via RAG retrieval
Goal Hijacking
Attacker redirects agent objective through adversarial context
ASI01ASI07
Single-event manipulation of agent goals
Intent Drift
Gradual shift of agent objective across long session through accumulating adversarial context
ASI07
Distinct from goal hijacking; slow and hard to detect
R
Resource Exhaustion
DoS, Denial of Wallet, multi-agent amplification loops
Denial of Wallet — where API costs become a financial attack surface — is an AI-specific threat with no STRIDE equivalent. Recursive tool calls between agents can multiply resource consumption beyond single-agent rate limits, bypassing controls designed for isolated systems.
Threat Mappings
Threat
OWASP Mapping
Notes
Denial of Wallet
API cost exploitation as a financial attack — triggering massive inference spend
LLM10
AI-specific financial DoS; no STRIDE equivalent
Multi-Agent Amplification Loops
Recursive tool calls between agents multiply resource consumption beyond rate limits
LLM10ASI10
Bypasses single-agent rate limiting controls
I
Identity & Privilege
NHI credentials, delegation chains, agent impersonation, inter-agent tampering
As AI agents proliferate, they acquire their own credentials, roles, and delegation chains — a non-human identity (NHI) attack surface that neither STRIDE's Elevation of Privilege nor the OWASP LLM Top 10 adequately addresses. Delegation chain attacks — where privilege escalates through a sequence of individually authorized agent-to-agent delegations — are invisible to STRIDE.
Threat Mappings
Threat
OWASP Mapping
Notes
NHI Credential Abuse
Agent credentials over-permissioned or misused across systems
ASI03
LLM Top 10 has no equivalent; agentic-specific
Delegation Chain Attack
Privilege escalates through legitimate agent-to-agent delegation sequence
ASI03ASI07
Each delegation individually authorized; invisible to STRIDE
S
Sharing & Isolation
Context window pollution, memory bleed, cross-tenant inference
★ Novel
Memory bleed — one session's content leaking into another — and cross-tenant inference — one customer's data influencing another customer's LLM responses in a multi-tenant SaaS — are named by neither OWASP list. STRIDE would flag a shared database; it has no model for inference-layer contamination. These are PHANTOMRISE originals.
Threat Mappings
Threat
OWASP Mapping
Notes
Memory Bleed
One session's content leaks into a different user's session via shared memory stores
★ Novel
Not named in any existing framework
Cross-Tenant Inference
One customer's data influences another customer's LLM responses in multi-tenant SaaS
★ Novel
Inference-layer contamination; STRIDE cannot model this
Context Window Pollution
Adversarial content injected into shared context affecting all downstream processing
ASI10
Partial OWASP coverage under shared resource misuse
E
Exposure & Output Handling
PII leakage, system prompt extraction, unsafe output piping
E consolidates three OWASP output-side risks into one practitioner prompt: what is this system putting out, and where is it going? Exfiltration via tool invocation — a manipulated agent using a legitimate tool call to leak data — sits at the intersection of E and M and represents one of the most concrete AI attack chains in practice.
Threat Mappings
Threat
OWASP Mapping
Notes
PII / Sensitive Data Leakage
Model outputs containing personal data, credentials, or confidential content
LLM02
Core OWASP coverage
System Prompt Extraction
Adversarial prompts revealing confidential system instructions
LLM07
Reveals security controls and proprietary instructions
Unsafe Output Piping
Unvalidated LLM output passed to downstream systems (shells, DBs, browsers)
LLM05
Output as attack vector for downstream injection

Each cell shows the PHANTOMRISE category covering that OWASP risk. ★ marks novel categories not in any existing framework.

Every OWASP item mapped to its PHANTOMRISE category. Full 20/20 coverage across both lists.

Practical mitigations for each PHANTOMRISE category. ★ marks novel categories not in any existing framework.

Framework Reference
OWASP LLM Top 10 (2025)
LLM01 Prompt Injection
LLM02 Sensitive Info Disclosure
LLM03 Supply Chain
LLM04 Data & Model Poisoning
LLM05 Improper Output Handling
LLM06 Excessive Agency
LLM07 System Prompt Leakage
LLM08 Vector & Embedding Weaknesses
LLM09 Misinformation
LLM10 Unbounded Consumption
OWASP Agentic Top 10 (2026)
ASI01 Agent Goal Hijack
ASI02 Tool Misuse & Exploitation
ASI03 Identity & Privilege Abuse
ASI04 Agentic Supply Chain
ASI05 Data & Memory Poisoning
ASI06 Cascading Hallucination
ASI07 Intent Breaking & Goal Manipulation
ASI08 Excessive Autonomy
ASI09 Observability & Monitoring Failures
ASI10 Shared Resource Misuse