Update ASI06_Memory_and_Context_Poisoning .md #718

Josh-Beck · 2025-09-17T18:51:22Z

[ASI06 V1]

Main work done in Google Doc, transferring to here for review and comments.

Signed-off-by: Joshua Beck <[email protected]>

itskerenkatz

Hi team!
Great work.
I added a few comments.
Do not forget to add a comparison to our existing frameworks.

...e/agentic-top-10/Sprint 1-first-public-draft-expanded/ASI06_Memory_and_Context_Poisoning .md


Josh-Beck · 2025-09-22T19:24:07Z

@itskerenkatz - I have added changes based on your feedback. Can we get this PR through and then go back to add references to other OWASP works? I am not sure I can make that change now, and I would like to do it in a separate PR.

thismohsin · 2025-09-22T19:56:39Z

...e/agentic-top-10/Sprint 1-first-public-draft-expanded/ASI06_Memory_and_Context_Poisoning .md

+**Access Control & Retention Policies:**
+* Limit access to trusted sources only, using authentication and authorization for user access, and curated data streams for ingesting potentially dangerous data.
+* Apply context-aware policies so an agent only accesses memory relevant to its current task.
+* Limit retention durations based on data sensitivity to reduce long-term risk.
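A minimal sketch of how the three bullets above might combine into a single read filter. Everything here is an assumption for illustration: the `MemoryEntry` fields, the `RETENTION` windows, and the `readable_entries` helper are hypothetical and do not come from the ASI06 draft.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone

# Hypothetical retention windows per sensitivity level (illustrative values only).
RETENTION = {
    "low": timedelta(days=90),
    "medium": timedelta(days=30),
    "high": timedelta(days=1),
}


@dataclass(frozen=True)
class MemoryEntry:
    text: str
    task_scope: str        # task this entry was written for
    sensitivity: str       # key into RETENTION
    trusted_source: bool   # assumed to be set at ingestion after authn/authz checks
    created_at: datetime


def readable_entries(entries, current_task, now=None):
    """Return only the entries the agent may read for its current task."""
    now = now or datetime.now(timezone.utc)
    allowed = []
    for entry in entries:
        if not entry.trusted_source:
            continue  # untrusted data never reaches the agent's context
        if entry.task_scope != current_task:
            continue  # context-aware policy: unrelated memory stays hidden
        if now - entry.created_at > RETENTION[entry.sensitivity]:
            continue  # retention window for this sensitivity level has passed
        allowed.append(entry)
    return allowed
```

Filtering at read time keeps the policy in one place. A real deployment would likely also delete expired entries from storage rather than only hiding them.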


For prevention, an additional point of view:

Temporal Drift Monitoring: Detect slow memory poisoning by watching for behavioral, goal, or plan drift over time. Treat memory like a cache and evict entries at regular intervals with strong forget policies.
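One way to picture this suggestion is a rolling comparison against a baseline plus a cache-style store with a time-to-live. The sketch below rests on assumptions: `DriftMonitor`, `ExpiringMemory`, the scalar "behavior score", and the threshold values are hypothetical names and numbers chosen for illustration.

```python
import time
from collections import deque


class DriftMonitor:
    """Flags when the recent mean of a behavior score departs from a baseline."""

    def __init__(self, baseline, window=5, threshold=0.2):
        self.baseline = baseline
        self.recent = deque(maxlen=window)  # keeps only the last `window` scores
        self.threshold = threshold

    def observe(self, score):
        self.recent.append(score)
        mean = sum(self.recent) / len(self.recent)
        return abs(mean - self.baseline) > self.threshold


class ExpiringMemory:
    """Cache-style memory: entries are forgotten once they reach ttl_seconds."""

    def __init__(self, ttl_seconds):
        self.ttl = ttl_seconds
        self._items = {}  # key -> (value, stored_at)

    def put(self, key, value, now=None):
        self._items[key] = (value, time.time() if now is None else now)

    def get(self, key, now=None):
        now = time.time() if now is None else now
        item = self._items.get(key)
        if item is None:
            return None
        value, stored_at = item
        if now - stored_at >= self.ttl:
            del self._items[key]  # expired entries are evicted on read
            return None
        return value

    def evict_expired(self, now=None):
        """Sweep meant to run on a schedule; returns how many entries were dropped."""
        now = time.time() if now is None else now
        expired = [k for k, (_, t) in self._items.items() if now - t >= self.ttl]
        for key in expired:
            del self._items[key]
        return len(expired)
```

A windowed mean reacts to sustained shifts rather than single outliers, which fits the slow poisoning case described above, though choosing a meaningful score is the hard part and is left open here.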

Mohsin ([email protected])

thismohsin · 2025-09-22T20:07:29Z

...e/agentic-top-10/Sprint 1-first-public-draft-expanded/ASI06_Memory_and_Context_Poisoning .md

+3. Systemic Misalignment and Backdoors: Memory poisoning can have subtler and more severe consequences than simply producing wrong results. A poisoned LLM can take on a new, malicious persona that deviates from its intended purpose. Attackers can also use this technique to install a backdoor, such as a hidden instruction that stays dormant until a specific trigger phrase is entered. When the LLM encounters that phrase, it carries out the hidden malicious instructions, such as producing destructive code or transmitting sensitive data.
+
+4. Cascading Failures and Data Exfiltration: A single poisoned memory entry in a multi-agent system (MAS) can have a domino effect that results in cascading failure. One agent may retrieve corrupted data and share it with others, destabilizing the system. Malicious instructions can also be planted in memory as persistent instructions that cause the LLM to access sensitive user or enterprise data and send it to an attacker. This exfiltration is a significant risk because the model may hold legitimate access to data repositories and then be manipulated into using that access maliciously.
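To make the cascading scenario concrete, here is a small sketch that traces which memory entries were built on a poisoned one. It assumes a provenance record is kept at write time. The `derived_from` mapping, the entry ids, and the `blast_radius` name are all hypothetical.

```python
from collections import deque


def blast_radius(derived_from, poisoned_id):
    """Return ids of all entries derived, directly or indirectly, from poisoned_id.

    derived_from maps an entry id to the list of ids it was built from
    (a hypothetical provenance record, not something the draft specifies).
    """
    # Invert the provenance map: parent id -> ids of entries built on it.
    children = {}
    for entry_id, parents in derived_from.items():
        for parent in parents:
            children.setdefault(parent, []).append(entry_id)

    # Breadth-first walk over everything downstream of the poisoned entry.
    affected, queue = set(), deque([poisoned_id])
    while queue:
        current = queue.popleft()
        for child in children.get(current, []):
            if child not in affected:
                affected.add(child)
                queue.append(child)
    return affected
```

With a record like this, one poisoned web document that fed agent A's summary, which in turn fed agent B's plan, would surface both downstream entries as candidates for quarantine or rollback.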



Proposing as an additional cause:

Cognitive Drift refers to the slow, unintended divergence of an agent's internal understanding or memory from the real world due to:

* Context accumulation noise (imprecise memory blending over time)
* Incomplete or partial rollbacks after memory poisoning or error correction
* Benign feedback loops (e.g. agents "confirming" each other's summaries or plans)
* Stale or decayed memory vectors causing off-target retrievals
* Summary hallucinations in memory compression steps (e.g. distillation of chat history)

Unlike direct attacks, cognitive drift is unintentional and often goes undetected until it triggers a cascade, leading to systemic failure without a clear attacker.
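As a rough illustration of catching one of these causes, summary hallucinations during memory compression, the sketch below flags summaries that introduce many terms absent from their source. Term overlap is only a crude proxy, and the function names and the 0.3 threshold are assumptions made for this example.

```python
import re


def _terms(text):
    """Lowercased alphanumeric tokens as a set."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def novel_terms(source_text, summary_text):
    """Terms that appear in the summary but never in the source."""
    return _terms(summary_text) - _terms(source_text)


def summary_is_suspect(source_text, summary_text, max_novel_ratio=0.3):
    """True when too large a share of the summary's terms is new."""
    summary_terms = _terms(summary_text)
    if not summary_terms:
        return False
    ratio = len(novel_terms(source_text, summary_text)) / len(summary_terms)
    return ratio > max_novel_ratio
```

A check like this would miss paraphrased drift and would over-flag legitimate abstraction, so it is better read as a trigger for review than as a verdict.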

Josh-Beck · 2025-09-25T12:42:25Z

Changes have been added, and this is in a good initial state. I would like to get the PR merged so further comments can be handled in separate PRs rather than cluttering this one. If we want a different approach, that's fine; please let me know!

Update ASI06_Memory_and_Context_Poisoning .md

fc1a238

Signed-off-by: Joshua Beck <[email protected]>

Josh-Beck requested review from guerilla7, hoeg and itskerenkatz as code owners September 17, 2025 18:51

itskerenkatz requested changes Sep 21, 2025

Update ASI06_Memory_and_Context_Poisoning .md

41c9683

Signed-off-by: Joshua Beck <[email protected]>

thismohsin reviewed Sep 22, 2025

added T&M

bce1ce7

Signed-off-by: Joshua Beck <[email protected]>

Josh-Beck requested a review from itskerenkatz September 25, 2025 12:41
