Prompt injection is exploiting enterprise AI's biggest design flaws by targeting agents, RAG pipelines and model routers

u7277289442 A modern interpretation of cybersecurity. 3D ar f806db26 b93b 427c b0c9 42caa83dfe92 0
Over the past two years, businesses have been trying to fit large language models (LLM) into support, analytics, development, and internal automation like never before.

Along with the increase in adoption of AI technology, another trend is gaining momentum – cyber criminals are exploiting the gap between perceptions about LLMs and their actual characteristics.

In 2025 and 2026, multiple independent sources have highlighted the same trend: Prompt Injection remains one of the most influential and widely demonstrated attack vectors against LLM systems. The OWASP LLM Top 10 (2025) lists prompt injection as LLM01, identifying it as the most significant category of LLM-specific vulnerabilities for the second consecutive edition. OWASP’s ranking reflects the fact that LLMs still struggle to reliably distinguish instructions from data, making them vulnerable to manipulation through crafted input.

CrowdStrike’s 2026 Global Threat Report – built on frontline intelligence from more than 280 tracked adversaries – documented that threat actors injected malicious signals into legitimate generative AI tools in more than 90 organizations in 2025. The report clearly states: "Indications are new malware." AI-enabled opponents increased their overall attack volume by 89% year-over-year, with quick injections serving as both entry points and force multipliers.

Real-world events reflect operational impact. In August 2024, researchers at PromptArmor disclosed an instant injection vulnerability in Slack AI that allowed an attacker to exfiltrate data from private Slack channels to which they had no access – including API keys shared in private developer channels – by placing a malicious directive in a public channel or embedding it in an uploaded document.

In June 2025, researchers at AIM Security disclosed EchoLeak (CVE-2025-32711, CVSS 9.3), the first documented zero-click prompt injection exploit against production AI systems targeting Microsoft 365 Copilot. By sending a single crafted email, requiring no user interaction, an attacker can cause Copilot to access internal files and transmit their contents to an attacker-controlled server.

Both vulnerabilities were fixed. These incidents underscore the fact that early injection is not a theoretical weakness, but a practical, repeatable threat organizations must address as they deploy AI systems at scale.

Recent years have seen major developments in prompt injection techniques, now targeting multi-agent architectures, retrieval-augmented generation (RAG) pipelines, model routers, and long-term memory capabilities.

EEnterprise Challenge: Too Much Trust

Businesses deploy LLMs to process instructions, summarize information, and trigger automated workflows, but it’s difficult for an LLM to tell:

  • Iinstructions from data

  • Iinformation from context

  • Cmetadata to ontext

  • Youser intent from metadata

This creates an opportunity for attackers to directly or indirectly manipulate and influence the behavior of the model.

modern prompt injection

cross-model prompt injection

The use of LLM is a common practice among enterprises. Attackers corrupt the output of a particular model, knowing full well that other models may be processing the content. Therefore, corruption spreads through all AI systems.

RAG supply chain poisoning

AAttackers create malicious information – documentation, blog articles, GitHub READMEs. They then wait until this malicious information reaches enterprises’ RAG pipelines, then use it as an attack vector.

agent kidnapping

AI agents have evolved to the point where they can send emails, modify cloud infrastructure, execute code snippets, and interact with internal corporate systems. Agents need only one instruction to act differently in a harmful way.

Context overflow attack

With the help of the million-token context window, attackers insert malicious code within the document and hope that the LLM will stumble upon it and execute it, thus overriding all previous instructions.

memory poisoning

Due to the implementation of long-term memory in LLMs, attackers can inject instructions that permanently reconfigure their state.

model-router manipulation

Enterprises are increasingly using model routers to choose between multiple LLMs. The attacking craft gives signals that force one to move towards the weakest or least protected model.

Why does this matter to business leaders?

Early injection is not a theoretical problem. It directly affects:

  • CCustomer-facing systems (chatbots, support agents)

  • IInternal co-pilot (developer equipment, safety assistant)

  • AAutomation workflows (ticketing, cloud operations, HR processes)

  • DATA Governance (RAG Pipeline, Knowledge Base)

The risk is no longer limited to this "The model said something she shouldn’t have said."

In 2026, prompt injection may:

  • Tearigging in unauthorized works

  • levery sensitive data

  • CDisrupting internal workflow

  • mAnalyze Analytics

  • Alater business logic

  • CCompromise multi-agent systems

The attack surface has expanded dramatically.

What should enterprises do now

1. Constrain Model Permissions

Limit what the model can do, not just what it should do.

2. Segment unreliable content

Treat all external data, including RAG sources, as potentially hostile.

3. Monitor Tool Invocation

High-impact actions require human approval.

4. Validate Content Origin

Ensure that RAG pipelines do not ingest toxic foreign materials.

5. Rigid Model Router

Prevent attackers from forcibly rooting vulnerable models.

6. Treat LLMs as untrusted components

This mindset shift is the foundation of modern AI security.

bottom line

Prompt injection is the most effective method for compromising enterprise AI systems because it exploits the fundamental way LLMs interpret text. Unless organizations treat LLMs as untrusted interpreters – not autonomous decision-makers – quick injection will continue to dominate the AI ​​threat landscape.

Julie Brunias is an AI security architect.



<a href

Leave a Comment