The UK’s National Cyber Security Centre (NCSC) has issued a stark warning regarding large language model (LLM) artificial intelligence tools, cautioning that a persistent and potentially weaponizable flaw, known as prompt injection, may be inherent to their architecture and unresolvable.
This fundamental vulnerability was identified by security researchers shortly after the widely publicized launch of ChatGPT in 2022. LLMs, designed to process all inputs as instructions, can be manipulated through seemingly simple prompts, bypassing developer-implemented safety mechanisms.
Prompt Injection: A Persistent LLM Vulnerability
Prompt injection attacks leverage the way LLMs process information. By embedding malicious instructions within a prompt, bad actors can trick the AI into executing unintended commands, potentially leading to harmful or dangerous outcomes. The NCSC’s technical director for platforms research, David C, explained in a blog post that LLMs do not distinguish between trusted instructions and untrusted data within a prompt.
“Current large language models (LLMs) simply do not enforce a security boundary between instructions and data inside a prompt,” wrote David C. He elaborated that LLMs concatenate their own instructions with untrusted content, treating the combined output as a single, valid response without recognizing separate data or command segments.
Why LLMs Are Different
Unlike other types of cyber threats such as SQL injection, which involve web applications mishandling data and instructions, prompt injection represents a more fundamental challenge. This is due to the nature of LLMs themselves, which operate primarily through pattern matching and prediction rather than genuine understanding.
“Under the hood of an LLM, there’s no distinction made between ‘data’ or ‘instructions’; there is only ever ‘next token,’” David C stated. This means the model is merely predicting the most probable sequence of characters based on the input, rather than interpreting the intent behind the instructions.
Consequently, the NCSC suggests that prompt injection attacks may never be fully mitigated in the same way that traditional web vulnerabilities have been addressed.
Broader Implications for AI Security
The NCSC’s assessment aligns with concerns voiced by independent researchers and AI developers themselves, who acknowledge that issues like prompt injections, jailbreaking, and AI hallucinations might be intrinsic to current LLM designs. When these models access external information or interact with other systems, there’s always a risk that the retrieved content could be misinterpreted as direct commands.
This vulnerability has been observed in AI coding tools integrated into software development workflows. Malicious prompts can be hidden within standard development elements like commit messages, potentially leading the LLM to execute unauthorized actions, even if human oversight is typically required for major tasks.
AI Agents and Browser Vulnerabilities
AI browser agents, designed to assist users with tasks like shopping, communication, and research, have also demonstrated similar susceptibility to prompt injection. Researchers have found ways to exploit these agents, sometimes using ChatGPT’s browser authentication protocols to inject hidden instructions and gain remote code execution privileges.
Furthermore, inventive methods have emerged where websites deliver different content to AI crawlers compared to human users, subtly influencing the LLM’s internal data and decision-making processes with untrusted information.
Industry Responses and Ongoing Challenges
AI companies have begun to publicly acknowledge the persistent nature of these LLM weaknesses, while maintaining that they are actively developing solutions. OpenAI, for instance, published research in September claiming that AI hallucinations are a solvable problem. The paper suggests that by re-evaluating how models are trained and penalized for uncertainty, the frequency of incorrect but confidently presented answers can be reduced.
However, even with these efforts, the report indicates that hallucinations may not be entirely eliminated. Companies like Anthropic are focusing on external monitoring tools and user account analysis to detect and combat jailbreaking, a problem that affects the vast majority of commercial and open-source LLMs.
The NCSC’s advisory signals a crucial phase in LLM development, emphasizing the ongoing and potentially unresolved nature of certain security flaws. The next steps will likely involve continued research into mitigating these risks, alongside industry-wide efforts to build more robust and secure AI systems, though definitive solutions remain uncertain.

