SIGNALAI·May 25, 2026, 4:00 AMSignal75Short term

RAG-Pull: Turning Retrieval into a Code-Injection Channel via Invisible Unicode Perturbations

arXiv:2510.11195v2 Announce Type: replace-cross Abstract: Retrieval-Augmented Generation (RAG) increases the reliability and trustworthiness of the LLM response and reduces hallucination by eliminating the need for model retraining. It does so by adding external data into the LLM's context. We develop a new class of black-box attack, RAG-Pull, that inserts hidden UTF characters into queries or external code repositories, redirecting retrieval toward malicious code, thereby breaking the models' safety alignment. We observe that query and code perturbations alone can shift retrieval toward attac

Why this matters

Why now

The rapid advancement and deployment of RAG systems in LLMs create a timely vulnerability for new attack vectors, as security measures often lag behind development.

Why it’s important

This attack vector demonstrates a novel method to compromise LLM safety, directly impacting the integrity and trustworthiness of AI systems reliant on external data.

What changes

The assumption that RAG inherently improves LLM safety is challenged, as hidden perturbations can turn retrieval into a code-injection channel.

Winners

· Cybersecurity firms
· AI safety researchers
· Developers of robust RAG infrastructure

Losers

· Organizations using vulnerable RAG systems
· Developers of LLMs without robust input sanitization

Second-order effects

Direct

Increased focus on input sanitization and verification for data used in RAG systems.

Second

Potential for new regulations or industry standards for securing AI systems against such code injection attacks.

Third

Erosion of public trust in AI applications if such vulnerabilities are exploited in high-stakes scenarios.

Editorial confidence: 90 / 100 · Structural impact: 60 / 100

Original report

This signal links to a primary source. Continuum Brief monitors and indexes it as part of the live intelligence stream — we do not republish source content.

Read at arXiv cs.AI

#cs.CR #cs.AI

Tracked by The Continuum Brief · live intelligence network

The Brief · Weekly Dispatch

Stay ahead of the systems reshaping markets.