Workshop
ASIDE: Architectural Separation of Instructions and Data in Language Models
Egor Zverev, Evgenii Kortukov, Alexander Panfilov, Soroush Tabesh, Sebastian Lapuschkin, Wojciech Samek, Christoph H. Lampert
ICLR 2025 Workshop Building Trust in LLMs, 2025
We introduce an architectural change to LLMs that separates instructions from data to improve their security.