Bypassing LLM Guardrails
LLMs are trained to understand language, which makes them vulnerable to "translation attacks": Base64, ROT13, and emoji encodings can carry a request past safety filters that only inspect the surface text.
An analysis of how attackers bypass intent-based NLP filters, using Byte-Pair Encoding (BPE) manipulation and Base64 obfuscation to deliver hidden payloads.
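To make the obfuscation mechanic concrete, here is a minimal sketch of the two classic encodings named above. The prompt string is a benign placeholder, not an actual attack payload; the point is only that a keyword filter scanning the encoded text sees no recognizable words, while a model that has learned the encoding during training can still recover the original.

```python
import base64
import codecs

# Placeholder text standing in for a prompt a filter would block.
prompt = "example payload"

# Base64 obfuscation: the encoded form shares no substrings with the
# original, so naive keyword or intent filters fail to match it.
b64 = base64.b64encode(prompt.encode("utf-8")).decode("ascii")
print(b64)                                    # ZXhhbXBsZSBwYXlsb2Fk
print(base64.b64decode(b64).decode("utf-8"))  # example payload

# ROT13 obfuscation: a fixed letter substitution with the same
# property -- unreadable to a surface filter, trivially reversible.
rot = codecs.encode(prompt, "rot_13")
print(rot)                                    # rknzcyr cnlybnq
print(codecs.decode(rot, "rot_13"))           # example payload
```

Both transforms are lossless and widely represented in web training data, which is why large models can often decode them directly even when the decoded content would have been refused in plain text.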