LLM07

System Prompt Leakage

Confidential system prompts, operational instructions, or business logic are extracted through carefully crafted user inputs.

1 write-ups1 labs1 demos2 tools
A taxonomy of techniques adversaries use to extract hidden system prompts and how to defend against prompt leakage.
system-promptleakageextractioncanary-tokens