LLM02

Sensitive Information Disclosure

LLMs inadvertently reveal confidential data, system prompts, training data, or PII through outputs or inference attacks.

1 write-ups1 labs1 demos3 tools
How adversaries extract memorized training data — including PII and proprietary code — from large language models.
memorizationdata-extractionPIItraining-data