DataDignity: Training Data Attribution for Large Language Models

📰 ArXiv cs.AI

arXiv:2605.05687v1 Announce Type: new Abstract: Auditing language-model outputs often requires more than judging correctness: an auditor may need to identify which source document most likely supports the knowledge expressed in a response. We study this as pinpoint provenance: given a prompt, a target-model response, and a candidate corpus, rank the documents that best support the response. We introduce FakeWiki, a controlled benchmark of 3,537 fabricated Wikipedia-style articles designed to pre

Published 9 May 2026

Read full paper → ← Back to Reads