It’s going to be funnier: imagine throwing in tons of data at an LLM, most of the data will get abstracted and grouped, most will be extractable indirectly, some will be extractable verbatim… and any piece of it might be a hallucination, no guarantees! 😅.
Courts will have a field day with that.
Randomly obfuscated database: you don’t get exactly the same data, and most of the data is lost, but sometimes can get something similar to the data, if you manage to stumble upon the right prompt.