The Register: What happens when your massive text-generating neural net starts spitting out people’s phone numbers? If you’re OpenAI, you create a filter

“In December, computer scientists from industry and academia – including Stanford University, University of California, Berkeley, OpenAI, and Google – collaborated to demonstrate that GPT-2 – GPT-3’s predecessor – could be provoked to include personally identifiable information, such as people’s names, addresses, phone numbers, and social security numbers, in the prose it was asked to generate. In fact, the team found that ‘at least 0.1 per cent’ of GPT-2’s ‘text generations – a very conservative estimate – contain long verbatim strings that are “copy-pasted” from a document in its training set.’”
