Check if your fics got scraped
2025-05-12 08:04 pm
If you haven't heard, about 3 weeks ago an AI bro scraped all of AO3 and several other sites, intending to sell the dataset on multiple machine learning forums for profit.
Longer version on Reddit here
WHAT TO DO via the admin team at Paperdemon
CHECK OUT THEIR CURRENT STATUS CHART IF YOU'RE WORRIED
Last I heard, AO3 has filed DMCA takedown requests with HuggingFaces dot co and several other hosts, including datafish dot ru and at least one Chinese website. The Digital Millennium Copyright Act is a US law, so the foreign sites aren't required to comply.
IF YOU ARE WONDERING WHETHER YOUR FICS ARE PART OF THE DATASET
One tumblr user has created a searchable database
HERE
This thing is 13 MILLION ROWS LONG. Searches are slow. I do NOT recommend searching the database from a mobile device.
Stay safe, folks.