If you haven't heard, about 3 weeks ago, an AI bro scrape all of AO3, and several other sites, intending sell the dataset on multiple machine learning forums for profit.

Longer version on Reddit here

WHAT TO DO via the admin team at Paperdemon
CHECK OUT THEIR CURRENT STATUS CHART IF YOU'RE WORRIED

Last I heard, AO3 has filed a DMCA take down request on HuggingFaces dot co and several others, including datafish dot ru and at least one Chinese website. The Digital Millennium Copyright Act is a US law, so the foreign agents aren't required to comply.

IF YOU ARE WONDERING WHETHER YOUR FICS ARE PART OF THE DATASET
One tumblr user has created a searchable database
HERE

This thing is 13 MILLION ROWS LONG. Pull requests are slow. I do NOT recommend searching the database on from a mobile device.

Stay safe, folks.

March 2026

S M T W T F S
1234567
891011121314
15161718192021
22232425262728
293031    

Syndicate

RSS Atom

Most Popular Tags

Style Credit

Expand Cut Tags

No cut tags
Page generated 2026-03-23 08:47 am
Powered by Dreamwidth Studios