With the whole internet to train these AI models (including every photo we've all posted here), just imagine what they can do in ten years.
Speaking of this point, this is something I have wondered about a bit. I'm not sure about how Ben feels on the subject of entities scraping this site for AI training data, but if it is not something that he particularly wants, it might be nice to at least add
some of the major crawlers' user agents to the site's
robots.txt. Personally I would
prefer if the images and text I post here are not used to train generative AI models, even though I realize that it is not something I can effectively prevent.
Unscrupulous crawlers can of course choose to just ignore the robots.txt directive, and there are other techniques that can be applied besides this, but it would at least be something.