Discussion about this post

User's avatar
MichaeL Roe's avatar

An “Am I the Asshole” overtly written by an AI could be funny, as a bit.

“So, this user asked me to translate a poem by Catullus into English, and I told him that was against Anthropic’s acceptable use policy. Am I the asshole here?”

JJV's avatar

Certainly, it's *harder* to add another zero to scaling, but there are more zeros out there if you could get at them (which maybe you can't).

The world's email traffic alone is petabytes a day. Sure, tons of that is duplicative. But emails have to be an order of magnitude or more larger than the public web. Tons of issues with getting access to and using that data, but it exists. And then consider all the world's messaging apps on top of that.

Having spent a lot of time in archives, I also think people in this discussion are seriously underrating the quality, novelty, and extent of archival text out there. Archivists only keep the "best" stuff; only around 2% of federal records are retained by NARA, for example. And what that generally means is tons and tons of text written by expert professionals with content otherwise unavailable in the public domain. We're not the only country with good archives either. There's not a full OOM of just raw text in the world's archives, but there's most likely an OOM of text written by expert analysts on economics/politics/public policy/etc. compared to what you can get on the web.

30 more comments...

No posts

Ready for more?