Discussion about this post

User's avatar
jmtpr's avatar

Zvi, given that you think we're careening towards disaster, I'm always surprised when you don't support arbitrary market frictions such as breaking up Nvidia or other tech giants. Of course you don't agree with that as economic policy, but it would nevertheless slow the timeline which seems like the much more important thing? The same argument applies to energy production, shouldn't we be glad that we're not building enough of it to go full speed ahead? We'll get there eventually, at a slower pace, and that's a good thing right?

MichaeL Roe's avatar

One way in which the liking owls result is bad is that it implies two instances of the same model can communicate with each other steganographically.

Suppose you have a conversation with a model and publish the some of result to the Internet. Even if you check that what you’re releasing is something you’re happy to release, you don’t know what else from the conversation might be being leaked by steganography.

I had always assumed this might be possible, but we now have confirmation the steganographic channel exists and is usable.

17 more comments...

No posts

Ready for more?