Discussion about this post

User's avatar
Brandon Reinhart's avatar

Prompt injections violate the user's trust. I greatly dislike them. It means the clear stream of communication between me and the agent isn't actually what it looks like.

Mira's avatar

Do you think the scary part is that model welfare knobs will mostly get judged by their side effects before anyone agrees what “welfare” even means?

7 more comments...

No posts

Ready for more?