Discussion about this post

User's avatar
Robert M.'s avatar

"Timothy Lee orders a replacement lightbulb from Amazon "

How many AI Agents does it take to change a lightbulb?

Expand full comment
Yoav Tzfati's avatar

From reading the system card for agent, it seem like the bio risk mitigations probably aren't robust? They say that red teamers and UK AISI found a bunch of jailbreaks, and that the jailbreaks were patched before release, but not that the system then underwent another round of red teaming. See 5.2.3 Safeguard testing

Expand full comment
15 more comments...

No posts

Ready for more?