Discussion about this post

User's avatar
Robert M.'s avatar

"Timothy Lee orders a replacement lightbulb from Amazon "

How many AI Agents does it take to change a lightbulb?

Yoav Tzfati's avatar

From reading the system card for agent, it seem like the bio risk mitigations probably aren't robust? They say that red teamers and UK AISI found a bunch of jailbreaks, and that the jailbreaks were patched before release, but not that the system then underwent another round of red teaming. See 5.2.3 Safeguard testing

15 more comments...

No posts

Ready for more?