Discussion about this post

User's avatar
Performative Bafflement's avatar

> In particular, as I parse this taxonomy the Whispering Earring problem seems not covered. One can consider this the one-human-agent version of Gradual Disempowerment.

I wrote a post unironically arguing that we should be pro "whisper earring."

Basically, at *least* 2/3 of people do a terrible job running their lives, and having Phd-smart or smarter personal assistants that know everything about them arguing and convincing them into better habits and behaviors in terms that resonate most strongly with their own values and thinking styles is an unmitigated good. People themselves would love to have this option today. We're just talking price.

In other words, there's a massive Pareto optimal frontier that makes both individuals and societies better off on a huge number of metrics if we're pro "whisper earring."

And this extends - I close with an argument that you should even do this for your kids. I think this is basically a "consequentalist vs virtue ethicist" scissor scenario.

Still a draft scheduled to come out a week from now, but happy to preview it and get hate mail / comments / feedback from the savvy Zvi crowd here: https://performativebafflement.substack.com/p/69613283-ff5c-4f1f-a84d-d918f3f07c96

Expand full comment
Katalina Hernández's avatar

This was so, so informative. I'm a lawyer in AI governance (doing some AI safety research too) and I was planning to blindly tackle this paper but... 120 pages! You've given me a lot to ponder.

With the way regulation is going, specially in Europe, it's more likely that we'd be able to advocate for Control mechanisms to be embedded in some sort of Best Practices or Standard for AI companies, than alignment.

And that's a problem.

"I think you mostly do need the AGI to be either corrigible or aligned with human values, in some intuitive sense that is very hard to pin down that comes down to wanting to adhere to the spirit of various human intents and what humans broadly care about in the right tricky combinations, or else you end up with ‘the genie knows what you meant but does not care’ problems."

I absolutely agree. And I'm hoping to translate as much as this into governance...

Thanks again for the great read ☺️

Expand full comment
16 more comments...

No posts