Discussion about this post

User's avatar
Kevin's avatar

I have a bit of experience with the "catching bad actors" sort of work at large internet companies. The main difficulty with spam filtering is that it isn't just a classification problem. It's an adversarial system and the spammers get to test against your filter. So catching 99.99% of the spam in your test set can easily mean you miss millions of spam messages once the spammers test and see which 0.01% you miss.

So the problem isn't just classification, it's operational, like how can you delay the testing procedures of the spammers, or how quickly can you deploy new filters. To fix your spam you also have to fix mundane software engineering issues like "it's hard for us to continually and quickly do an automatic deploy to production" and often the "speeding up your infrastructure" tasks end up being the majority of the actual work of the spam-fighting teams.

Expand full comment
Askwho Casts AI's avatar

First, Podcast episode for this post:

https://open.substack.com/pub/dwatvpodcast/p/ai-120-while-o3-turned-pro

Second, you'll be shocked to hear I have opinions on the humanlike AI voices from the "Fun With Media Generation" section. For some reason all of the development effort is going into "conversational" voice models, with ums and ahs and verbal tics. It's clear people are driving towards voices that can either explicitly trick you, or fool your subconscious into believing you're talking to a human conversation partner. It is really frustrating because, in my opinion, AI narrators with solid, clear, narrative delivery are much more valuable, and development in this area towards quality at a much cheaper price would do worlds of good for accessibility etc.

Expand full comment
41 more comments...

No posts