Discussion about this post

User's avatar
Alistair Windsor's avatar

The hallucinated citations in the MAHA report are the tip of the iceberg and, like a real iceberg, it is the submerged portion that sinks ships. The submerged part of this are the real publications whose study results are incorrectly reported. This is the thing that dooms AI research in my experience. Not the fake citations but the studies that are correctly cited that do not show what they are purported to show. AI is not a substitute for expertise and I want my policy to be informed by expertise.

Expand full comment
Jeffrey Soreff's avatar

"Alex Albert (the claim Blow was quoting): Since Claude 4 launch: SWE friend told me he cleared his backlog for the first time ever, another friend shipped a month's worth of side project work in the past 5 days, and my DMs are full of similar stories. I think it's undebatable that devs are moving at a different speed now. You can almost feel it in the air that this pace is becoming the default norm."

I don't think the importance of "cleared his backlog" can be overstated. While I retired 5 years ago, in the course of my programming career, the backlog of known bugs _never_ got cleared. And every one of those bugs caused someone enough pain to go through the hassle of reproducing it, constructing a case that provoked it, reporting it, and (usually) arguing about its priority. If Claude is now reliable enough to routinely (help) fix bugs as fast as they are reported, this is a _major_ improvement, even if AI never did anything else. A lot of the world now runs on software. Routinely improving the reliability of ordinary software is a _very_ big deal.

Expand full comment
31 more comments...

No posts