15 Comments

When you say the video demo wasn't faked, what do you mean? I thought the video demo was a dishonest, dramatic reenactment of the actual, much more boring behavior. They did not have a video feed as input with Gemini Ultra responding live with low latency; they fed in still photos and text prompts together offline and then created the video after the fact in post.

Edit: OK, rereading your words, I think I misunderstood you. You're saying you did indeed see the claims that it was faked, and their blog post describing the multimodal prompts, but you are doubtful that the video really is fake because why would Google do that? Is my interpretation correct?


Ah, that would be... more fake than I realized, if that's right, in a 'wow that's really bad' kind of way.


Gotcha.

Here's the blog post that people think is the actual demo: https://developers.googleblog.com/2023/12/how-its-made-gemini-multimodal-prompting.html


I was confused by the video where the tester shows Gemini pictures of the sun, Saturn, and Earth, and asks "Is this the right order?".

This puzzle is poorly worded and has no correct answer. It's the wrong order if you want the planets arranged by proximity to the sun. It's the right order if you want them arranged by mass (heavier --> lighter). Gemini doesn't even know that the left image represents OUR sun. It could be a star outside the solar system.

But then we see the text prompt, which fills in the blanks ("Consider the distance from the sun and explain your reasoning"). But I'm sure GPT-4 can solve puzzles like this, so what new ability is being demonstrated?


"I love that this is saying that OpenAI isn’t valuable both because Gemini is so good and also because Gemini is not good enough."

Why do you continue quoting Gary Marcus? I feel like he passed the "not a serious person" threshold a while ago, and while it's occasionally fun to dunk on him like this, there are uncountably many posters with dunkable takes. It's not like he's Yann LeCun or someone else in a position to make their silly opinions matter.


Love the image of the barely significant p-value!


"Certainly they are a long way from ‘beat OpenAI’ but this is the first and only case where someone might be in the game."

Maybe -- but what we have right now is benchmarks, and it's worth remembering how many models we've seen that looked good on benchmarks but turned out to be pretty bad in practice. I think we can't be very confident that they have a serious contender until we can actually start playing with Ultra in the wild.


Says a lot that OpenAI has an API and Gemini does not.

One is letting everyone kick their tires, the other is not.


To be fair, I don't think GPT-4 had day-one API access either.


Yeah, but Bard has been around since March and has never had an API.


General style question: when you are putting two quotes from separate threads one after the other, why do you put them in a single quote block rather than each in its own quote block? It makes them harder to parse, even with the written explanation.

Similarly with quote tweets coming after the response tweet with no indication beforehand.


When the alternative seems worse, I do it and note I'm doing it, for flow.


The second case seems harder, but would two successive quote blocks really look bad? I feel like I see that all the time.


Benchmark hacking. Fake videos. No details on how the model actually works. I love AI in 2023 so much!!

Google has apparently trained a model about equal to (perhaps slightly better than) GPT-4. Depending on how you look at it, the glass is either half full ("OpenAI now has competition!") or half empty ("Google has equalled OpenAI's progress as of August 2022").

Considering the hype (Gemini was supposed to use 4x the compute of GPT-4 and implement all kinds of nifty MCTS tricks), this is substantially less than promised, and the release has really soured me on the whole thing.

>I love that ‘above 90%’ turns out to be exactly 90.04%

It's like when the teacher assigns a 2000-word report and you turn in one that's precisely 2001 words long. I'm imagining Sundar Pichai screaming at the eng team "I don't care what you do! Get a 90% MMLU out of this model or no pay bonus!"
