3.7 did score higher in coding benchmarks but in practice 3.5 is much better at ...

sannee · on May 2, 2025

I suspect that is precisely why it got better at coding benchmarks.

spaceman_2020 · on May 2, 2025

3.7 is too overactive

I prefer Gemini 2.5 pro for all code now

hombre_fatal · on May 2, 2025

Gemini 2.5 Pro has solved problems that Claude 3.7 cannot, so I use it for the hard stuff.

But Gemini is at least as overactive as Claude, sometimes even more overactive when it comes to something like comment spam.

Of course, this can be fixed with prompting. And sometimes it feels sheepish complaining about the machine god doing most of my chore work that didn't even exist a couple years ago.

conception · on May 2, 2025

2.5 is my “okay Claude can’t get it” but first I check my “bank account” to see if I can afford it.

ralusek · on May 2, 2025

Isn’t 2.5 pro significantly cheaper?

yunwal · on May 2, 2025

They're the same price, and Gemini has a large free tier.

conception · on May 3, 2025

Not when you’re doing 500k tokens per query.

UncleEntity · on May 2, 2025

I think it just does that to eat up your token quota and get you to upgrade.

Like, ask it a simple question and it comes up with a full repo, complete with a README and a Makefile, when all you wanted to know was how efficient a particular algorithm would be in the included code.

Can't wait until the add research to the Pro plan because, you know, I have questions...

vineyardmike · on May 2, 2025

> I think it just does that to eat up your token quota and get you to upgrade.

If you pay for a subscription then they don’t have an incentive to use more tokens for the same answer.

It’s definitely because feedback from people has “taught” it that more boilerplate is better. It’s the same reason ChatGPT is annoyingly complementary.

suyash · on May 2, 2025

That has been the most annoying thing about it, so glad not paying for it anymore.

danw1979 · on May 2, 2025

Can’t you still use Sonnet 3.5 anyway ? or is that a paying subscriber feature only ?