With 2T params (!!), it better outperform everything else.

amarcheschi · 2025-04-05T19:28:38 1743881318

Given that the comparison doesn't include o3 or Gemini 2.5 Pro, I'd say it doesn't. Looking at the comparison tables available for both Llama 4 Behemoth and Gemini 2.5 Pro, it seems like at least a few of the comparable items might be won by Gemini.

https://blog.google/technology/google-deepmind/gemini-model-...

wmf · 2025-04-05T19:36:09 1743881769

We don't know how many params GPT-4, Claude, and Gemini are using, so 2T could be in the same ballpark.