Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

With 2T params (!!), it better outperform everything else.


Given that the comparison doesn't include O3 or gemini pro 2.5, I'd say it doesn't. Looking both at the comparison table available for llama 4 behemoth and gemini pro 2.5 it seems like at least a few of the comparable items might be won by gemini

https://blog.google/technology/google-deepmind/gemini-model-...


We don't know how many params GPT-4, Claude, and Gemini are using so it could be in the ballpark.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: