Hacker Newsnew | past | comments | ask | show | jobs | submit | sssummer's commentslogin

Why frontier models can both achieve silver medal in Math Olympiad but also fail to answer "which number is bigger, 9.11 or 9.9"?


..because not all systems are of the same quality.


Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: