Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I think this is fairly easily debunked by o1, which is basically just 4o in a thinking for loop, and performs better on difficult tasks. Not a LOT better, mind you, but better enough to be measurable.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: