Hacker News
mediaman | 15 days ago | on: Measuring AI Ability to Complete Long Tasks
Much of this is due to vastly better post-training RL, not to much bigger models. The idea that most of these gains come from training really big models, or from throwing immensely larger amounts of compute at them, is not really true.