Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

For majority of those benchmarks it produced the right code from the first go. It struggled more with Java VirtualThreads - probably fewer programs in the training set. It also had a tendency to overcomplicate things (adding unncecessary code). So there were a few iterations needed plus some hand edits.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: