Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I agree. I've seen people insist moving to a newer model or fine tuning will make the output more clever, "trust me", sometimes without providing any evidence of before and after for the specific use case. One LLM project I saw released was prettymuch useless, but it wasn't the use case or the architectural limitations that were the problem, nope the next thing on the roadmap was "fixing" it by plugging in a better LLM.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: