Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Isn't it the case that the latest models actually hallucinate more than the ones that came before? Despite best efforts to prevent it.


The o3 model card reports a so far unexplained uptick in hallucination rate from o1 - on page 4 of https://cdn.openai.com/pdf/2221c875-02dc-4789-800b-e7758f372...

That is according to one specific internal OpenAI benchmark, I don't know if it's been replicated externally yet.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: