Isn't it the case that the latest models actually hallucinate more than the ones... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		codr7 9 months ago \| parent \| context \| favorite \| on: Jagged AGI: o3, Gemini 2.5, and everything after Isn't it the case that the latest models actually hallucinate more than the ones that came before? Despite best efforts to prevent it.

simonw 9 months ago [–]

The o3 model card reports a so far unexplained uptick in hallucination rate from o1 - on page 4 of https://cdn.openai.com/pdf/2221c875-02dc-4789-800b-e7758f372...

That is according to one specific internal OpenAI benchmark, I don't know if it's been replicated externally yet.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact