nextzck 22 days ago, on: The First Fully General Computer Action Model

I think you guys are on the right track here. I’d love to learn more about the math behind the FDM. I don’t think folks realize how behind we are on vision. Thank you for your work here.

nee1r 22 days ago

Thanks! The math and architecture of the FDM (no video encoder) are pretty simple: it's a regular transformer trained on next-token prediction, but with frames interleaved into the sequence.
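For anyone who wants to see what "next-token prediction with frames interleaved" could look like in data terms, here is a minimal sketch. It is not the FDM's actual pipeline. It assumes each frame is already a short list of integer token ids and that each frame is followed by a chunk of action tokens; the comment does not say how frames are represented, and `interleave` and `next_token_pairs` are hypothetical helper names.

```python
from typing import List, Sequence, Tuple


def interleave(
    frames: Sequence[Sequence[int]],
    actions: Sequence[Sequence[int]],
) -> List[int]:
    """Flatten [frame_0, action_0, frame_1, action_1, ...] into one token stream.

    Assumption: one chunk of action tokens follows each frame.
    """
    if len(frames) != len(actions):
        raise ValueError("expected one action chunk per frame")
    stream: List[int] = []
    for frame_tokens, action_tokens in zip(frames, actions):
        stream.extend(frame_tokens)
        stream.extend(action_tokens)
    return stream


def next_token_pairs(stream: Sequence[int]) -> Tuple[List[int], List[int]]:
    """Standard shift-by-one: the model reads stream[:-1] and predicts stream[1:]."""
    return list(stream[:-1]), list(stream[1:])


# Toy ids only: 101-104 stand in for frame tokens, 7 and 8 for action tokens.
frames = [[101, 102], [103, 104]]
actions = [[7], [8]]

stream = interleave(frames, actions)
inputs, targets = next_token_pairs(stream)
```

The point of the sketch is that nothing special happens at the frame boundaries: the transformer sees one flat sequence and the training target is always just the next element.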
