Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Next try actually teaching it to see by training a projector with a vision encoder on gpt-oss.


About to release GPT-OSS-120B-Vision and GPT-OSS-20B-Vision, how about that! :D


too slow bro


might be slower, but then it can get the actual image as input, not just some description of it




Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: