
But this is still a great trick if you want to reduce latency or improve inference speed, even with local models, e.g. in a realtime chatbot.

