Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Actually that seems exactly wrong. unless you set temperature 0, converting logits to tokens is a random pull. so in principle it should be possible for an llm to recognize that it's being asked for a random number and pull tokens exactly randomly. in practice it won't be exact, but you should be able to rl it to arbitrary closeness to exact


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: