Hacker News

Yes, but I thought we were talking about a category difference.

Proper RLHF surely boosts "predicted the next token until it couldn't" so that it feels more like "actually recalled".


