Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Why frame it as rigging? I assume they would teach the models to improve on tasks the public find interesting. Then we just have to come up with more challenges for it.


It's not rigging—it's just RL.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: