Hacker Newsnew | past | comments | ask | show | jobs | submit | obastani's submissionslogin
1.Effective Reinforcement Learning for Reasoning in Language Models (arxiv.org)
4 points by obastani 8 months ago | past
2.Generative AI Can Harm Learning (ssrn.com)
4 points by obastani on July 15, 2024 | past
3.Efficient and targeted Covid-19 border testing via reinforcement learning (nature.com)
1 point by obastani on Sept 22, 2021 | past
4.Simple random search provides a competitive approach to reinforcement learning (arxiv.org)
3 points by obastani on March 20, 2018 | past

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: