Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

The title is a bit much, no?


Yes, it violates site guidelines and should be "AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback"


Not really. Pretty much the "killer app" feature of ChatGPT is RLHP. Whether or not the current RLHP-ed Alpca really beats ChatGPT, it is pretty obvious that local LLMs can be RLHP-ed and it is only a matter of time before people realize running an RLHP-ed LLM locally is a better option than running ChatGPT with all the security concerns of running something "in the cloud" (which is just "somebody else's computer" in the famous saying).


I was referring to the HN guidelines against editorializing titles.


I’m sorry what’s RLHP? I’m not able to Kagi that


The P should be an F, it's reinforcement learning from human feedback


Reinforcement learning through human feedback.

Took me a bit of searching too.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: