The title is a bit much, no?

version_five · on May 24, 2023

Yes, it violates site guidelines and should be "AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback"

jhbadger · on May 24, 2023

Not really. Pretty much the "killer app" feature of ChatGPT is RLHP. Whether or not the current RLHP-ed Alpca really beats ChatGPT, it is pretty obvious that local LLMs can be RLHP-ed and it is only a matter of time before people realize running an RLHP-ed LLM locally is a better option than running ChatGPT with all the security concerns of running something "in the cloud" (which is just "somebody else's computer" in the famous saying).

monkpit · on May 24, 2023

I was referring to the HN guidelines against editorializing titles.

fnordpiglet · on May 24, 2023

I’m sorry what’s RLHP? I’m not able to Kagi that

version_five · on May 24, 2023

The P should be an F, it's reinforcement learning from human feedback

stavros · on May 24, 2023

Reinforcement learning through human feedback.

Took me a bit of searching too.