Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Is that technically not a new pretrained model?

(Also not sure how that would work, but maybe I’ve missed a paper or two!)





I'd say for it to be called a new pretrained model, it'd need to be trained from scratch (like llama 1, 2, 3).

But it's just semantics.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: