Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

That's really the issue isn't it. Many of the LLMs are trained uncritically on very thing. All data is viewed as viable training data, but it's not. Reddit clearly have good data, but it's probably mostly garbage.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: