Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

This specific model is only trained on 100 billion tokens, so it's not SOTA by any means, but we've got designs on larger training runs later :)


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: