Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I consider the weights a binary program and the source code is the training data. The training algorithm is the compiler.

I agree this isn't standard terminology, but it makes the most sense to me in terms of power dynamics and information flow.

We know from interpretability research that the weights do algorithms eg sin approximation etc. So they feel like binary programs to me.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: