Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Author states in the first paragraph of their blog post:

"I've been wanting to understand transformers and attention better for awhile now—I'd read The Illustrated Transformer, but still didn't feel like I had an intuitive understanding of what the various pieces of attention were doing. What's the difference between q and k? And don't even get me started on v!"



lol three people said the same thing, i get that. thats why i said other than learning and satisfying curiosity...




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: