Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Very good content! It's interesting that you use Rust here.


(author here)

I actually did a v0 writeup in Go, but I wanted a language that had a bit more powerful type system and more support for a fluent/functional style in some of the expressions. I was optimizing for what I know and felt was highly expressive; I've since gotten a bunch of feedback from people who are interested in the content but not comfortable reading Rust, so perhaps it wasn't the best choice.


What does the`.0` calling convention mean in Rust? e.g. ``` for (i, r) in right.0.iter().enumerate() { out.0[i] += r; } ```

Aside from that, I thought it was fairly legible. Great write-up by the way. Squashing things into state helps get rid of some of the spookiness created by matrix multiplication and back-propagation. I also really appreciated seeing the explanation on the actual MLP part of the transformer as that is typically assumed to be prior knowledge in other tutorials.


It's to access the tuple's single element, but the author could have used #[repr(transparent)]


You should mention that you’re using rust. I got distracted trying I figure out what language you were using.


Python would be awesome!


I think it’s better to use other languages for pseudo code. Rust is not the easiest language to read and understand for people who never used it.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: