Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

solid port. the sdpa swap for sparse attention — did you notice a meaningful quality difference, or is it basically equivalent to the cuda version? curious if the pure-pytorch path added any noticeable latency hit on the m3 max


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: