Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

FWIW I ran a quick test of gemma.cpp on M3 Pro with 8 threads. Similar PaliGemma inference speed to an older AMD (Rome or Milan) with 8 threads. But the AMD has more cores than that, and more headroom :)


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: