Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

How is it that these models boast these amazing benchmark results, but using it for 30 seconds it feels way worse than Gemma3?


Are you running the full versions, or quantized? Some models just don't quantize well.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: