Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Google reports a lower score for Gemini 3 Pro on SWEBench than Claude Sonnet 4.5, which is comparing a top tier model with a smaller one. Very curious to see whether there will be an Opus 4.5 that does even better.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: