Hacker News

really? >>many tasks that do not suffer from "need to check every single time"

like which tasks?

How do you decide whether you need to check or not?

If you're asking it to complete 100 sequences and the error rate is 5%, which 5% of the sequences do you think it messed up or _thought_ otherwise? If the 5% is in the middle, would the next 50 sequences be okay?
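To put rough numbers on that: a minimal sketch, assuming each of the 100 sequences fails independently with probability 5%. The independence is an assumption made only for illustration (the second question above is exactly whether an early mistake contaminates what follows), and the function names are hypothetical.

```python
import random


def prob_all_correct(n_items: int = 100, error_rate: float = 0.05) -> float:
    """Chance that every item is right, assuming independent errors."""
    return (1 - error_rate) ** n_items


def simulate_error_positions(n_items: int = 100, error_rate: float = 0.05, seed: int = 0) -> list[int]:
    """Return the indices of the items that came out wrong in one simulated run.

    Independence between items is an illustrative assumption, not a claim
    about how a real model behaves.
    """
    rng = random.Random(seed)
    return [i for i in range(n_items) if rng.random() < error_rate]


if __name__ == "__main__":
    print(f"P(all 100 correct) = {prob_all_correct():.4f}")
    for seed in range(3):
        # The wrong items land in different places on each run.
        print(f"run {seed}: wrong at {simulate_error_positions(seed=seed)}")
```

Under these assumptions a fully clean batch is rare (0.95^100 is about 0.6%), and the output alone doesn't say where the errors landed, so knowing the rate does not tell you which items to recheck.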



> really? >>many tasks that do not suffer from "need to check every single time"

> like which tasks?

Making slop.


If I ask an LLM to guess what number I’m thinking of and it’s wrong 99.9% of the time, the error is not in the LLM.



