
Shhhhh no one cares about data contamination anymore.




Then write something down yourself and upload a picture to gemini.google.com or ChatGPT. Hell, combine it. Make yourself a quick math test, print it, solve it with a pen, and ask these models to correct it.

They're very good at it.


I don't know how to write like a 19th century mathematician, nor like anyone earlier. I'm not sure OCR on Carolingian minuscule has been solved, let alone more ancient styles like Roman cursive or, god forbid, things like cuneiform. Especially since the corpora for these styles are so small, dataset contamination /is/ a major issue!

For that to be relevant to this post, they would need to write with secretary hand.


