Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Agreed, this is what makes evaluating this very hard. A 1700 Elo chess player would never make an illegal move, let alone have 12% illegal moves.

So from the model's perspective, we have at the same time display of both brilliancy (most 1700 chess players would not be able to solve as many puzzles by looking just at the FEN notation) and on the other side complete lack of any understanding of what is it trying to do from a fundamental, human-reasoning level.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: