> At this point, I think it can only be explained by ignorance, bad faith, or fear of becoming irrelevant.
Based on the past history with frontier-math & AIME 2025 [1],[2] I would not trust announcements which cant be independently verified. I am excited to try it out though.
Also, the performance of LLMs was not even bronze [3].
Finally, this article shows that LLMs were just mostly bluffing [4].
Based on the past history with frontier-math & AIME 2025 [1],[2] I would not trust announcements which cant be independently verified. I am excited to try it out though.
Also, the performance of LLMs was not even bronze [3].
Finally, this article shows that LLMs were just mostly bluffing [4].
[1] https://www.reddit.com/r/slatestarcodex/comments/1i53ih7/fro...
[2] https://x.com/DimitrisPapail/status/1888325914603516214
[3] https://matharena.ai/imo/
[4] https://arxiv.org/pdf/2503.21934