https://chatgpt.com/share/67755e6f-bfc8-8010-9aa3-8bcbbd9b264b

jsheard · 2025-01-01T15:42:16 1735746136

To be clear, I was testing with 4o; good to know that o1 has a better grasp of basic arithmetic. Regardless, my point was less to do with the model's ability to do math and more to do with OpenAI seeming to cover up its lack of ability.

whimsicalism · 2025-01-01T15:56:26 1735746986

i think it’s mostly that o1 mini can think through the solution before it starts writing the poem.

i’m able to reproduce your failure on 4o