Hacker News
whimsicalism
11 months ago
on:
30% drop in O1-preview accuracy when Putnam proble...
https://chatgpt.com/share/67755e6f-bfc8-8010-9aa3-8bcbbd9b26...
jsheard
11 months ago
To be clear, I was testing with 4o; good to know that o1 has a better grasp of basic arithmetic. Regardless, my point had less to do with the model's ability to do math and more to do with OpenAI seeming to cover up its lack of ability.
whimsicalism
11 months ago
i think it’s mostly that o1 mini can think through the solution before it starts writing the poem.
i’m able to reproduce your failure on 4o