Hacker Newsnew | past | comments | ask | show | jobs | submit | PunchTornado's commentslogin

I think the vast majority of coding us done through claude and gemini

Funny. What lead? Gemini and Claude are much better.

Yeah. I also do not see the lead. Claude Opus writes better code and for conversation all models even Le Chat is just better than ChatGPT currently.

> for conversation all models even Le Chat is just better than ChatGPT currently

Not sure what you’re smoking, but I want some.


I am smoking this thing called: putting same prompt in four different apps and seeing which ones give me answers and which ones hallucinate and patronize me, but considering your comment I can see how you would prefer ChatGPT

Having the same experience during development of my MCP App. ChatGPT is by far the worst, slow, hallucinating or just quitting. Claude is the best with amazing results and Mistral Le Chat surprisingly good.

Have you tried actually holding a conversation with it? I'm really puzzled in which world Gemini/Claude is better than ChatGPT for day-to-day tasks/conversations.

Claude can't even search products on Amazon, Jesus.


Jesus doesn't care I guess.

come on, using a monthly paid subscription to obtain auth tokens to use claws bots is quite obviously agains T&C. you need to pay api prices for that. I am sure 100% of those knew they were doing something wrong but proceeded anyway.

Sometimes I wonder where I am when people are so shocked. I genuinely don’t understand who would think this is allowable? Is this simply a younger generation and I am old now? API keys vs the auth tokens smells the same as public vs private APIs, don’t be surprised you get shut off if you are using a private API.

To the extent that that's true, it would be in the opposite direction? Auth tokens are meant to be used by the User Agent to effect the wishes of user, often encode permissions the user has, and are used with public APIs like those intended for web browsers. API keys are usually for private communication like server to server.

The usual expectation is you don't care what agent the user is running. You just care about what they're doing with it (permissions, rate limits, etc.).


Honestly that’s a detail far removed from the discussion. Folks are surprised they cannot use something that would obviously be against the T&Cs.

Everyone knows no one reads terms and that it isn't feasible for a normal person to do so, so I don't know why it would "obviously" be against them to anyone. If you're paying for a subscription with known limits, you'd expect you can use up to those limits. It's no more obvious to me than if you used the API token and got banned for using another client, or if a website decided to ban Firefox users.

I just fail to see your argument. You are paying for Claude code or Antigravity. Not for the raw underlying compute. It’s not about reading T&Cs but the expectation is just because you are paying for a service does not give you the right to freely use the API however you want. Hence why I said it really reminds me of a private vs public API. Don’t be surprised if you get shutout of the private API. All subscriptions are bound by acceptable use.

Maybe I am out of touch but I struggle why folks are surprised by this. I would argue that banning accounts is probably too harsh but we will see if that is a short term remedy.

There is a reason that in general the cost of a token via API is more expensive than when using the consumer tool.


I wouldn't expect consumers to even be aware that API keys exist, much less know the pricing differences. When I go to the Google One plans page, it just says I get all these AI things with higher limits. Then there's some tools that can use my account to do cool stuff. I wouldn't expect that a program that's logging into an AI service that I pay for as me to do AI things is it all untoward? No more than running a bot that just did high level control and delegated to their specific program (which is what all of this AI stuff and really software in general is about: automating whatever you're doing). Or when I give codex an auth token to use Jira or Gitlab. I expect that's the intended purpose of the auth token: let me perform whatever actions I need to do that I'm authorized to do within whatever limits the service sets.

Literally the entire buzz around all this AI stuff is that it lets you automate stuff and do more things faster. Why would you not expect people to automate their interactions with the AI service itself? AI automating its own interactions with itself is what all the AI companies are pushing as the immediate future and paradigm shift for everyone to hop onto.


That’s a fair point if you are just shooting from the hip then I can see it happening and being shocked. Still surprising to see the shock here on HN I would expect most to understand why it would not be a viable path.

Ah this would be too good to be truth. I had my iphone for 6 years and finally had to ditch it this year because of the battery. otherwise the device was good, all i needed. felt so bad that I should discard it just for the battery.

I'm waiting to see a more egregious company than openai and a bigger scammer ceo like altman. no, thank you. i hope openai goes bankrupt. especially since the ousting of ilya.

no. only tpus

I don't think models blow other models anymore. We have the big 3 which are neck to neck in most benchmarks and the rest. I doubt that 5.3 will blow the others.

easy now

The biggest increase is LiveCodeBench Pro: 2887. The rest are in line with Opus 4.6 or slightly better or slightly worse.

but is it still terrible at tool calls in actual agentic flows?

At least it's not Facebook or other online monster. When I read the piece I thought if it wasn't for superbowl that money would have went straight in the garbage.

The system data issue on iphone annoyed me since forever. It is The reason i switched to android.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: