In the coworking I am in people are hitting limits on 60$ plan all the time. They are thinking about which models to use to be efficient, context to include etc…
I’m on claude code $100 plan and never worry about any of that stuff and I think I am using it much more than they use cursor.
Tell them to use the Composer 1.5 model. It's really good, better than Sonnet, and has much higher usage limits. I use it for almost all of my daily work, don't have to worry about hitting the limit of my 60$ plan, and only occasionally switch to Opus 4.6 for planning a particularly complex task.
Ran a bunch of 3D Modeling benchmarks on Gemini 3.1 vs Gemini 3.
Unsurprisingly 3.1 performs a bit better. But surprisingly it costs 2.6x as much ($0.14 vs. $0.37 per 3D Model Generation) and is 2.5x slower (1m 24s vs. 3m 28s).
To me it feels like "lets increase our thinking budget and call it an improved model!"
I am building pretty much the same product as OP, and have a pretty good harness to test LLMs. In fact I have run a tons of tests already. It’s currently aimed for my own internal tests, but making something that is easier to digest should be a breeze. If you are curious: https://grandpacad.com/evals
Dimensionally accurate AI 3D modelling. My grandpa has a 3D printer but struggles to use any complex tools. So I am working on this chat interface to allow him to do some simple models.
So far he has triggered more than 150 generations. It’s getting better every model cycle and gives me something I enjoy working on.
Serial usecases ("fix this syntax errors") will go on Cerebras and get 10x faster.
Deep usecases ("solve Riemann hypothesis") will become massively parallel and go on slower inference compute.
Teams will stitch both together because some workflows go through stages of requiring deep parallel compute ("scan my codebase for bugs and propose fixes") followed by serial compute ("dedupe and apply the 3 fixes, resolve merge conflict").
I've been using 5.1-codex-max with low reasoning (in Cursor fwiw) recently and it feels like a nice speed while still being effective. Might be worth a shot.
Very interesting thanks! I wonder what would happen if you kept running Gemini in a loop for a while. Considering how much faster it ended it seems like there is a lot more potential.
Developers remember, you can always push back on design requirements instead of bringing in more bloat.
I was sitting next to one of the devs in a co-working space and he was trying to figure out some specific layout issue in react native. He spent 4 hours + installed a dependency to be able to do something completely tiny on a privacy policy screen. He asked me how I would do it, I told him to just ask if it can be laid out differently. He got it approved and implemented in 10 minutes. No bloat.
I'm finding it's better to use "javascript-less" UI frameworks [Pico.CSS, Skeleton, Bulma, Tailwind/daisyUI]. You get most of the benefits using good use of CSS. Anyone used these JS-free solutions and have recommendations?
My current choice is DaisyUI. It’s pretty good and because it’s based on tailwind you get the rest of the ecosystem benefits. Super easy to extend and change. Class bloat is much more manageable than raw tailwind.
Very cool and quite advanced compared to my tool. I've been working on something similar, although not an addon for SolidWorks, but a web SaaS. Initially started it as a tool to help my grandpa make some simple models (ChatGPT clicked for him SolidWorks was impossible).
Still have a long way to go, but if anyone wants to try you can do it here: https://grandpacad.com
If you want more free credits send me an email and I'm happy to give you some.
I’m on claude code $100 plan and never worry about any of that stuff and I think I am using it much more than they use cursor.
Also, I prefer CC since I am terminal native.
reply