More

tarr11 · 2026-03-07T15:59:59 1772899199

52 here, been a full time people manager for about a decade now. Coding manually makes me tired just thinking about it. When I think about embark on a new project my mind goes back to all the times I worked 12 hour days trying to get some basic system to function. I’m too old for that now, my back hurts if I sit too long and occasionally get migraines if I look at a screen too much.

Using AI has been really perfect for me. I can build stuff while I do other things, walk the dog, make lunch, sit on the porch.

Sometimes i realize that my design was flawed and I just delete it all and start again, with no loss aversion.

sidrag22 · 2026-03-07T16:10:30 1772899830

> Using AI has been really perfect for me. I can build stuff while I do other things, walk the dog, make lunch, sit on the porch.

this resonates with me strongly, while i like coding, and understanding it, I understand my human limitations. I couldn't possibly write by hand the stuff I've been making, in the time I am making it, without a team these past few months. I would be coding literally all day, which while I sometimes enjoy the zoning out process of wiring stuff up, what i really enjoy is exactly what you described.

I enjoy being outside and walking my dog, taking a long shower, and cooking. All of these things are simple tasks with a good bit of repetition, and unlike wiring up some code or whatever, they allow my thoughts to flow, and I can think about where my projects are likely heading and what needs to be done next.

Those moments, even before heavy AI assisted coding, have always been the moments i cherish about software development.

tarr11 · 2026-03-06T15:43:32 1772811812

It says this in bold red at the top - "This is a preprint; it has not been peer reviewed by a journal."

I am not a climate scientist - how should I think about this statement? Normally I am looking for some statement that shows a document has been vetted.

epistasis · 2026-03-06T19:39:29 1772825969

For non-specialists, I think the most important view on papers is to not view them as nuggets of truth, but communications of a group of people who are trying to establish truth. No single paper is definitive!

Peer review is an important part of scientific publication, but it's also important for the general public to not view peer review as a full vetting. Peer reviewers look for things like reproducibility of the analysis, suitability of the conclusions given the methods, discussions of the limitations of the data and methods, appropriate statistical tests, correct approval from IRBs if there are humans or animals involved, and things like that. For many journals, the editors are also asking if the results are interesting and significant enough to meet the prestige of the journal.

Peer review misses things like intentional fraud, mistakes in computations, and of course any blind spots that the field has not yet acknowledged (for example, nearly every scientific specialty had to rediscover the important of splitting training and testing datasets for machine learning methods somewhat on their own, as new practitioners adopted new methods quickly and then some papers would slip through at the beginning when reviewers were not yet aware of the necessity of this split...)

Any single paper is not revealed truth, it's a step towards establishing truth, maybe. Science is supposed to be self-correcting, which also necessitates the mistakes that need correction. Climate science is one of the fields that gets the most attention and scrutiny, so a series of papers in that field goes a long ways towards establishing truth, much more so than, say, new MRI technology in psychology.

tgsovlerkhgsel · 2026-03-07T10:08:11 1772878091

Sometimes reviewers also look for whether the paper cites enough of their own papers, who is publishing it (regardless of whether the review is supposed to be anonymous or not), whether it clashes with a paper they're about to publish... science is just as full of politics and corruption (if not more) as any other field.

epistasis · 2026-03-08T05:01:57 1772946117

I almost added "place the research into the context of other relevant research" as another way of saying "cite enough of the peer reviewer's papers" but fair enough.

I'm not sure if science has as much corruption as other fields, but it definitely has politics. PIs get to their position without the typical selection process for leadership that happens in most larger orgs, so there's more fragile and explosive personalities than I find in other management/leadership positions.

jfengel · 2026-03-06T21:35:52 1772832952

I'd say that for a non-scientist, you should treat it as a non-event -- a paper that hasn't happened yet.

The climate is not something for which you need daily, weekly, or even monthly updates. Rather, this paper is just one more on top of a gigantic pile of evidence that that climate change is serious, something that we can and should do something about.

If the paper passes muster, you'll hear about it then, though all it'll do is very slightly increase your confidence in something that is already very well confirmed. Or, the paper may not pass review, in which case it doesn't mean anything at all, and you fall back on the existing mountain of evidence.

If the paper had reached the opposite conclusion, that might merit more investigation by you now, since that would potentially be a significant update to your beliefs. And more importantly, it would certainly be presented as if it were a fait accompli, even before peer review.

Instead, you can simply say, "I don't know what this paper means, but I already have a very well-founded understanding of climate change and its significance."

sleet_spotter · 2026-03-06T15:52:43 1772812363

Peer review is still very relevant in climate science. But given it is from well-respected authors, I am more inclined to trust the results at this stage.

Nevermark · 2026-03-07T05:37:14 1772861834

There is no need for "trust".

There is no benefit in non-expert readers inserting their own subjectivity into an already complex topic. Even for themselves.

What we know: It is an interesting paper. It is going to get attention.

Good to be aware. It is also good to reserve judgement while the community evaluates the results.

juujian · 2026-03-06T16:23:51 1772814231

It is already published at Geophysical Research Letters, a highly (if not the most) reputable source in the area. But that journal is behind a paywall: https://agupubs.onlinelibrary.wiley.com/doi/full/10.1029/202...

FabHK · 2026-03-07T01:55:43 1772848543

Oh, that contains an ELI5:

> Plain Language Summary The rise in global temperature has been widely considered to be quite steady for several decades since the 1970s. Recently, however, scientists have started to debate whether global warming has accelerated since then. It is difficult to be sure of that because of natural fluctuations in the warming rate, and so far no statistical significance (meaning 95% certainty) of an acceleration (increase in warming rate) has been demonstrated. In this study we subtract the estimated influence of El Niño events, volcanic eruptions and solar variations from the data, which makes the global temperature curve less variable, and it then shows a statistically significant acceleration of global warming since about the year 2015. Warming proceeding faster is not unexpected by climate models, but it is a cause of concern and shows how insufficient the efforts to slow and eventually stop global warming under the Paris Climate Accord have so far been.

jaredklewis · 2026-03-07T07:23:18 1772868198

A paper being peer reviewed is a good sign, but I feel like the signal is usually over interpreted.

Peer reviewed does not mean the findings of the paper are established fact or scientific consensus. It does not mean that the findings have been replicated by other scientists. It does not mean that the paper relied on a robust methodology, is free of basic statistical errors, or even free of logical fallacies.

Some of these limitations are due to the limitations of peer review itself. Others are just side effects of the way science works (for example, some ideas start as small, unimpressive experiments that are reported on in papers, and the strength of the findings is gradually developed over time). Obviously sometimes the prestige (or lack thereof) of the journal the paper is in decreases (or increases) some of these issues.

Anyway, peer review is a very noisy channel (IMHO).

tialaramex · 2026-03-06T21:27:20 1772832440

For one thing, some of the places which would publish this kind of thing will authorize authors to provide anybody and everybody pre-prints but not the final copy they published.

In principle you could go (pay to†) read the actual final published copy, maybe it's different, but almost always it's basically the same, the text is enough to qualify.

If you go to https://eel.is/c++draft/ you'll find the "Draft" C++ standard, and it has this text:

Note: this is an early draft. It's known to be incomplet and incorrekt, and it has lots of bad formatting.

Nevertheless, the people who wrote your C++ compiler used that "draft" document, because it isn't reasonable to wait a few years for ISO to publish the "real" document which is identical other than lacking that scary text and having a bunch of verbiage about how ISO owns this document and it mustn't be republished.

And you might be thinking "OK, I'm sure those GNU hippies don't pay for a real published copy, but surely the Microsoft Corporation buys their engineers a real one". Nope. Waste of money.

† If you have a relationship with a research institution it might have this or be willing to help you order it from somewhere else at no personal cost.

bjourne · 2026-03-07T04:11:47 1772856707

Pre-prints exists because it can take up to 18 months to get a paper published in a journal or reputable conference. Since lots of people can publish pre-prints[1] what you should think depends on the authors. If they have a record of publishing good research you should think highly of the paper.

[1] - Actually, there are hoops on pre-print repositories, such as arXiv, so not everyone can post there. I guesstimate that 99% of the public has no means of posting on arXiv.

mold_aid · 2026-03-07T01:11:11 1772845871

Are you? How many preprints are posted here every day?

tarr11 · 2026-02-19T18:16:41 1771525001

What do you think this particular prompt is evaluating for?

The more popular these particular evals are, the more likely the model will be trained for them.

Gander5739 · 2026-02-19T18:23:16 1771525396

Sea https://simonwillison.net/2025/Nov/13/training-for-pelicans-...

tarr11 · 2025-12-01T17:05:01 1764608701

This has convinced many non-programmers that they can program, but the results are consistently disastrous, because it still requires genuine expertise to spot the hallucinations.

I've been programming for 30+ years and now a people manager. Claude Code has enabled me to code again and I'm several times more productive than I ever was as an IC in the 2000s and 2010s. I suspect this person hasn't really tried the most recent generation, it is quite impressive and works very well if you do know what you are doing

stingraycharles · 2025-12-01T17:10:45 1764609045

If you’ve been programming for 30+ years, you definitely don’t fall under the category of “non-programmers”.

You have decades upon decades of experience on how to approach software development and solve problems. You know the right questions to ask.

The actual non-programmers I see on Reddit are having discussions about topics such as “I don’t believe that technical debt is a real thing” and “how can I go back in time if Claude Code destroyed my code”.

buildbot · 2025-12-01T18:05:41 1764612341

People learning to code always have had those questions and issues though. For example, “git ate my code’ or “I don’t believe in python using white space as a bracket so I’m going to end all my blocks with #endif”

stingraycharles · 2025-12-02T01:14:02 1764638042

Yes, I’m not saying this is new, I’m giving examples of what struggles non-programmers have.

agubelu · 2025-12-01T17:07:17 1764608837

Isn't that what the author means?

"it still requires genuine expertise to spot the hallucinations"

"works very well if you do know what you are doing"

pzo · 2025-12-01T17:18:34 1764609514

The author headline starts with "LLMs are a failure", hard to take author seriously with such a hyperbole even if second part of headline ("A new AI winter is coming") might be right.

hombre_fatal · 2025-12-01T17:16:41 1764609401

But it can work well even if you don't know what you are doing (or don't look at the impl).

For example, build a TUI or GUI with Claude Code while only giving it feedback on the UX/QA side. I've done it many times despite 20 years of software experience. -- Some stuff just doesn't justify me spending my time credentializing in the impl.

Hallucinations that lead to code that doesn't work just get fixed. Most code I write isn't like "now write an accurate technical essay about hamsters" where hallucinations can sneak through lest I scrutinize it; rather the code would just fail to work and trigger the LLM's feedback loop to fix it when it tries to run/lint/compile/typecheck it.

But the idea that you can only build with LLMs if you have a software engineer copilot isn't true and inches further away from true every month, so it kinda sounds like a convenient lie we tell ourselves as engineers (and understandably so: it's scary).

int_19h · 2025-12-01T21:00:34 1764622834

> Hallucinations that lead to code that doesn't work just get fixed

How about hallucinations that lead to code that doesn't work outside of the specific conditions that happen to be true in your dev environment? Or, even more subtly, hallucinations that lead to code which works but has critical security vulnerabilities?

hombre_fatal · 2025-12-01T23:36:16 1764632176

Replace "hallucination" with "oversight" or "ignorance" and you have the same issue when a human writes the code.

A lot of that will come to the prompter's own foresight much like the vigilance of a beginner developer where they know they are working on a part of the system that is particularly sensitive to get right.

That said, only a subset of software needs an authentication solution or has zero tolerance to some codepath having a bug. Those don't apply to almost all of the apps/TUIs/GUIs I've built over the last few months.

If you have to restrict the domain to those cases for LLMs to be "disastrous", then I'll grant that for this convo.

What about everything else?

lelanthran · 2025-12-02T14:50:36 1764687036

> A lot of that will come to the prompter's own foresight

And, on the current trend, how on earth are prompters supposed to develop this foresight, this expertise, this knowledge?

Sure, fine, we have them now, in the form of experienced devs, but these people will eventually be lost via attrition, last even faster if companies actually do make good on their threat to replace a team of 10 devs with a team of three prompters (former senior devs).

The short-sightedness of this, the ironic lack of foresight, is troubling. You're talking about shutting off the pipeline that will produce these future prompters.

The only way through, I think, will be if (very big if) the LLMs get so much better at coding (not code-gen) that you won't need a skilled prompter.

Good luck with that.

hombre_fatal · 2025-12-03T02:27:33 1764728853

> how on earth are prompters supposed to develop this foresight, this expertise, this knowledge?

I suppose curiosity. The same way anyone develops expertise in the abstractions below after getting excited about the higher layer.

fzeroracer · 2025-12-02T07:45:16 1764661516

Have you checked your package imports lately?

seaucre · 2025-12-01T17:30:03 1764610203

I have a journalist friend with 0 coding experience who has used ChatGPT to help them build tools to scrape data for their work. They run the code, report the errors, repeat, until something usable results. An agent would do an even better job. Current LLMs are pretty good at spotting their own hallucinations if they're given the ability to execute code.

The author seems to have a bias. The truth is that we _do not know_ what is going to happen. It's still too early to judge the economic impact of current technology - companies need time to understand how to use this technology. And, research is still making progress. Scaling of the current paradigms (e.g. reasoning RL) could make the technology more useful/reliable. The enormous amount of investment could yield further breakthroughs. Or.. not! Given the uncertainty, one should be both appropriately invested and diversified.

chomp · 2025-12-01T17:09:44 1764608984

For toy and low effort coding it works fantastic. I can smash out changes and PRs fantastically quick, and they’re mostly correct. However, certain problem domains and tough problems cause it to spin its wheels worse than a junior programmer. Especially if some of the back and forth troubleshooting goes longer than one context compaction. Then it can forget the context of what it’s tried in the past, and goes back to square one (it may know that it tried something, but it won’t know the exact details).

asah · 2025-12-01T17:25:56 1764609956

That was true six months ago - the latest versions are much better at memory and adherence, and my senior engineer friends are adopting LLMs quickly for all sorts of advanced development.

lm28469 · 2025-12-01T18:35:29 1764614129

Last week I gave antigravity a try, with the latest models and all, it generated subpar code that did the job very quickly for sure, but no one would have ever accepted this code in a PR, it took me 10x more time to clean it up than to have gemini shit it out.

The only thing I learned is that 90% of devs are code monkeys with very low expectations which basically amount to "it compiles and seems to work then it's good enough for me"

weare138 · 2025-12-01T17:18:56 1764609536

..and works very well if you do know what you are doing

That's the issue. AI coding agents are only as good as the dev behind the prompt. It works for you because you have an actual background in software engineering of which coding is just one part of the process. AI coding agents can't save the inexperienced from themselves. It just helps amateurs shoot themselves in the foot faster while convincing them they're a marksman.

Lionga · 2025-12-01T17:09:40 1764608980

It seems to work well if you DONT really know what you are doing. Because you can not spot the issues.

If you know what you are doing it works kind of mid. You see how anything more then a prototype will create lots of issues in the long run.

Dunning-Kruger effect in action.

tarr11 · 2025-11-22T16:35:42 1763829342

The cars are usually stolen

tarr11 · 2025-11-07T15:51:04 1762530664

I’ve discovered a lot of research via two minute papers on YT. Entertaining and easy to understand.

https://youtube.com/@twominutepapers?si=hyvCvW4UwS0QBbrZ

tarr11 · 2025-10-18T17:59:43 1760810383

The drumming technique works for me for a few minutes if I need some temporary relief

https://treblehealth.com/tapping-technique-for-temporary-tin...

tarr11 · 2025-10-05T17:54:30 1759686870

Chicago had lower annualized endowment returns than similar universities, and so it couldn't support it's aggressive expansion.

https://www.ft.com/content/4501240f-58b7-4433-9a3f-77eff18d0...

UChicago’s strains came after its $10bn endowment — a critical source of revenue — delivered an annualised return of 6.7 per cent over the 10 years to 2024, among the weakest performances of any major US university.

The private university has taken a more conservative investment approach than many peers, with greater exposure to fixed income and less to equities since the global financial crisis in 2008.

“If you look at our audits and rating reports, they’ve consistently noted that we had somewhat less market exposure than our peers,” said Ivan Samstein, UChicago’s chief financial officer. “That led to less aggregate returns over a period of time.”

An aggressive borrowing spree to expand its research capacity also weighed on the university’s financial health. UChicago’s outstanding debt, measured by notes and bonds payable, climbed by about two-thirds in the decade ending 2024, to $6.1bn, as it poured resources into new fields such as molecular engineering and quantum science.

DiscourseFan · 2025-10-05T19:25:54 1759692354

A combination of bad bets and mismanagement. Ah! Well I have a friend who is currently going their for law school, so I shouldn't be celebrating this, it harms them and their career prospects.

tarr11 · 2025-08-17T15:38:50 1755445130

One of my hobbies is Houdini which is like Blender. While I agree with you that you can build a nice parameterised model in a few days - if you want to make an entire scene or a short film, you will need hundreds if not thousands of models, all textured and topolgized and many of them rigged, animated or even have simulations.

What this means is that making even a 2 minute short animation is out of reach for a solo artist. Your only option today is to go buy an asset pack and do your best. But then of course your art will look like the asset pack.

AI Tools like this reduce one of the 20+ stages down to something reachable by someone working solo.

thwarted · 2025-08-17T17:06:38 1755450398

> What this means is that making even a 2 minute short animation is out of reach for a solo artist.

Is it truly the duration of the result that consumes effort and the number of people required? What is the threshold for a solo artist? Is it expected that a 2 minute short takes half as much effort/people as a 4 minute short? Does the effort/people scale linearly, geometrically, or exponentially with the duration? Does a 2 minute short of a two entity dialog take the same as a 4 minute short of a monologue?

> Your only option today is to go buy an asset pack and do your best. But then of course your art will look like the asset pack.

What's more valuable? That you can create a 2 minute short solo or that all the assets don't look like they came from an asset pack? The examples shown in TFA look like they were procedurally generated, and customizations beyond the simple "add more vertexes" are going to take time to get a truly unique style.

> AI Tools like this reduce one of the 20+ stages down to something reachable by someone working solo.

To what end? Who's the audience for the 2 minute short by a solo developer? Is it meant to show friends? Post to social media as a meme? Add to a portfolio to get a job? Does something created by skipping a large portion of the 20+ steps truly demonstrate the person's ability, skill, or experience?

latexr · 2025-08-17T16:19:34 1755447574

> Your only option today is to go buy an asset pack and do your best.

There is a real possibility the assets generated by these tools will look equally or even more generic, the same generated images today are full of tells.

> What this means is that making even a 2 minute short animation is out of reach for a solo artist.

Flatland was animated and edited by a single person. In 2007. It’s a good movie. Granted, the characters are geometric shapes, but still it’s a 90 minute 3D movie.

https://en.wikipedia.org/wiki/Flatland_(2007_Ehlinger_film)

Puparia is a gorgeous 2D animated film done by a single person in 2020.

https://en.wikipedia.org/wiki/Puparia

These are exceptional cases (by definition, as there aren’t that many of them), but do not underestimate solo artists and the power of passion and resilience.

oblio · 2025-08-17T16:38:52 1755448732

Puparia is a 3 minute short film that took a veteran artist 3 years to make. I think you're making OP's point.

Ey7NFZ3P0nzAe · 2025-08-17T16:41:15 1755448875

There are always exceptions. I think the parent is refering to the many solo artists that would almost be able to make such great movies if not for some of the time constraints or life event etc. I'm sure there are countless solo artists that made 75% of a great movie then lacked time for unforeseeable reasons. Making the creation a bit easier allows much more solo artists to create!

tarr11 · 2025-06-25T00:53:17 1750812797

40% of US households pay no federal income tax.

https://taxpolicycenter.org/taxvox/tpc-number-those-who-dont...