More

nyeah · 2026-04-08T16:11:09 1775664669

"Lies are all we have."

If so, how do we distinguish between code that works and code that doesn't work? Why should we even care?

ajross · 2026-04-08T16:49:42 1775666982

> If so, how do we distinguish between code that works and code that doesn't work?

Hilariously, not by using our brains, that's for sure. You have to have an external machine. We all understand that "testing" and "code review" are different processes, and that's why.

nyeah · 2026-04-08T17:11:57 1775668317

Good point. We choose certain tests to perform. We choose certain test results to pay attention to. We don't just keep chatting about (reviewing) the code. We do something else.

If lies are all we have, then how is this behavior possible?

ajross · 2026-04-08T17:20:47 1775668847

LLMs can write and run tests though.

You're cherry picking my little bit of wordsmithing. Obviously we aren't always wrong. I'm saying that our thought processes stem from hallucinatory connections and are routinely wrong on first cut, just like those of an LLM.

Actually I'm going farther than that and saying that the first cut token stream out of an AI is significantly more reliable than our personal thoughts. Certainly than mine, and I like to think I'm pretty good at this stuff.

nyeah · 2026-04-08T18:57:40 1775674660

I don't think the complaint about cherry picking is quite fair. Most of your original comment consists of claims that we're bullshit machines, our internal dialog is almost 100% fantasy, we're hallucinating, etc. Those claims may be true. But I'm not carefully like curating them out of nowhere.

nyeah · 2026-04-06T16:55:10 1775494510

If a known-broken calculator claims it's broken, I more or less concur. (Chain of reasoning omitted here.)

nyeah · 2026-04-06T16:52:10 1775494330

It's disconcerting. But in 2026 it's not very surprising.

nyeah · 2026-04-06T16:26:40 1775492800

But, Doctor, the data does back that up. The US middle class is shrinking, and most of the shrinkage is on the low end. There's no mystery about this, only potential for distractions.

nyeah · 2026-04-01T14:16:28 1775052988

Rage farming with no scientific interest. Sad to see this upvoted to front page.

giantg2 · 2026-04-01T14:41:23 1775054483

There is an interesting question - how can we prove paternity or other DNA based questions with identical twins (full sequencing looking for mutations?) and if we can't, how do we handle legal responsibilities in this sort of case?

There's a lot of good material to discuss here.

mieses · 2026-04-02T07:43:45 1775115825

no there isn't but i appreciate your amusing stupidity. this is a good example of the state of exception that most people with common sense intuitively understand.

nyeah · 2026-03-30T13:29:19 1774877359

Can you give a few penciled numbers?

paulddraper · 2026-03-30T13:53:41 1774878821

You can rent a H100 GPU for $4/hour. [1]

300k tokens for that hour.

OpenAI charges $6.

Those are pessimistic assumptions.

[1] https://lambda.ai/instances

hajile · 2026-03-30T16:22:39 1774887759

Can you keep that GPU 100% saturated at least 16 hours per day every day of the week?

If not, you aren't breaking even.

paulddraper · 2026-03-30T18:50:34 1774896634

Note this is also assuming you

(1) Rent your GPUs.

(2) Pay list price, no volume breaks.

(3) Get only 85 tokens/sec. Realistically, frontier models would attain 200+ tokens/second amortized.

Inference is extremely profitable at scale.

aurareturn · 2026-03-30T19:46:54 1774900014

Assuming 80GB H100 and you inference a model that is MoE and close to the size of the 80GB VRAM, you're going to see around 10k tokens/second fully batched and saturated. An example here might be Mixtral 8x7B.

You're generating about 36 million tokens/hour. Cost of Mixtral 8x7b on Open router is $0.54/M input tokens. $0.54/M output tokens.

You're looking at potentially $38.88/hour return on that H100 GPU. This is probably the best case scenario.

In reality, inference providers will use multiple GPUs together to run bigger, smarter models for a higher price.

drakythe · 2026-03-30T14:07:06 1774879626

3.99 at 8x instances, with a minimum 2 week commitment. Good luck getting 70% usage average during that time. Useful when you're running a training round and can properly gauge demand, not so great when you're offering an API.

infecto · 2026-03-30T14:14:33 1774880073

Is it not a good penciled number? It helps set the directional tone that at inference cost is being covered.

drakythe · 2026-03-30T14:29:14 1774880954

It says the numbers are theoretically possible. Requiring a 66% usage to break even when 100% usage will piss off customers by invoking a queue means it’s a balancing act.

“Technically correct. The best kind of correct”. So inference may technically be _capable_ of being profitable, but I have question’s about them being profitable in _practice_.

nyeah · 2026-03-26T16:51:14 1774543874

The article is professional.

nyeah · 2026-03-26T16:34:15 1774542855

Soooo many comments here cite the "overreacting" point and then go on to prove it.

nyeah · 2026-03-26T13:16:16 1774530976

Another pack of evil volunteers trying to give us free stuff. Vote with your wallet. Stop paying your $0 per month until these crooks feel the pain.

nyeah · 2026-03-23T12:45:34 1774269934

One important thing is whether the tutoring is making better students, or just gaming the test.

CalRobert · 2026-03-23T12:54:34 1774270474

And after graduation they can grind leetcode, and after that they can practice social cues to get in the management class. It's gamed tests all the way down.

nyeah · 2026-03-23T13:01:22 1774270882

For people who choose that career path. Still, somewhere somebody is doing some work.

CalRobert · 2026-03-23T13:32:45 1774272765

The uggos I guess

jimbokun · 2026-03-23T18:47:37 1774291657

Are those independent?

nyeah · 2026-03-27T14:54:27 1774623267

That's tricky. I think it depends on what kind of gaming and what kind of test.