Demand for top models is definitely not saturated, at least when it comes to programming. If I could afford to use 5x more Claude Opus 4.6 tokens, I would!
Demand is relative. How many Claude tokens would you buy if they had a 10x price hike?
The market has achieved its current saturation level with loss-leader prices that remind me of the Chinese bike-share bubble[0]. Once those prices go up to break-even levels (let alone profitable levels), the number of people who can afford to pay will go down dramatically (and that's not even accounting for the bubble pop further constricting people's finances).
There is no evidence that the labs are losing money on inference subscriptions. They have massive fixed costs, but as long as inference revenue exceeds the cost of the datacenters used to serve it, all they need to do to become profitable is scale up. Right now software engineers are basically the only ones actually paying for inference; the labs just need to build coding-assistant-style tools for everything that are good enough that every white-collar worker in the country (world?) pays a $1,000/yr subscription. Certainly there's a lot of risk: will models become commoditized, with everyone switching to open models? Can they actually get non-software-engineers to pay for inference en masse? But it's not like there's no path.
If they've already built themselves a loyal customer base (which is usually the point of fighting a price war) and the customers are happy with the technology they have, then when funding is tight and turning a profit is more important, why wouldn't they pivot to optimizing inference: stop further training, freeze the model versions, burn the weights into silicon, and build better caching strategies, harnesses, and tools that lower their cost and increase their margin?
If all they do is hike prices then they'll lose customers to competitors who don't or who find a way to serve a similar model cheaper.
The demand isn't going to go away purely through higher prices. Once people know something is possible they will demand it whether supply is constrained or not. That's a huge bounty for anyone who can figure out how to service that demand.
Easier said than done. What you're describing can take years to implement. Can OpenAI et al. keep burning cash at the same rate for two years while they wait for the salvation of custom silicon if the investments dry up?
Don't you see the massive problem with requiring visual input? Are blind people not intelligent because they cannot solve ARC-AGI-3 without a "harness"?
A theoretical text-only superintelligent LLM could prove the Riemann hypothesis yet fail ARC-AGI-3, and so wouldn't even count as AGI according to this benchmark...
Well, it would be AGI if you could connect a camera to it to solve it, similar to how blind people would be able to solve it if you restored their eyesight. But if the lack of vision is a fundamental limitation of their architecture, then it seems more fair not to call them AGI.
I think I can confidently say they are not visually intelligent at all.
If you were trying to quantify intelligence, you would have a visual intelligence pillar. And they would not pass that pillar. It doesn't make them dysfunctional or stupid, but visual intelligence is a key part of human intelligence.
Visual intelligence is a near-meaningless term, as it's almost entirely dependent on spatial intelligence. The visually impaired do have high spatial intelligence; I wouldn't be surprised if their spatial intelligence is actually higher on average than that of people without visual impairment.
I think they don't actually lack those capabilities, or lack only a small fraction of them (their brains are ≈99% like a normal human brain), such that if they were an AI model, they could be fairly trivially upgraded with vision capability.
Assistance of other humans? You do realise we're talking about an intelligence test, right? At that point, what are you even testing for? I'm sure you've taken exams where you couldn't bring your own notes, use Google, or get help from someone, even though real life doesn't have those constraints.
Well said. That's exactly what has been rubbing me the wrong way with all those "LLMs can never *really* think, ya know" people. Once we pass some level of AI capability (which we perhaps already did?), it essentially turns into an unfalsifiable statement of faith.
> Ultra-processed foods: Ultra-processed foods typically have more than 1 ingredient that you never or rarely find in a kitchen. They also tend to include many additives and ingredients that are not typically used in home cooking, such as preservatives, emulsifiers, sweeteners, and artificial colours and flavours. These foods generally have a long shelf life.
Are such ingredients actually in the Beyond burger?
"E123 & co" are descriptors covering both "organic" and "synthetic" substances, because their role is to add precision and clarity to an engineering process, not entertain the pseudoscientific naturalistic bullshit masses buy into (which by itself is just a way for another industry to make money - or do you think people come up with those fitness/healthy eating fads all on their own?).
It is magical thinking to claim that LLMs are definitely physically incapable of thinking. You don't know that. No one knows that, since such large neural networks are opaque black boxes that resist interpretation, and we don't really know how they function internally.
You are just repeating that because you read that before somewhere else. Like a stochastic parrot. Quite ironic. ;)
They really aren't that mysterious. We can confidently say that they function at the lexical level, using Monte Carlo principles to carve out a likely path in lexical space.
The output depends on the distribution of n-grams in the training set and the composition of the text in its context window.
This process cannot produce reasoning.
1) an LLM cannot represent the truth value of statements, only their likelihood of being found in its training data.
2) because it uses lexical data, an LLM will answer differently based on the names / terms used in a prompt.
Both of these facts contradict the idea that the LLM is reasoning, or "thinking".
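For what it's worth, here is a minimal sketch of the stochastic decoding step being described: the model emits a score per vocabulary token, and the next token is sampled from the resulting distribution. The toy vocabulary and logits below are made up purely for illustration; a real model scores tens of thousands of tokens from learned weights, not a hard-coded table.

    import math
    import random

    vocab = ["cat", "dog", "the", "ran"]
    logits = [2.0, 1.5, 0.3, -1.0]  # hypothetical next-token scores

    def sample_next(logits, temperature=1.0):
        # Softmax turns scores into probabilities; temperature < 1 sharpens
        # the distribution, temperature > 1 flattens it.
        scaled = [l / temperature for l in logits]
        m = max(scaled)
        exps = [math.exp(l - m) for l in scaled]  # numerically stable softmax
        total = sum(exps)
        probs = [e / total for e in exps]
        return random.choices(vocab, weights=probs, k=1)[0]

    print(sample_next(logits, temperature=0.7))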
This isn't really a very hot take either; I don't think I've talked to a single researcher who thinks that LLMs are thinking.
I don't get what the "AI experiment" angle here is. The fact that AI can write Python code that makes sounds? And if the end product isn't interesting or artistically worthwhile, what is the point?
I have a deep background in music, and I think that while the creation was super basic, the way the output was so unconstrained (written by a model fine-tuned for coding) is really interesting. Listen to that last one and tell me it couldn't belong on some TV show. I've always had issues with AI-generated music because of the constraints and how derivative the output is. This was different to me.
What's the point if human-made art isn't interesting or artistically worthwhile?
(Most of it isn't.)
Art is on a sliding scale from "Fun study and experiment for the sake of it" to "Expresses something personal" to "Expresses something collective" to "A cultural landmark that invents a completely new expressive language, emotionally and technically."
All of those options are creatively worthwhile. Or maybe none of them are.
> What's the point if human-made art isn't interesting or artistically worthwhile?
Because it is a human making it, expressing something is always worthwhile to the individual on a personal level. Even if it's not "artistically worthwhile", the process is rewarding to the participant at the very least. That's why a lot of people find enjoyment in creating art even if it's not commercially successful.
But in this case, the criteria change for the final product (the music being produced). It is not artistically worthwhile to anyone, not even the creator.
So no, a person with no talent (their own claim) using an LLM to create art is, by default, much less worthwhile than a human being with no/any talent creating art on their own.
> Even if it's not "artistically worthwhile", the process is rewarding to the participant at the very least
I think that's the point, though. What the OP did was rewarding to them, and I found it more enjoyable than a lot of music I've heard that was made by humans. So don't be a gatekeeper on enjoyment.
How am I a gatekeeper? I provided my own opinions; you are free to enjoy what you want or disagree with me. If you want to get into an objective discussion of why you find it more enjoyable than human works, or of what art is, we can do that, but I do not like the personal slights.
I was discussing it with the commenter on the basis of the music and the actual product. Sure, if you want to go all Andy Kaufman, then yeah, the .html and this discussion are art, but I wasn't talking about it in the original context of the conversation.
At least it wrote a song, instead of stably-diffusing static into entire tracks from its training data. I can take those uninteresting notes, plug them into a DAW and build something worthwhile. I can only do this with Suno-generated stems after much faffing about with transposing and fixing rhythms, because Suno doesn't know how to write music, it just creates waveforms.
AI tools are decent at helping with code because they're editing language in a context. AI tools are terrible at helping with art because they are operating on the entirely wrong abstraction layer (in this case, waveforms) instead of the languages humans use to create art, and it's just supremely difficult to add to the context without destroying it.
I just want to know what's in there. It doesn't need to be artistic at all. They put terabytes of data into the training process and I want to know what came through.
Very interesting experiment! I tried something related half a year ago (LLMs writing midi files, musical notation or guitar tabs), but directly creating audio with Python and sine waves is a pretty original approach.
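For readers who want to see what that looks like concretely, here is a minimal sketch of the sine-wave approach, assuming 16-bit mono WAV output; the note frequencies and file name are my own illustration, not the OP's actual code.

    import math
    import struct
    import wave

    SAMPLE_RATE = 44100

    def sine_note(freq_hz, duration_s, amplitude=0.3):
        # Render one note as raw samples of a sine wave.
        n = int(SAMPLE_RATE * duration_s)
        return [amplitude * math.sin(2 * math.pi * freq_hz * i / SAMPLE_RATE)
                for i in range(n)]

    # A short C-major arpeggio: C4, E4, G4, C5 (frequencies in Hz).
    samples = []
    for f in (261.63, 329.63, 392.00, 523.25):
        samples += sine_note(f, 0.4)

    # Pack as 16-bit signed PCM and write a mono WAV file.
    with wave.open("melody.wav", "wb") as w:
        w.setnchannels(1)
        w.setsampwidth(2)
        w.setframerate(SAMPLE_RATE)
        w.writeframes(b"".join(struct.pack("<h", int(s * 32767)) for s in samples))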
Even with 1 TB of weights (the probable size of the largest state-of-the-art models), the network is far too small to contain any significant part of the internet as compressed data, unless you really stretch the definition of data compression.
Take the C4 training dataset, for example. The uncompressed, uncleaned size of the dataset is ~6 TB, and it contains an exhaustive English-language scrape of the public internet from 2019. The cleaned (still uncompressed) dataset is significantly less than 1 TB.
I could go on, but, I think it's already pretty obvious that 1TB is more than enough storage to represent a significant portion of the internet.
A lot of the internet is duplicate data, low quality content, SEO spam etc. I wouldn't be surprised if 1 TB is a significant portion of the high-quality, information-dense part of the internet.
I was curious about the scale of 1TiB of text. According to WolframAlpha, it's roughly 1.1 trillion characters, which breaks down to 180.2 billion words, 360.5 million pages, or 16.2 billion lines. In terms of professional typing speed, that's about 3800 years of continuous work.
So post-deduplication, I think it's a fair assessment that a significant portion of the high-quality text could fit within 1 TiB. Though 'high-quality' is a pretty squishy and subjective term.
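The back-of-envelope arithmetic is easy to reproduce; the characters-per-word, per-page, per-line, and typing-speed constants below are rough assumptions of mine, chosen to match the WolframAlpha figures above.

    # Rough scale of 1 TiB of plain ASCII text (~1 byte per character).
    chars = 2 ** 40               # ~1.1 trillion characters
    words = chars / 6.1           # ~6.1 chars per English word, space included
    pages = chars / 3050          # ~3,050 chars per page
    lines = chars / 68            # ~68 chars per line
    secs = chars / 9              # ~9 chars/sec, fast professional typing
    years = secs / (3600 * 24 * 365.25)

    print(f"{words/1e9:.1f}B words, {pages/1e6:.1f}M pages, "
          f"{lines/1e9:.1f}B lines, ~{years:,.0f} years of typing")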
This is obviously wrong. There is a bunch of knowledge embedded in those weights, and some of it can be recalled verbatim. So, by virtue of this recall alone, training is a form of lossy data compression.
Hmm... perhaps train a robot arm to do it?