It's not that Nvidia 'keeps winning'. Nvidia failed at mobile computing, failed at embedded computing, had no market share in the server or data center market, and was left in the niche market of gaming as a companion to the PC. It was not that impressive all along.
Then its parallel GPUs got "lucky": first Bitcoin mining, then AI. It probably did not expect or plan for this; to some extent, it got super lucky.
Credit must be given to its CUDA ecosystem and its ability to better itself when opportunity knocked at its door. So far it has left all competitors in the dust; its showtime finally arrived.
Nvidia got lucky because at every quarterly all-hands meeting Jensen repeated that he kept investing in CUDA and adding more silicon to the GPUs than strictly needed because, one day, an application would come along that would make it all worth it.
TFA says that Nvidia started aggressively seeding CUDA and GPUs for research in the early 2010s. It was much earlier than that: it started pretty much immediately after CUDA was introduced in late 2006. And with every new generation, hardware features were added to make GPU programming and the porting of applications less painful. The first Nvision conference, precursor of GTC, was in 2008. That's how you make your own luck.
I’ll never forget when, sometime around 2012?, he answered the question: “aren’t you afraid of Intel?”
His answer: “Not at all. Intel should be afraid of us. We will be bigger than them.” There was not a trace of doubt.
> "His answer: 'Not at all. Intel should be afraid of us. We will be bigger than them.' There was not a trace of doubt."
Given all the times that HN readers have derided grandiose executive pronouncements preceding flops, more people should recognize the above for what it is: not profundity but just puffery that happened to pan out. Not that skill and effort weren't involved in making it pan out but that any of a zillion things could have gone wrong to make that statement false and part of any manager's job is to project confidence and instill motivation despite knowing that.
I think he had a strategy - utilizing the massively parallel computation of GPUs for more general-purpose compute as Moore's law tailed off - and he noticed that Intel couldn't even see this coming in its rear-view mirror.
Everybody has known that Moore's law was on its way out, for speed increases at least, since the mid 2000s - the seminal article was by Herb Sutter [1]. So hardware needed to get more parallel. But multicore is a distinctly different paradigm from CUDA, which is closer to SIMD but on a completely different order of magnitude. So Intel was never going to skate to where the puck was going.
That’s the point, though. This is no different than any other statement made by a CEO with good engineers behind them.
This time it worked out; we can't discount survivorship bias here. I don't personally mind CEOs being encouraging, but at least understand that they don't really ever know.
IMO one big factor is that Nvidia is still fully engineering driven - it's engineers all the way to the top making the calls. Intel was like that as well, and then lost it (until Gelsinger). IMO you need domain experts in charge of companies, or they can't thrive in the long run, not unless there is an actual, almost unsurpassable moat.
It's called leadership. George Washington wasn't a brilliant general but he was able to convince people they were going to win against an empire. Whether he actually believed it himself we'll never know.
> Nvidia started aggressively seeding CUDA and GPUs for research in the early 2010s
I was at a niche graphics app startup circa 2000-2005 and even then NVidia invested enough to be helpful with info and new hardware, certainly better than other GPU companies. Post 2010 I was at a F500, industry leading tech company and an NVidia Biz Dev person came to meet with us every quarter usually bearing info and sometimes access to new hardware.
It's also worth noting that NVidia has consistently invested more than their peers in their graphics drivers. While the results aren't always perfect, NVidia usually has the best drivers in their class.
Oh interesting. I remember that Folding@Home way back then (ca 2009) was already testing protein folding on GPUs and it took advantage of CUDA. I never really thought much of it other than how cool it was that my mid-tier Nvidia graphics card could be used for something else other than games, but this explains how this ended up happening.
(Bit of a tangent, but that project was very influential in getting me interested in computer science because, wow, how cool is it that we can use GPUs to do insane parallel computing. So I guess, very very indirectly, Nvidia had a part in me being a software engineer today.)
By 2007/2008 there was a trend in HPC research called GPGPU. This involved hacky techniques to get the shaders to do the computations you wanted.
CUDA started appearing in 2008 with a framework (compiler, debugger) to do GPGPU in a proper way. It got a monopoly. They've been benefiting from first-mover advantage ever since.
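To make "the proper way" concrete for anyone who missed that era, here is a minimal sketch of my own (not from any particular SDK sample) of what early CUDA code looked like: an ordinary C kernel plus explicit host/device copies, no pixel-shader tricks required.

    #include <cstdio>
    #include <cuda_runtime.h>

    // Each thread handles one element; no textures, framebuffers or shader hacks involved.
    __global__ void saxpy(int n, float a, const float *x, float *y) {
        int i = blockIdx.x * blockDim.x + threadIdx.x;
        if (i < n) y[i] = a * x[i] + y[i];
    }

    int main() {
        const int n = 1 << 20;
        size_t bytes = n * sizeof(float);
        float *hx = new float[n], *hy = new float[n];
        for (int i = 0; i < n; ++i) { hx[i] = 1.0f; hy[i] = 2.0f; }

        float *dx, *dy;
        cudaMalloc(&dx, bytes); cudaMalloc(&dy, bytes);
        cudaMemcpy(dx, hx, bytes, cudaMemcpyHostToDevice);   // explicit upload
        cudaMemcpy(dy, hy, bytes, cudaMemcpyHostToDevice);

        saxpy<<<(n + 255) / 256, 256>>>(n, 3.0f, dx, dy);    // one thread per element
        cudaMemcpy(hy, dy, bytes, cudaMemcpyDeviceToHost);   // explicit download

        printf("y[0] = %f\n", hy[0]);                        // expect 5.0
        cudaFree(dx); cudaFree(dy); delete[] hx; delete[] hy;
    }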
GPGPU was a thing well before that! In 2004 it was already covered in a few chapters of GPU Gems 1, increasing to 18 chapters in the 2005 GPU Gems 2, which included an FFT implementation.
Excellent point! Jensen is very focused and the company has worked incredibly hard on whatever they've put out there. The Shield is a testament to this focus. They find a budding niche and double down on building it from nothing. Most "self-driving" cars have Nvidia gear for a reason.
It's the best Android TV/gaming device around, even by today's standards. It's stable and does what it was designed to do. With zero marketing from Google or Nvidia, the mass market obviously didn't care for this type of product and category, but the device itself is great and works flawlessly. The bundled Nvidia Games app also pushed the Nvidia game-streaming concept built around GeForce Now. Overall, Nvidia put their best foot forward with this device, providing standout support for both HW and SW.
The chip line they made for it powers the most popular console on the market and basically locked its manufacturer into Nvidia chips until they're willing to drop compatibility, so financially it probably worked out for them, even if the Shield line itself wasn't extremely financially successful.
I don't know about its market success, but it's a great product. We use it as the frontend for all of the streaming platforms and Plex, as well as running some stuff directly from a NAS and IPTV.
I use them everywhere and have a big pile of Chromecasts, satellite boxes, remote controls and Apple TVs now ready for eBay!
Some years before CUDA there was a lot of hype when the first GPGPU papers were published in 2003, showing significantly increased performance using parallel computation on consumer graphics cards. At the time, it looked like competing on general-purpose computation was a solid strategy: multi-core CPUs from Intel were still years away, showing up in 2005; and starting from 2000 the rate of increase of clock speeds started slumping. We saw Intel start releasing more variants of processors, but the clock speeds weren't advancing exponentially anymore. The new battle for core supremacy was on the horizon.
It's a bit unfair to ascribe it to sheer luck. They were focused on the compute related possibilities all the way back with the GeForce 3 in 2001. Presentations from its launch were already talking about the potential of a "parallel compute monster" [1].
They saw the potential of GPU compute very early on, invested in it long term and as a result eventually ended up dominating the market. The others didn't seriously commit and so they fell behind. AMD still can't seem to commit, while Intel seems to be working hard on catching up but isn't quite there yet.
The actual quote from your link is: "Expect a massively programmable, massively parallel and pipelined graphics monster", not a "compute monster".
While I agree that Nvidia positioned themselves well, they were not looking as far forward as you suggest. As I recall it, they seemed surprised by BrookGPU, though they moved quickly to embrace the model.
> then its parallel GPU got "lucky", first the bitcoin mining, then the AI. it probably did not expect and plan for this, to some extent, it got super lucky.
I feel like saying they got "lucky" after trying and failing in multiple other endeavors requires a special definition of luck. If someone rolls a six-sided die six times and rolls a six once, did they get lucky?
Being good also has the facade of luck: every venture is a gamble, none of which are guaranteed, but putting yourself in the most optimal positions (including diversifying) will result in some successes and some failures. You don't end up with a $1T valuation based purely on luck.
For corporations, just staying alive that long without a huge hit is very lucky. Look how many other chip companies failed or were acquired cheaply during that period.
Given how AMD/ATI has fared with their shitty software ecosystem after close to 20 years (the first paper on using GPUs for CNNs was around '05/'06, using shaders), calling Nvidia 'lucky' is quite unfair.
Those of us who were unlucky enough to buy an AMD GPU based on 'flop count' were hurt quite badly. Nvidia's software compute infrastructure is simply unmatched.
Having a competitor that seemingly refuses to compete is pretty damn lucky. There's not a single thing nVidia could have legally done to make that happen.
>then its parallel GPU got "lucky", first the bitcoin mining...credit must be given to its CUDA ecosystem
This seems like a bit of a contradiction to me: either they got lucky as you claim, or CUDA was good enough to keep things going. From my experience, CUDA was the only way to go for GPU-accelerated processing; specifically, my experience was with Resolve color correction, and using CUDA was the only way to go. Memory is fuzzy, but the crypto timeline was around the same time. Because of the audience, this forum is naturally going to trend toward crypto rather than niche video post workflows, but CUDA was definitely kicking Radeon/AMD ass in other areas besides useless crypto.
Same. I wanted to buy a really chunky graphics card for BMD Fusion, but the price got pushed up by the cryptobro wankers mining their Dunning-Krugerrand, and now the price is getting pushed up by all the chatgpt wankers trying to run hardware-accelerated Eliza bots.
It's not pure luck though. GPGPU research started in the early 2000s IIRC. Nvidia had invested in it earlier than anyone and more than anyone. That's how they got CUDA. Nvidia was ready for the next generation of computing. It's just that no one, including Nvidia, knew when it was going to hit the market.
The problem isn't C. The problem is that the buggy OpenCL garbage doesn't work at all. I once used hashcat with OpenCL in a security course on an AMD GPU and it made my system completely unstable. I couldn't care less what the kernel is written in. I'm never going to use hashcat on AMD GPUs ever again.
The other problem is that companies invest in some alternative to OpenCL which fragments the non-CUDA ecosystem.
I thought that I should mention a book, because it has come up a lot in discussions of Nvidia's "lucky success". The book is titled Why Greatness Cannot Be Planned: The Myth of the Objective (https://link.springer.com/book/10.1007/978-3-319-15524-1#toc).
I don't really think Nvidia's luck is just luck. After all, they had the vision for CUDA, bet money on it, and succeeded. They could have stopped the effort at any point during the adventure, but they chose to continue. That's not luck, that's vision.
BTW, the same book got picked up by some Chinese talking heads as a perfect demonstration of the good side of capitalism, with which I totally agree. After all, if Nvidia were a Chinese company, they'd probably have put their efforts into developing some mass-surveillance, citizen-incriminating, police-wrongdoing-never-seeing camera with funding provided by the government, and thus an almost guaranteed "success". I'm really glad that Nvidia operates in a country where "making hardware so everyday people can use their computers to play games" is not something to be mocked by a government acting like an abusive parent. Now, look who's paying smugglers double the price for those A100 chips while crying? A well-deserved fate in my book.
I started reading that book (seems great!) and stumbled upon this: "And it's hardly clear that computer scientists will succeed in creating a convincingly-human artificial intelligence any time soon." Made me smile. It took but 8 years; the book was published in 2015. :)
(BTW. I'm super concerned about all the incredible amount of power nvidia has now and in the future. They get to decide the fate of the human race, I feel. And their incentive is just to make as much money as possible, while all negative externalities including the likes of extinction is left to the society to deal with. Sigh.)
There's a lot of hard work that goes into luck, but one aspect of the Nvidia story is of AMD's mismanagement. There's an alternate reality where OpenCL became the default instead of CUDA but that's not our reality.
> Nvidia failed mobile computing, failed embedded computing,
Really? What do you think high-end drones and self-driving cars are using? They invested in generic robotics software and hardware probably more than all the others put together. When you go beyond the Raspberry Pi, it's only NVidia. Their day is coming. Another similar ecosystem will be hard to create.
It allows for machine vision and heavy maths to be executed very fast. There is basically nothing similar in an embedded format in the market. The remainder would be SoCs for Android phones (Samsung Exynos?).
These SoCs have lots of hacky drivers and mostly support Android, which is not a very good fit for the real-time behavior and heavy customization required for drones and self-driving cars (the AOSP build system is a Google-class piece of crap).
They barely exist in the network switch market. The new NVLink switches may change that for tier one inter-chassis transport to replace Infiniband RDMA, though.
Reality is Mellanox has never been a substantial networking player. Yes, in HPC, but that's a tiny market even today.
It doesn't help that Mellanox acquired Cumulus just prior to Nvidia acquiring Mellanox, and Cumulus was basically DOA. Now they are split SONiC/Cumulus with a lot of internal infighting trying to keep Cumulus relevant despite industry trends.
Don't forget the drivers. As someone who just switched from Nvidia to AMD, it is downright painful how bad AMD's implementations of Vulkan and OpenGL are. I might be getting more bang for my buck but damn do I miss not having unfixable glitches.
AMD is good at being the underdog; hopefully it will focus more on its software, the ROCm thing, which really needs some love, a lot of love indeed. The software ecosystem for AMD's RDNA (GPU) and CDNA (MI2xx, MI300) is at best a mess.
AMD should own OpenCL, boost it heavily, and make it a central piece of its ROCm framework as the preferred backend, in my opinion.
ROCm does not compile PTX IR to AMD assembly; if it did, you'd be able to run nearly any compiled CUDA program on an AMD GPU without the source. ROCm is a source-level compiler for CUDA (technically HIP is the actual compiler, but whatever); it lets you use a substantial fraction of the CUDA APIs. This notably also means that any Nvidia open source library that uses inline PTX assembly won't work, but fortunately AMD does have alternatives to many of the Nvidia libraries.
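To illustrate what a source-level port means in practice, here's a rough sketch of my own (not AMD documentation); the kernel name and the inline PTX line are made-up examples:

    // CUDA source: the kernel is plain C++, the runtime calls are the CUDA ones.
    __global__ void scale(float *v, float a, int n) {
        int i = blockIdx.x * blockDim.x + threadIdx.x;
        if (i < n) v[i] *= a;
    }
    //   cudaMalloc(&d, n * sizeof(float));
    //   scale<<<blocks, 256>>>(d, 2.0f, n);

    // After a HIP source port the kernel body is untouched; only the runtime
    // prefix changes (hipMalloc, hipMemcpy, ...), and the same <<<>>> launch
    // syntax is accepted by the HIP compiler:
    //   hipMalloc(&d, n * sizeof(float));
    //   scale<<<blocks, 256>>>(d, 2.0f, n);

    // What cannot be ported mechanically is inline PTX, e.g.:
    //   asm volatile("ld.global.ca.f32 %0, [%1];" : "=f"(x) : "l"(p));
    // PTX is Nvidia's virtual ISA; ROCm targets AMD's own GCN/RDNA ISA instead.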
Both oneAPI and ROCm do, or should, or could provide higher-level APIs to isolate you from OpenCL's C APIs, or just leverage SPIR-V. Other than CUDA, I have yet to see any other open alternative for heterogeneous computing; OpenCL is the only one on the table as far as I can tell for now. Yes, there are Vulkan compute shaders etc., but they're still pretty far behind, and they could be made OpenCL-compatible too.
While CUDA is great, I hope oneAPI and ROCm can make the OpenCL side more open-source friendly.
By reputation, Jensen doesn't really have much in the way of SW understanding, and in that sense he is like a lot of former chip guys. They got _insanely_ lucky that CUDA took off and there's an amazing irony in SW being their primary lock on their current market.
To win, they just needed to show up. Which is more than can be said about the competition.
AMD alternatives to CUDA are/were fumbling in the dark for many years, and more open alternatives like OpenCL are too limited (by design?).
To me the situation looks quite clear: a GPU has vastly more compute than a CPU. As time goes, we will need and use more and more compute. You just need a way (general purpose language or API) to use that GPU.
For some reason, other companies in this space did not see this.
They keep failing to see this. While CUDA is a polyglot programming model, a couple of years ago at an OpenCL conference (IWOCL) someone asked the panel when Fortran support was going to happen.
Everyone on the panel seemed surprised that it would be something anyone would want to do, and most of the answers were of the "talk to us later" kind.
Meanwhile PGI was already shipping Fortran for CUDA; this was before they were acquired by NVidia.
I agree with this. All they had to do was the bare minimum and actually keep it alive for a few years.
This pattern is pretty common in industry. Almost all the huge companies that are winners in technology are those that got on the market and kept the thing alive - that's not sufficient, but it is necessary.
Nvidia gpus were massively inferior for Bitcoin mining, in fact, because ATI/AMD had some integer operations that allowed SHA256 to be several times more efficient.
Ironically, that's what ultimately made Nvidia the winner for GPU mining: ATI GPUs had been massively deployed for Bitcoin mining prior to the dominance of mining ASICs. When people created new altcoins they specifically designed their work functions so that they, the inventors, could have an advantage over the general public, so they designed them for Nvidia GPUs rather than for what was already deployed. This let them buy up GPUs before shortages came into effect and delayed competition from the installed base.
Sure, you can trivially port whatever to whatever, but outside of the startup effect, mining is naturally perfectly competitive. Being 20% less efficient relative to costs means bankruptcy.
> Nvidia's moat is AI.
Nvidia had a GPU computing moat before the current AI fad, due to the maturity of the CUDA ecosystem. At least AI codes are generally pretty easy to port to other architectures, similar to mining in that sense -- but the AI designers don't have a profit motive to make sure they choose algorithms that are more efficient on their hardware than yours, and your AI hardware doesn't become useless if it does happen to be a few percent less efficient.
The first big mining wave was 2014-2015. This was done on 5850s, and GCN 1 and 2 GPUs. This was Bitcoin, so it was computation-focused, and I think in this era it was definitively AMD dominated due to VLIW allowing very dense execution resources plus early GCN having a large amount of raw integer processing power.
The next was 2017-2018. By this era bitcoin itself had moved off to FPGAs and ASICs, so this was around Ethereum which primarily worked based on proof of memory bandwidth. AMD GPUs were falling behind Maxwell/Pascal in terms of their memory compression (although they did use it) so they were equipped with more memory bandwidth to compensate. So for a given AMD card (Polaris, Vega, etc) you got more memory bandwidth per $, but in terms of the actual compute efficiency, NVIDIA had already pushed ahead even in Ethereum. NVIDIA was usually superior per-watt with cards like 1060, 1070, 1070 Ti, and 1080 Ti, and AMD just let you burn more watts.
However, when it came to altcoins, where it was not just raw bandwidth, NVIDIA's superior compute/GPGPU efficiency took over, and there were some coins where NVIDIA was 2x or more efficient per watt and also the winner in absolute performance.
(the thing to remember is that compute is the part that ASICs can do efficiently, and I always questioned whether those altcoins were really ASIC-resistant. But the ProgPOW-style algorithms doing better on NVIDIA cards never bothered/confused me, the reality is that most GPGPU programs "favored NVIDIA" during this era and Ethereum's proof-of-bandwidth model was an exception. Pascal was an efficiency beast and Polaris and Vega were only ok at best, outside their enormous, dangling memory buses.)
This state of affairs persisted throughout the 5700 series until AMD launched the 6000 series, where they shifted to a design with smaller memory buses and more cache, which put them in the inverse situation of 2014-2015 where they were getting more out of a weaker memory subsystem than NVIDIA. And I think they did this on purpose because they wanted to "opt out" of the mining boom/bust cycle, and NVIDIA made a similar approach with Ada that has been extremely unpopular (despite AMD leading the way on this a few years before on their cards too).
Isn't it wonderfully coincidental that out of any amount NVIDIA could have put into the LHR to slow down when it detected mining, that they put in the exact amount that dropped their cards to the same relative mining performance as AMD, and moved to the same cache-based approaches in the next generation as well? That has always been my take around LHR - it's not that they didn't like mining revenue, it's that they didn't want NVIDIA cards to be disproportionately pulled off shelves like happened to AMD in the 2014 and 2017 mining booms. People remember the "AMD was $1600, NVIDIA was $2400" situation already, they didn't want that to persist and turn into actual marketshare.
I won't comment on the actual facts, I'll just say that I couldn't bear reading more than the first few paragraphs of the article because both guys sounded like such rabid fanbois...
NVidia has always been the first choice for graphics work, and especially now with large complex VFX pipelines there really isn't an alternative - particularly since most of that work is done on Linux which has always had excellent support from NVidia.
A big reason that Nvidia keeps winning is that AMD doesn’t bother to compete.
Competition means giving customers exciting, fast, low cost GPUs.
Nvidia, as #1, no longer needs to compete and has stopped winning via low cost, high performance GPUs. This opened a giant opportunity for AMD GPUs to give those things to consumers and start winning against Nvidia.
But instead AMD has just followed Nvidia into making slow, uncompetitive GPUs at high prices.
It has been the years of investing, and especially the software (CUDA), that keep Nvidia as the "king". Hardware-wise, AMD is pretty much up there, or winning in a few cases.
> AMD has the patents for putting a SSD directly on the card, they could be killing it in the home AI market, but.... they just can't get it together.
Radeon SSG was just a PCIe switch chip on-card, with the SSD+GPU behind it, so functionally it is the same as having the SSD on the motherboard.
I'm not sure if they got patents, but either way the problem it solved was not the one people think it solved. It wasn't "using the SSD as memory", and in the way that it did work (block storage), any other SSD+GPU combination can function identically without needing the SSG tech. Putting the SSD on your mobo performs just as well in every way on any GPU (GPUDirect RDMA has been around for a while, since Fermi/Kepler at least).
It was basically just a convenience thing of getting an SSD mountpoint built into your GPU. They even showed up as an HBA in Windows and were treated as a striped disk.
More recently this idea has re-surfaced with some of the modern GPUs with x8 interfaces (6600XT, 4060 Ti, etc) getting a couple M.2 drives in the other x4x4 via bifurcation, and this functionally performs the same as SSG but without the switch chip (needs hardware support instead). And if there are patents around using a switch chip that may be how the patent is evaded. But combination cards utilizing more than one type of hardware in general is not that novel (network card/ssd combos are another popular one) and I'm not sure AMD patented it or it's defensibly novel against prior art.
Last time AMD was 9 USD per share was Q4 2016. That is a LONG time ago. What made you bet on AMD at that time? I'm still shocked how fast AMD turned around. And I cannot see the end of Intel's slow slide into dinosaur tech company, similar to IBM, HP & Compaq.
>Hardware wise AMD it's pretty much up there or winning in a few cases.
Hard disagree as someone who's had two 7900 XTXs and is now looking to buy an Nvidia card. If I need to pay 1100€ for a product, I'd like it to work and AMD cannot deliver on that.
The lack of a recall for broken hardware also makes me distrust them from a customer support point of view.
Bumpgate was just RoHS solder and nobody recalled for that, including AMD. Seen a lot of pictures of people baking their 7850 as well, it just didn't get a fancy name like bumpgate.
Bumpgate was an industry-wide problem and pretty much nobody recalled for it unless they were sued into doing it.
> But instead AMD has just followed Nvidia into making slow, uncompetitive GPUs at high prices.
I'm curious: do you use GPUs? It's hard to take this claim seriously. Yes NVIDIA GPUs are expensive, but I don't think we live in the same matrix when you claim that their GPUs are slow and uncompetitive... Unless you have an alternative in mind, in which case I'm all ears.
Yeah I'm not sure how a GPU can be both slow and uncompetitive and also the best GPU available at any price. That would imply that they're extremely competitive.
Some people just have an ax to grind and will contort their facts to agree with preexisting thoughts and feelings. I think it's called cognitive dissonance.
There are two senses of 'competitive' in play here: the sense in which a market can be 'competitive' or not by featuring alternatives that are comparable and whose successors leapfrog one another in various virtues over time, and the sense in which a product can be 'competitive' within a market by being more or less at least as good as whatever else is out there.
NVIDIA is definitely not 'competitive' in the sense of participating in a competitive market, or in the sense of being characterized by furious striving for rapid and continuous improvement. Many of their newer cards are barely 'competitive' with their own cards from a generation or two ago.
It's extremely clear to me what people mean when they say that NVIDIA or their GPUs are not meaningfully 'competitive'. I don't think it's that hard to see, actually.
> Many of their newer cards are barely 'competitive'
The 4060 is the only card this generation that's a flop, and I wouldn't consider one in four being "many".
> I don't think it's that hard to see, actually.
It is if you're not looking or trying to be disingenuous. Every generation has introduced new features and had noticeable impacts on the consumer products in its category (games). DLSS alone has been a game-changer, pun intended.
That’s… very debatable. Don’t get me wrong, they’re definitely competitive at all levels but AMD absolutely is too & very arguably is better in some cases.
When you say AMD is extremely popular still, yet Nvidia has what like 82-84% market share.. what are we actually talking about here in the context of "market talks"?
Not how that works, friend. Neither data nor debate. You can't hoist the conversation to another sphere and continue, but even if we do, there's a way back. Let's dive in, shall we? The data is accurate, reported by numerous providers. If you don't trust any of the data providers, you can always compare the revenue of the two companies, where AMD's number includes CPUs and its deals with console manufacturers as well, and it still doesn't come close; in fact, the relative percentage still holds. Consoles are B2B deals; people don't buy consoles because of AMD. The same goes for mobile or Apple (to a degree, because Apple sells Apple), as well as integrated Intel GPUs. Where consumers and businesses have a direct pick and choose, they did pick, and it's Nvidia for the better part, the 80%+ part.
Yes, and that's largely ArtX's doing, a legacy tracing its roots to SGI and the Nintendo 64 cooperation, which ATI bought (into). However, it's not at all relevant, since consumers aren't buying those for the AMD GPUs; the chips are specialized (to consoles), and people buy them for the console brands themselves, not because they have an AMD GPU inside. The same parallel could be drawn for Intel's integrated GPUs. We're talking discrete GPU cards, where the market directly picks and chooses, and it chose Nvidia to the point that it's not even a contest anymore, it seems. Even the HPC area is almost done for.
Right, at the moment, and that game changes all the time. However, we're talking share here, where 5 out of the top 10 are Nvidia (which also changes, yes, and not in favor of AMD but of those "other" systems which are mostly under sanctions and can't buy Nvidia at scale). The top five accelerators/co-processors in the TOP500 are Nvidia per https://en.wikipedia.org/wiki/TOP500 . If you ask Nvidia they'll tell you 64-68% of the TOP500 is Nvidia, but they count the interconnect as well, not just the GPUs. The truth is somewhere in between, probably in the 1/3-1/2 range. It's really not hard to see AMD is putting up a fight, but against a giant. Nvidia outclassed AMD's sales in B2B as well as to consumers by a big margin, which is reflected both in revenue and ultimately market cap. The reason why, since both are fabless, lies primarily in the R&D Nvidia executed on rather well - but that's just, like, my opinion, man. Everything else is facts.
That's not how it works. But I'll bite, since you seem to follow a religion here - you can approach "the market talks" from the other end, revenue. It tells the same story, eerily so in fact.
That's their newest mid-tier consumer card. On the data-center side they can't make H100s fast enough; all the AI startups are clamoring for them. You also won't see any reviews like that for the 4090; the only complaint I've seen there is that it's hard to take advantage of it if you're just gaming (but it's still great for CUDA development).
Nvidia knows that the profit margins at mid-tier GPUs are tiny - so they're pushing people to buy the 4070, 4080, and 4090.
Wafer prices have increased drastically in the last decade - to the point that midrange cards are no longer midrange priced.
It simply isn't really profitable to make real midrange cards anymore.
In addition, the discrete GPU market is a declining market and has been for 15 years or so. In any declining market, the low-end and mid-range products get squeezed and the high end gets pushed. The remaining buyers of discrete GPUs are willing to pay higher prices. Those who aren't will buy a laptop with an Nvidia GPU already in it or some sort of SoC (like Apple Silicon/AMD APUs/Intel APUs).
>The market for GPUs is declining because crypto is over and because the GPUs are so underpowered and overpriced that people don’t want to buy them.
No, this is not true. DIY PCs aren't as popular as before due to the advancements in laptops and interest swinging to mobile. People are buying far more laptops than 10-20 years ago. In fact, gaming laptops outsell gaming desktops 2:1 now.[0] The gap is expected to widen.
The market for discrete GPUs has been declining long before crypto. Crypto just slowed the decline.
Notice the word discrete. The entire GPU market isn't declining. Only discrete.
>And if GPUs cost so much to make, why do they get discounted so drastically when the manufacturers eventually decide to compete?
Part of it is because of price inflation due to COVID and the crypto bubble. So prices are just going back to a more normal level.
Based on desktop CPU sales or based on pre-built sales? I don't know what qualifies as a gaming laptop these days but the best laptops for software development often have discrete graphics cards in them e.g. 1660 Ti
I'm guessing it's having a non-APU AMD/Nvidia GPU inside the laptop.
But it makes sense to me. Laptops have gotten much better, and most gamers aren't buying Nvidia 4090. The most common GPU is just a GeForce GTX 1650 which many laptops have and have thermals that easily fit inside a laptop.
It's not surprising at all that gaming laptops outsell gaming desktops. This isn't the 2000s anymore where if you want to play PC games, you build a DIY desktop tower and buy a discrete GPU.
It's a myth that most PC gamers use top of the range GPUs. Most of them use low-end or mid-range GPUs.
The 4060 is a pretty unusual case. In general NVidia cards fit on a very predictable price/performance curve where you don't get any performance for free, but consequently you pay more and get more performance.
(admittedly I have zero interest in them as gaming hardware, but on the GPU compute side this is definitely the case)
It's kinda amazing how year after year reviewers release the "new products are crap, buy the thing we told you was crap last year" and people don't catch on.
The low-end is crawling along due to fixed cost overheads from PHY area that doesn't shrink. 4N is ~3x the price per area as Samsung, and PHY area becomes relatively much larger. The incentive is to cut every cost in PHY area - fewer memory channels to reclaim PHY area and using cache instead, cutting PCIe width to reclaim that PHY area, even cannibalizing the media engine/encoders to claw back that last little bit of space. AMD has actively been engaged in this battle with 6600XT (x8 PCIe, 128b memory bus) and 6500XT (x4 PCIe, 64b memory bus, no encoder) and RDNA2 generally shrinking memory bus and replacing it with cache in this same way (and 6600/6600XT and 5700/5700XT also regressed performance vs their predecessors in some situations just like 4060/4060 Ti). On top of that you have big fixed increases in manufacturing, shipping, etc, and the rest of the BOM is ballooning over time too (forget VRAM spot prices, ask automakers if their BOM is higher or lower than 2019... and it's not a small difference, it's probably 2x!)
In this world of organically-low performance increase within the low-end product segment, the impact of clearance sales exceeds the impact of new product generations. And this means we end up in this situation where reviewers do the "the new products suck, buy the ones we told you sucked last year" rubber-chicken routine year after year after year. But if everything is constantly bad, kinda nothing is, really. That's just how this product segment is now.
And when you look at things like the 3060 Ti being $275 on clearance, or the 6700XT being $300-320... the market consensus is that the old products are Good Enough. And the new products in fact may be worse in some ways, unless you are willing to step up in price to maintain the same product segment rather than the same price segment. Because TSMC is cranking the prices 25-50% every generation and Samsung was abnormally cheap to begin with, so there is a definite price step happening.
Hard to see how people don't understand that the $200-300 product segment is dying in the same way the $100-150 product segment already died. 6 months after launch you could buy a 7850 2GB for $150, or a 7750 for $100. What does $100 buy you in the new/retail GPU market these days? Does anybody actually seriously think the inability to deliver new $100-150 GPUs with enthusiast-tier performance is due to "greed" or "agreeing not to compete" as opposed to market realities? But people will defend it to the death that the privilege of selling them a $200 GPU is some massive profit opportunity that AMD and NVIDIA are just choosing to ignore and sandbag.
We are in the end of silicon in some ways. Post-Moore's Law the costs have been spiraling, and now we are to the point where consumers are deciding that no, it's not worth the cost increases. And companies aren't going to cut their throats and run zero margin or sell at a loss either. They will make the products they can make, and if the demand is low enough that it's not economical to continue developing products for that segment, they'll stop and continue in the segments that are still profitable. And that eventually flows through to TSMC and ASML, and node research will slow (hyper-NA is already effectively canceled due to excessive costs) and silicon research becomes incremental and iterative and slower rather than continuing to make even N7->N5->N3 sized progress. Yes, it can get slower.
And again, this doesn't mean "NVIDIA is leaving gaming", any more than they left gaming after the $100-150 market died. You can still make something in the 4070-class product very profitably at $600 and slimmer margins at $500, and the market access the gaming products provide is foundational for capturing the innovation happening in the other segments (it's why NVIDIA keeps being showered in money with stuff like AI while AMD is left out of the rain, you don't get to be in the segments like AI or OptiX without doing the work in gaming). But they can’t do the 4070 at $329 like it’s 2014 on a mature 28nm anymore. And the $200 market is as toast as the $100 market before it, if that's your budget then buy a console and benefit from the cost-reductions of integration and reduced modularity and fixed/stable hardware specs.
But gamers are "emotionally unprepared" for living in a world where there isn't automatic progress at each price point. People think of it as the privilege of AMD and Nvidia getting to sell you an upgrade... the vendors see it as the "privilege" of selling you the lowest-margin product in their lowest-margin product family (all of that 6nm wafer is 10x as profitable for AMD doing literally anything else already). If it sucks, oh well, and if you don't buy it then they'll stop making it; it's not malicious or conspiratorial, it's just not where the tech is going in that price segment. People want cost-inefficient Lego-style modular product design even in the lowest-end product segments, then get mad when it's expensive and start spewing conspiracies about collusion etc. That's easier for people emotionally than just admitting they're being irrational.
Also there’s really no segment where any product here regressed. The predecessor to the 4060 isn’t the 3060 ti, a card that was a $400 MSRP / $450 street price even after mining. It’s the 3060, and the 4060 is way faster in all scenarios. The predecessor to the 7600 is the 6600, the 6700XT is a $480 MSRP card. People love to do this “it’s actually 2% slower (at a res nobody plays on those cards) than a card that’s only 20% faster” bit - and again this includes reviewers too. Nothing is actively regressing, it’s just not advancing as fast as people want, but they’re so emotionally immature they have to turn that into “2% slower than a card that’s 20% faster / 50% higher msrp” to express their frustration. It’s not slower than anything other than your expectations, but people emotionally love the framing of it somehow being a regression.
After 5 years of these reviews and this discourse (everything post-Pascal, and even Pascal itself tbh) I'm just kinda tired of it. If you apply Moore's-law-era standards then everything is going to suck going forward, period, with a rare "this one is ok" for the truly great ones. If everything is awful, nothing is.
I use GPUs for FP64 (64-bit floating point) compute, and the best AMD GPU right now is still Radeon VII, which was released in 2019. Since then AMD split the GPU business into two lines, CDNA and RDNA, the point being to be able to charge much higher prices for the GPUs that provide good compute (i.e. CDNA).
The RDNA line is targeted at gamers. The reasoning apparently being "let's be careful that these RDNA GPUs are bad enough so they can't really be used for anything else but games; if the user is into compute, they should pay the big money for CDNA GPUs".
This is also reflected in the ROCm situation. ROCm has good support for CDNA GPUs, which is the hardware deployed in the huge national-lab GPU projects, where money is not an issue. The problem with that approach? Normal people do not have access to CDNA GPUs that cost multiples of 10,000 USD apiece.
Indeed, you do need kind of a loss leader weak compute GPU to get people interested and experimenting with your platform.
Cloud does not fulfill that purpose being too expensive, too limited and not secret enough.
Normal gaming nVidia GPUs fill that role. You need big guns, you buy a lot of their compute cards next.
AMD had decent compute capabilities in their CDNA line, but they didn't follow through with drivers. The cards were pretty bad to try to use for compute with early ROCm, and just as bad for gaming.
> I'm curious: do you use GPUs? It's hard to take this claim seriously. Yes NVIDIA GPUs are expensive, but I don't think we live in the same matrix when you claim that their GPUs are slow and uncompetitive... Unless you have an alternative in mind, in which case I'm all ears.
I think he meant Nvidia GPUs are more expensive than Nvidia GPUs from previous generations.
> Competition means giving customers exciting, fast, low cost GPUs.
You actually need to do more. The ML community chose nVidia because, a decade ago, nVidia donated GPUs to universities for free. People like Hinton hacked CUDA as a poor man's HPC.
For example, the original AlexNet used 224x224-resolution pictures to fit in the GTX 580's 3GB of memory.
>A big reason that Nvidia keeps winning is that AMD doesn’t bother to compete.
AMD definitely wants to compete. Why wouldn't they want a piece of a trillion dollar market?
The problem is that AMD was almost bankrupt, had poor leadership until Lisa Su, and they were just trying to survive making console chips for Sony and Microsoft until Zen2.
> But instead AMD has just followed Nvidia into making slow, uncompetitive GPUs at high prices.
It's true that modern GPUs are ridiculously expensive compared to where they were 10 years ago, but there still is a good sub-$200 GPU available: the RX 6600, from AMD. It's not the fastest, but from what I understand, it gives the most power for your buck of any modern GPU.
I was so disappointed with the 7970 and 680 era of cards because the one time AMD launched first they priced a mid size die as if it was a top end card and Nvidia followed suit renaming the x70 card to the 680 as competition. It marked the beginning of the period where cards have got more expensive and renamed over and over. The 580 in pedigree looked a lot more like a Titan card/x90 does today than any x80. An x80 today is a x60 class card of that era in terms of die size, memory bus width and power consumption targets and a host of other core big measures of GPU design.
They have both been at it, gradually increasing prices to the point where a mid-range card of a decade ago now costs 5x as much and twice the price of an entire console. It's got pretty insane this generation, with marginal gains, massive price hikes, and another round of renaming cards to step the dies down again. Finally no one is buying them, but these companies are big enough that they will just blame the overall economy anyway.
> But instead AMD has just followed Nvidia into making slow, uncompetitive GPUs at high prices.
This seems like a needlessly conspiratorial and complicated cope to avoid recognizing that the low end is simply suffering from fixed overhead and rising costs.
The alternative/null hypothesis is that both companies are responding to the market options that TSMC and BOM costs provide them, and the technology is simply moving slower in some segments. In this null hypothesis, it's not an active conspiracy to screw anyone, and nobody has deliberately "followed anyone into noncompetitiveness"; there simply isn't a big market opportunity to sell $150 enthusiast GPUs anymore (7850 HD!) and make a decent profit based on the actual costs of building the product and getting it through design/validation and manufacturing/shipping cycles.
Like, it's kinda facially absurd that gamers think that there's some golden opportunity to make low-margin $150 GPUs that everyone is just choosing to ignore because "they'd rather do AI/because they hate gamers". That's a shitty low-margin product and everyone is choosing to ignore it because it's not profitable, gamers are just kinda operating under this fallacy that $150 is a lot of money for an enthusiast-tier dGPU. And of course gamers will not touch a card with less than 8GB, and will scream a fit with 4060/6600 style 128b buses not being enough width. They don't like any of the compromises that are necessary to get down to this price point, they want a $150 card that's no-compromises quasi-midrange.
Shockingly, most companies are not interested in chasing the customer who wants the corvette for $25k. And yes, you can sustainably build a car for $25k, or maybe even a bit less... but it's not gonna be a corvette either. It's gonna be a midrange that you've "cut down" with feature lockout, or it's gonna be the shitbox econo-model that was bad to start with.
> Nvidia, as #1, no longer needs to compete and has stopped winning via low cost, high performance GPUs. This opened a giant opportunity for AMD GPUs to give those things to consumers and start winning against Nvidia.
> But instead AMD has just followed Nvidia into making slow, uncompetitive GPUs at high prices.
I am not really sure they could do that. If Nvidia cards are cheaper to produce, then for AMD just ordering more cards could end in massive losses short term, and much less profit long term, as customers demand cheaper cards. Nvidia, as the top dog, could always lower their prices, hurting AMD really badly.
But yes, in retrospect, knowing that the demand for GPUs would remain ridiculously high for years, the best move for AMD was ordering and producing many, many more cards. Just as the best move for us customers was jumping into crypto. (Not necessarily the best strategy.)
I'm not sure that's true when even Intel's low-cost GPU offerings aren't doing any better. AMD's cheaper RX 7600 is probably a better value than Intel's more expensive A770. Nvidia's newest RTX 4060 is in that price range too and also performs similarly, but will probably have more mindshare.
Intel from what I can tell has better hardware than AMD, but they have even worse software than AMD.
It's really just Nvidia firmly winning the entire market the good old fashioned way. Those who can compete still can't hold a candle, and the market at large is content to keep buying Nvidia because their stuff is simply That Fucking Good(tm).
It's Nvidia's market to lose, and they clearly aren't losing any time soon.
I'd say AMD still has worse software than Nvidia too, even setting aside GPGPU/CUDA. About a year ago, I swapped my 1080 Ti for a friend's 5700 XT because he was having crashing issues with games. I use it fine now with Linux drivers, so I doubt it was a hardware issue.
AMD drivers exist in this weird state of tautological pseudo-goodness where they're good as long as you ignore all the times they aren't. And if you point it out, people dig in with the "well I've never had a problem in 15 years now" and "go look at NVIDIA's tech support forum, they have bugs too".
NVIDIA has not had a sustained generational instability problem like 5700XT or Vega drivers in the modern era, and they haven't even had a more short-term shitstorm like RDNA3 launch drivers that lingered for nearly as long. And there have been multiple instances of top-5 e-sports titles being flatly broken on AMD drivers (often resolveable by going back to much older drivers) for prolonged periods of time (quarters/years) that simply don't happen on NVIDIA cards.
But of course there is a low-level stew of problems on both brands constantly. Power-saving with multimonitor is a great example of one that's been perma-broken for 10 years on both brands now. But there is also a high-level stew of Radeon-specific problems that pretty constantly churns and it gets discarded out of hand because "I've never had a problem" like that means 5700XT didn't exist. And it's people who you absolutely know are aware of the 5700XT issues.
Even working from a baseline assumption of intellectual honesty it's a super frustrating discourse overall, and I think in many cases the assumption may be unwarranted.
I'm not a pro in this field, but I would think that with AMD acquiring Xilinx they see a better way into the AI competition through FPGAs. I would suspect others are in the same boat. GPUs for AI are a nice hack, but still a hack. Make hardware that is specifically for that job.
Microsoft has historically been the biggest proponent for using FPGAs for AI.
It’s not that you can’t use them, and there may be narrow cases where they have a benefit. There are always trade-offs to be made.
But that doesn’t make them the best hardware for AI (there’s silicon overhead that will never be used) and it doesn’t change the fact that they’re much harder and slower than bare metal CUDA to optimize for, in a fast moving field.
Look at it this way: if you have a fixed DL architecture and a fixed set of network weights, there’s no question that you could come up with an ASIC that only does that, that will be cheaper than an FPGA or GPU, faster, and use lower power too. It will be the perfect inference solution for that specific problem.
It would also be obsolete in a few months, before the chip comes back from the fab.
Huh :o) What are your thoughts about Intel on this? It was (and still is?) the market leader in CPUs and decided to push FPGAs for machine learning!
In hindsight, it seems like a serious misstep.
It seems to me that Nvidia has largely abandoned the consumer market in order to chase the seemingly-bottomless demand for AI hype. While AMD is more than happy to stay in their lane, continue making record profits on their same market, and not waste billions trying to catch up on 10 years of CUDA development. By the time they have anything remotely comparable, the AI grift will be over just like crypto. Tech grifts only have a 1-3 year cycle these days.
Really surprised to see such low effort commentary akin to Reddit/twitter on here.
In what way has Nvidia abandoned the consumer market? They have something like 80%+ market share on discrete gaming GPUs and still rake in billions a year from GeForce, which at their revenue is a sizable fraction.
And what makes you say that current AI hype is a grift? Do you truly think that the work with transformers is just a fad? I swear people on this site were saying the same thing about AI in general 3+ years ago before transformers blew up the scene.
It's kinda strange seeing some people link advancements in transformer models with stuff like cryptos and NFTs. You've gotta ask, where are these thoughts coming from? Plus, the ongoing use of the 'stochastic parrot' argument is starting to feel a bit repetitive in these discussions.
Their consumer stuff barely competes with itself. They can only get away with that because nobody is interested in that market. I’d also say current AI hype is hard to tell from a grift now - my team is doing some useless crap because CEO basically has tech FOMO. It’s so uncalled for and I’m only guessing not uncommon.
Companies doing useless crap with AI has been a thing long before the current AI popularity spike. I've heard a specialist complain about that trend 5 years ago, where clients couldn't say what they wanted to do with an AI but they "knew" they needed one. That doesn't make it a grift, though.
Nvidia literally doubled the prices of their entire product line overnight and then spent their entire GDC keynote talking about how AI was going to eliminate the need for game developers.
Like crypto, fraud is the only use case that will pan out for transformers in the end.
We’ve been here before. ChatGPT isn’t functionally all that different from ELIZA from the 1960s. It’s tricking people just the same, into thinking it’s more than it really is.
I’m also eagerly awaiting the discovery process that will show that all of this generative AI was built on mass copyright infringement. OpenAI certainly ingested the entire z-library. They’d have to be stupid not to, right? If they didn’t use it, less ethical competitors would have beaten them to market.
Oh, I’m sure they used some sketchy subcontractor to do it. We’ve all seen a variation of this in our careers, where a dataset was acquired under questionable circumstances, and you don’t ask and don’t tell if you want to keep your job, right?
And that’s just speculation. There’s hard evidence for the Getty Images lawsuit.
Comparing ELIZA to GPT because it's not quite there yet is like comparing horse carriages to autonomous cars because they still have accidents. People have already found actual real-world uses for GPT, meanwhile ELIZA's most popular use-case is an obscure feature in Emacs.
Regarding the questionable training data I agree, but the likely outcome is that those AI models will be used regardless.
I'm guessing it's a gut feeling based on seeing a lot of the same types of influencers who used to endlessly hype up crypto now making similar noise about AI. But I agree that that assessment is unfair to AI, even if the hype is ahead of current abilities.
Best kept secret in the AI boom really is that you don't need a ton of local horsepower to run a model and do something useful, you only really need the grunt once, to do the training in a reasonable amount of time.
Combine that with the stuff that Mythic was working on in texas (before they ran out of burn) and we're looking at a world where, most people won't have a need for heavy GPU compute.
It’s gonna take another decade before the suits at F500 companies figure that out, though. You wouldn’t believe the wasteful nonsense that’s in production already…
Oh hell yeah, super happy for them. They've got the right idea for edgeAI stuff, I hope they can survive, or, at least sell their IP to someone big and buy a few acres to DGAF on.
As long as sanctions and ransomware exist, I don’t think it will ever fully die. But I don’t think the price will ever hit an ATH again without SBF, tether, and their co-conspirators pumping the price by printing stablecoins out of thin air.
Pretty much everyone has heard of crypto by now. The Ponzi scheme is out of marks. There will be no next time. Sell now if you’re still holding the bag.
Then in 2018, they prohibited usage of GeForce GPUs in data centers: https://www.datacenterdynamics.com/en/news/nvidia-updates-ge... And during the 5 years that followed, they reached the point where, for many use cases, nVidia is dramatically worse value compared to competitors.
On my day job I do CAM/CAE, we compute a lot of FP64 numbers. Because we use D3D tech for GPGPU, we recently bought some computers with AMD 7900 XTX. These GPUs were sold for about $1000. An nVidia equivalent is L40: the AMD is 1.459 TFlops and 0.96 TB/sec memory bandwidth, the nVidia is 1.414 TFlops and 0.864 TB/sec memory bandwidth, but the nVidia is sold for about $9000. That’s an order of magnitude difference in cost efficiency.
I agree the cost difference is substantial, but something seems off here. I think you can buy nVidia cards that can beat anything AMD has for ~$1500-2000. That is gaming focused, but it makes me extremely skeptical of the overall numbers.
If the AMD chip really is better for your job, that's not that crazy a claim, but it makes no sense that nVidia wouldn't have something for 1.5-2x as much. That seems to be the going rate, currently.
If you're getting comparable speed for 1/9 the cost, I don't think that product would exist for very long. I've been looking at graphics cards and nVidia clearly beats AMD in every category with around a 50-100% markup.
Gamers don’t care about FP64 performance, and it seems nVidia is using that for market segmentation. The FP64 performance for RTX 4090 is 1.142 TFlops, for RTX 3090 Ti 0.524 TFlops. AMD doesn’t do that, FP64 performance is consistently better there, and have been this way for quite a few years. For example, the figure for 3090 Ti (a $2000 card from 2022) is similar to Radeon Vega 56, a $400 card from 2017 which can do 0.518 TFlops.
And another thing: nVidia forbids usage of GeForce cards in data centers, while AMD allows that. I don’t know how specifically they define datacenter, whether it’s enforceable, or whether it’s tested in courts of various jurisdictions. I just don’t want to find out answers to these questions at the legal expenses of my employer. I believe they would prefer to not cut corners like that.
I think nVidia only beats AMD due to the ecosystem: for GPGPU that’s CUDA (and especially the included first-party libraries like BLAS, FFT, DNN and others), and also due to the support in popular libraries like TensorFlow. However, it’s not that hard to ignore the ecosystem and instead write some compute shaders in HLSL. Here’s a non-trivial open-source project unrelated to CAE where I managed to do just that with decent results: https://github.com/Const-me/Whisper That software even works on Linux, probably due to Valve’s work on DXVK 2.0 (a compatibility layer which implements D3D11 on top of Vulkan).
> However, it’s not that hard to ignore the ecosystem
I'd say this will only work if you have stable results or models to target, like Whisper; "going your own way" can help improve portability and such, and llama.cpp is another good example. But a lot of the software demand is not driven by that; it's driven by continuously evolving models and needs, and a lot of the bloat, or whatever you want to call it, is a result of that.
Besides that, the programming models are moving on. The open source Nvidia Linux driver now enables fully heterogeneous memory management on x86 across the CPU and GPU. This means the GPU and CPU do not need the programmer to enforce memory coherency, perform device-specific allocations, or copy memory; migrations, page table/TLB flushes, etc. all work out of the box with no modifications to userspace software. So now your io_uring asynchronous loop can write training data to memory that is implicitly available to the GPU, no matter what memory allocator you're using. It basically means arbitrary CPU compute and arbitrary GPU compute are now composable, using the memory substrate (and OS kernel) as a coherent transport/storage layer. On x86/Nvidia this works at the granularity of a page, but on the Grace Hopper superchip it will take place at the level of a cache line. Multiple Hopper superchips can be NVLink'd together over InfiniBand, so this works across the cluster. You can drive an entire rack of systems this way and it works.
For people actually doing a lot of GPU-specific programming, or deploying models on servers (e.g. for API usage), this is going to be a big deal in the long run, and it started way back when they first introduced unified virtual memory. AMD is moving this way too for their compute stacks, I assume. The compute shader model just isn't evolving for these kinds of needs and it isn't clear it's going to anytime soon.
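A minimal sketch of what that programming model looks like from the CUDA side (the kernel, sizes, and file name are made up for illustration, and this only runs on a driver/GPU combination where system-allocated memory is GPU-accessible, e.g. the open kernel module with HMM on x86 or a Grace Hopper system): there is no cudaMalloc and no cudaMemcpy; the kernel just dereferences a plain malloc'd pointer and the pages migrate or fault in behind the scenes.

    // scale.cu - hypothetical HMM example; needs an HMM-capable driver/GPU
    #include <cstdio>
    #include <cstdlib>

    __global__ void scale(float *data, int n, float factor) {
        int i = blockIdx.x * blockDim.x + threadIdx.x;
        if (i < n) data[i] *= factor;   // GPU reads/writes CPU-allocated pages directly
    }

    int main() {
        const int n = 1 << 20;
        float *buf = (float *)malloc(n * sizeof(float));  // ordinary system allocator
        for (int i = 0; i < n; ++i) buf[i] = 1.0f;

        scale<<<(n + 255) / 256, 256>>>(buf, n, 2.0f);    // no explicit allocation or copy
        cudaDeviceSynchronize();

        printf("buf[0] = %g\n", buf[0]);                  // CPU sees the GPU's result
        free(buf);
        return 0;
    }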
I’m not sure heterogeneous memory is such a huge deal, due to the performance numbers. PCI Express is relatively slow in terms of bandwidth, and especially latency. To compensate, GPUs have a dedicated piece of hardware to asynchronously copy blocks of data over PCIe. In modern low-level APIs that hardware is even directly exposed to programmers, as the transfer queue in Vulkan and the copy command queue in D3D12.
Manually moving data with APIs like cudaMemcpy or ID3D11DeviceContext.CopyResource complicates the code, but it is much faster than unified memory, especially if you do it correctly with pipelining, so the GPU computes something else (like the previous batch of work) while the new data is being copied.
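For contrast, here's a minimal sketch of that pipelined style in CUDA terms (the kernel, sizes, and batch count are placeholders): pinned host buffers, per-buffer streams, and double buffering, so the copy engine moves batch k+1 over PCIe while the GPU is still computing batch k.

    // pipeline.cu - hypothetical double-buffered transfer/compute overlap
    #include <cuda_runtime.h>

    __global__ void process(const float *in, float *out, int n) {
        int i = blockIdx.x * blockDim.x + threadIdx.x;
        if (i < n) out[i] = in[i] * 2.0f;
    }

    int main() {
        const int n = 1 << 20, nbatches = 8, nbuf = 2;
        float *h_in, *h_out;
        cudaMallocHost(&h_in,  (size_t)nbatches * n * sizeof(float));  // pinned memory, needed for real async copies
        cudaMallocHost(&h_out, (size_t)nbatches * n * sizeof(float));

        float *d_in[nbuf], *d_out[nbuf];
        cudaStream_t stream[nbuf];
        for (int b = 0; b < nbuf; ++b) {
            cudaMalloc(&d_in[b],  n * sizeof(float));
            cudaMalloc(&d_out[b], n * sizeof(float));
            cudaStreamCreate(&stream[b]);
        }

        for (int k = 0; k < nbatches; ++k) {
            int b = k % nbuf;  // double buffering: two batches in flight
            cudaMemcpyAsync(d_in[b], h_in + (size_t)k * n, n * sizeof(float),
                            cudaMemcpyHostToDevice, stream[b]);       // handled by the copy engine
            process<<<(n + 255) / 256, 256, 0, stream[b]>>>(d_in[b], d_out[b], n);
            cudaMemcpyAsync(h_out + (size_t)k * n, d_out[b], n * sizeof(float),
                            cudaMemcpyDeviceToHost, stream[b]);
        }
        cudaDeviceSynchronize();
        return 0;
    }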
Speaking of new features, I would rather expect GPGPU users to be interested in the DirectStorage technology, which allows GPUs to efficiently load data from an SSD. It is currently Windows-only but supported by all three GPU vendors. Because it was implemented primarily for videogames, it works just fine with compute shaders.
I think it’s money. nVidia is selling the same chips on different boards for different prices.
An example from the current generation: the $800 GeForce 4070 Ti and the $2500 L4 are based on the same chip. The differences between these two cards are the amount of VRAM, some settings which control clock frequencies, and these legal/software limitations. Without the legal limitations, people who need to compute stuff that fits in 12 GB of VRAM would only pay $800 for the GeForce model.
Because AMD's GPGPU offerings have been egregiously buggy for at least a decade. Now that AMD has money I'm hoping they can turn this around, but geohot hitting driver crashes from a loop around a demo suggests to me that things are still pretty bad.
On a personal level, that YouTube stream doesn't make him come off looking that good... people are trying to get patches to him and generally soothe him / do damage control, and he's just being a bit of a manchild. And it sounds like that's the general course of events around a lot of his "efforts".
On the other hand, he's not wrong either: having this private build inside AMD, and not even validating officially supported configurations for the non-private builds they show to the world, isn't a good look, and that's just the very start of the problems around ROCm. AMD's OpenCL runtime was never stable or good either, and every experience I've heard with it was "we spent so much time fighting AMD-specific runtime bugs and spec jank that what we ended up with was essentially vendor-proprietary anyway".
On the other other hand, it sounds like AMD knows this is a mess and has some big stability/maturity improvements in the pipeline. It seems clear from some of the smoke coming out of the building that they're cooking on more general ROCm support for RDNA cards, and generally working to patch the maturity and stability issues he's talking about. I hate the "wait for drivers / the new software release, bro, it's gonna fix everything" attitude that surrounds AMD products, but in this case I'm at least hopeful: they seem to understand the problem, even if it's absurdly late.
Some of what he was viewing as "the process happening in secret" was likely people doing rush patches on the latest build to accommodate him, and he comes off as berating them over it. Again, that stream just comes off as "mercurial manchild", not coding genius. And everyone knew the driver situation was bad; that's why there's notionally alpha for him to realize here in the first place. He's bumping into moneymakers, and getting mad about it.
That aligns closely with when AMD bought ATI. Before that ATI had a long history of egregiously buggy GPGPUs. I can't say what's gone wrong with Radeon, but I can say it's been going wrong for nearly my entire life.
Honestly, I am amazed by all the complaints on here regarding AMD crashing. I have not seen this once while owning my card! All the while my friends on Discord keep complaining about their Nvidia cards while we’re playing Warzone.
You don’t need these buggy AMD-specific technologies to leverage AMD’s GPGPU hardware. Compute shaders have been used by videogames for a decade now; for example, GTA5 from 2013 already used compute shaders to render stuff. At least in my experience, the tech is very reliable regardless of GPU vendor.
I haven’t tested that, but I would expect Vulkan compute on AMD hardware to be comparable by now. After all, people have bought a couple million Steam Decks, which use Vulkan as the native GPU API and emulate D3D on top of that.
Yeah, my GPU isn't so terrible lately, but that's been a big draw. Eventually, you just get fed up and start looking elsewhere. Some of these AMD cards are complete trash.
I mean, silicon aside, isn't it all basically CUDA lock-in?
CUDA is the de facto parallel computing platform, and exciting tech moves VERY fast with early mover advantage - so no researcher/computer scientist is going to bother learning another platform, or risk wasting time and effort on their stuff breaking when ported to OpenCL/AMD.
Unless a miracle happens, any new parallel-computing requirement that seizes the popular imagination in the next decade (or more) will be done in CUDA.
The sad part is that most people are using a higher-level API like PyTorch. So AMD or Intel just need a simple low-level API to access their hardware, and then to write and tune some kernels for PyTorch (and I suspect the community, or AI, will gladly help with the tuning). So basically there does not seem to be a real CUDA lock-in; it's just that the competition still seems unable to do even this.
Apple was able to break the CUDA lock-in with their Metal compute API (MPS) in no time at all. Within months, major AI libraries like PyTorch and TensorFlow all supported Apple GPUs without a hitch.
What's taking AMD so long? They just can't do software I guess?
> I mean, silicon aside, isn't it all basically CUDA lock-in?
Everything is and always will be some form of lock-in. Look at what’s happened with CentOS or even Ubuntu. Even what were solid open source choices before have turned into some form of exploited lock-in. I’ve come to the realization that it’s impossible to optimize your way out of lock-in; instead, it’s better to cost-optimize for current best practices while continuously running test applications on different platforms and technologies when you have the resources to do so.
The amount of money being spent on ML compute is pretty high. If it stays high, cost will be a relevant axis that others can compete on.
GPUs are also not super available atm. We'll see if this is a temporary issue or one that persists for years. If you can't get A100s, you'll try the AMD/TPU competitors.
In the post Moore's law era my hunch is that the eventual "winner", if there is one, will be decided based on software not hardware.
So far the main use cases that have pushed the computational performance envelope have been hype-driven. In such a climate people will just accept development costs, lock-in risks, etc. as the cost of entering the bubble early enough. In "normalized" conditions the focus shifts back to developer and user productivity and cost-efficiency.
We are slowly entering the commoditised HPC era, but HPC was always a volatile domain precisely because of its highly specialized nature. The broad, general-purpose use and (relatively) easy set of tools of the x86 CPU era provided a stability that is unlikely to repeat.
The vector accelerator market is (conceptually) for the CPU vendors to lose, because they can create a more productive development environment that unifies CPU/GPU. To paraphrase: users don't care about GPUs, they care about performant applications.
They just keep adding support for what the market demands; adding FP8 to the H100 and the 40xx series was such a good move. Their tooling is top notch: what is AMD's alternative to RAPIDS? Their drivers are so stable that you can rely on them not crashing in projects that need to train for weeks and months, and SXM is another big win for Nvidia.
They also have skin in the game and do a lot of research in data science.
A single 4090 can do 0.66 PFLOPS, which is mind-blowing to me: that's borderline having a supercomputer from the early 2000s in your room for a few thousand dollars.
I recently found an article by Apple explaining how they optimized a transformer to work on the Apple Neural Engine [1]. This was before their recent hardware refresh with the 192 GB unified memory upgrade.
Apple could have a real chance to push their boxes as an alternative to dedicated GPUs.
I'd love to see more discussion of the assertion that Google is the only real competitor to Nvidia. I get the angle, but it still feels weird to think of them as competitors, especially given how Google appears to be struggling with their AI story at the moment.
It also wouldn't hurt if Google showed anywhere near the discipline that Nvidia has shown over the years. About the only thing Nvidia has done to upset anyone is to not have open source drivers. Even there, though, many folks are far happier with their closed drivers than they are with many of the open alternatives.
I think the article is making the (reasonable?) assumption that Google's search and advertising will use Google's own in-house Bard or something similar. The article mentioned something about Google having the full vertical stack: hardware, networking, software, ML model, product, and the users.
As of right now, Bard is separate. Or if Google has integrated ML models into search and ads, it hasn't been widely discussed.
From HPC industry experience with Nvidia, my take is that they are the smartest guys in the room. Their solution architects, who help customers implement and optimize CUDA, are by far the best SAs I’ve ever worked with. If that hiring ethos is pervasive, they’ll keep winning.
If the GCC compiler has an issue, I'm in a lot of trouble too, even though it's open source. I guess I could scan the code and figure out any minor problems (or figure out why I was misusing it) but I'm not an expert in compilers, or even at tensor programming.
Obviously it would be better if they were open source, since someone would be able to fix them, and it helps to know how your tools really work. But a better closed source driver is probably the right tool for most problems.
I've had no problems with Nvidia Linux drivers themselves in the last 15 years or so. Yes, they don't play well with the rest of the ecosystem. Using Optimus laptops was a nightmare but you could still get hardware acceleration if you needed it, with performance on par with Windows. ATI on the other hand is a choose your own adventure kind of a deal.
I love open source, but this just isn't true. If you are a company, you find an actual problem with Nvidia, and you just spent some multiple of $100k or more on graphics cards for a cluster, you are able to get an ear and get that driver fixed.
They both need to get this 1000% better stat. I think it's probably the best ROI the CEO of AMD can get - put a team of engineers on Linux drivers full time, increase your AI market share... and triple your market cap.
As soon as you are outside of what they care about, your support will become abysmal. And they don't care about the modern desktop use case (such as Wayland support, which they lacked for years).
So if you are a Linux user - simply avoid Nvidia. They don't care about you.
Nvidia's Linux compute solution is far better than AMD's compute solution, and their Linux graphics solution is far worse than AMD (and Intel)'s graphics solution. So the recommendation comes down to why you want that GPU.
No, they are "famous" for having the best drivers but not open sourcing them.
NVIDIA's drivers are practically the same size as an OS anyway. Arguably they are an OS, for the GPU instead of the CPU. So it's not a huge surprise they don't give it all away for free, especially as software is a competitive advantage for them.
CUDA on Linux just works. And works very well. If anything most CUDA workloads run on Linux machines, and it's completely pain free compared to setting up ROCm even on Linux.
From what I understand, that was about them not cooperating with the kernel devs, and the fact that their driver was a black box that had to interact with half the kernel didn't help.
Driver-wise, my worst experiences were all solidly in the ATI/AMD binary camp, with the constant need to exorcise nouveau from various systems coming second.
I'm really disturbed by all of this - it doesn't quite check out with me that there are some quite capable players with kagillions of dollars who haven't managed to, or, more disturbingly, won't even put up the investment or effort to break into the AI shenanigans space within the GPU market.
I don't think you understand just how insanely difficult it is to break into that market. There's a lot that goes into GPUs that makes it a very difficult industry to get going in. Even with Apple money or something like that, it's a losing prospect, because in the time it'll take you to get up and off the ground (which is FOREVER) your competition will crush you.
Agreed. Nvidia’s success comes from their unrelenting drive and ability to pivot to new markets. They didn’t independently discover that GPUs would be super useful for DL but they did see the incredible potential and invested big time into it.
So while competition will come, Nvidia isn’t going to pull an Intel and sit by collecting datacenter revenue on incrementally better hardware. The way I see it, Nvidia will really only be outdone by some major paradigm shift in their biggest markets, e.g., a breakthrough in AI computing that no longer relies on digital ICs.
> I don't think you understand just how insanely difficult it is to break into that market.
You're right, I have no clue nor have I ever tried myself.
> Even with apple money or something like that, it's a losing prospect because in the time it'll take you to get up and off the ground (which is FOREVER) your competition will crush you.
This I find hard to believe; do you have a source or reference for that claim? Companies with that amount of cash are hardly going to be crushed by competition, be it direct or indirect. Anyway, I'm talking more about the Intels and AMDs of this world.
We have very lacklustre efforts from players I won't name <cough Intel> with their Zluda library (https://github.com/vosen/ZLUDA) which I got REALLY excited about, until I read the README.txt. Four contributors, last commit early 2021.
ZLUDA is a library written by a hobbyist. The comparison you are looking for is oneAPI. They don't have the messaging down, but Intel is good at software.
We really, really tried to use oneAPI, for FPGA, working with Intel directly.
Every release would completely change some approach, requiring restructuring. The only card with “free” compatibility was discontinued the week after we bought ours, and the drivers stopped working on the same OS that oneAPI required.
We constantly encountered showstopper bugs that led us to conclude that nobody outside Intel was using it. Some of them appeared to be live-patched on their DevCloud.
Documentation was reasonably good. But every few months Intel would move or restructure documentation links such that it was impossible to persistently store a link to a useful document.
Intel may be good at software, but they are utterly crap at actually maintaining a consistent ecosystem that isn’t painful to try and follow.
If it had been "easy", China would have been able to do it. For them it is existential at the national level. Money alone isn't enough to beat decade(s) of effort.
AFAIK the "most powerful" TPU that you can get is the CRL-G18U-P3DF, and that is only capable of running quantized TensorFlow Lite models. Not sure how that is supposed to compete with those 700W TDP superchips.
That nobody else seems to be trying hard enough, despite being well-funded and respectable companies. And until somebody does, we're stuck with Nvidia/CUDA, or half-working stuff that's way more hassle to prototype or build a product from.
To answer the question of why there is not much competition, I would like to point out that the required expertise is very rare. Not chip-design expertise specifically, but expertise spanning both the chip hardware and the low-level software for that chip-specific hardware: everything necessary to create the compilers, frameworks, and libraries, and to plug in well with the existing AI/ML ecosystem such as PyTorch.
The US is putting restrictions on this, so it feels like a short-term workaround:
“Jordan Schneider: Chinese firms, however, are not restricted in accessing cloud services overseas. Nothing is stopping a Chinese company from buying top-of-the-line Nvidia compute from AWS. Does that alter the dynamic?”
I thought it was strange that the history of GPUs jumped from graphics to AI with nary a mention of crypto, but then I realized this was China-centric, where this kind of GPU use case is against the law.
"The main reasons for Nvidia's success, are its creation of a GPU ecosystem, lack of significant competitors, and the benefits derived from its compute and software ecosystem."
ROCm is a disaster. It doesn't even work with most AMD GPUs, and the software is awful (difficult to set up, very poorly supported). AMD keeps occasionally releasing new stuff in the GPGPU space, then leaves it unsupported and continues to fall further behind Nvidia.