Same with bandwidth though, usually pro/max memory has much higher speed

andy_ppp · 2025-10-15T14:15:50 1760537750

Yes the M4 Base has 120 GB/s, Pro 273 GB/s and Max has 546 GB/s... That means M5 Pro is potentially around 348 GB/s and M5 Max is almost at 700 GB/s - for comparison a 4090 has around 1,000 GB/s. So pretty incredible!

sgt · 2025-10-15T16:34:50 1760546090

Also I think even an M3 Ultra is more cost effective at running LLMs than 4090 or 5090. Mostly due to being more energy efficient. And less fragile than running a gamer PC build.

andy_ppp · 2025-10-15T17:11:12 1760548272

It can run larger models quite slowly but lacks matmul acceleration (included in the M5) that is very useful for context and prompt performance at inference time. I will probably burn my budget with an M5 Max with 256gb (maybe even 512gb) memory, the price will be upsetting but I guess that is life!

sgt · 2025-10-15T18:51:04 1760554264

Yes! I think smaller models on the M3 Ultra is interesting enough, but now with matmul/ tensors on M5 Ultra or Max, with decent unified mem, it will be a gamechanger.

I can easily imagine companies running Mac Studios in prod. Apple should release another Xserve.

andy_ppp · 2025-10-16T08:09:57 1760602197

Yes completely, my guess is M6 will have external GPUs perfect for AI accelerators at home and in datacenters.

replete · 2025-10-15T20:13:43 1760559223

I think the M5 Max will be more like 614GB/s, unless they somehow have exceeded DDR5x-9600 or added more than 32 memory controllers

andy_ppp · 2025-10-16T08:17:33 1760602653

DDR5-9600 is 153GB/s from a single channel, Max has 4 channels… these are all theoretical values of course - real world none of these, even the graphics card will get that near to those… so not sure what you’re saying.