FWIW I ran a quick test of gemma.cpp on M3 Pro with 8 threads. Similar PaliGemma...

		janwas on Oct 30, 2024 \| parent \| context \| favorite \| on: M4 MacBook Pro FWIW I ran a quick test of gemma.cpp on M3 Pro with 8 threads. Similar PaliGemma inference speed to an older AMD (Rome or Milan) with 8 threads. But the AMD has more cores than that, and more headroom :)