Hacker News

I feel there’s a gap in this thread (or I may be the one missing it).

DeepSeek proved knowledge distillation works very well and cheaply https://en.m.wikipedia.org/wiki/Knowledge_distillation

But they didn’t show how to build a new frontier model cheaply.

So you still need massive investments to build new frontier models. The bad part is that they can be replicated cheaply.
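For context, the core idea behind knowledge distillation (per the Wikipedia link above) is to train a small "student" model to match a large "teacher" model's softened output distribution rather than hard labels. A minimal sketch of the classic Hinton-style distillation loss, in plain Python; this illustrates the general technique, not DeepSeek's specific training pipeline, and all function names here are illustrative:

```python
import math

def softmax(logits, temperature=1.0):
    # Temperature-scaled softmax: higher T softens the distribution,
    # exposing the teacher's "dark knowledge" about non-top classes.
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract max for numerical stability
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    # KL(teacher || student) between temperature-softened distributions,
    # scaled by T^2 so gradients stay comparable across temperatures.
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    return temperature ** 2 * kl

# When the student matches the teacher, the loss is zero;
# the further apart the distributions, the larger the loss.
teacher = [3.0, 1.0, 0.2]
perfect = distillation_loss(teacher, teacher)
imperfect = distillation_loss(teacher, [0.2, 1.0, 3.0])
```

In practice this term is usually mixed with an ordinary cross-entropy loss on the true labels, so the student learns from both the data and the teacher.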



I think you are missing it:

https://stratechery.com/2025/deepseek-faq/

That has a great overview: this is a new model, but also a distillation. They used new techniques to make it comparatively cheap.


Thank you for the link. I have a lot of respect for Stratechery. I learned a lot, and agree, I’m the one who was missing it haha


This comment seems to be complete nonsense. See here https://arxiv.org/abs/2412.19437v1


The internet would be a little better if people were a little nicer.


Sorry


Thanks for saying that :) I appreciate you



