rsanek on Jan 28, 2025 \| on: Nvidia’s $589B DeepSeek rout

I've run their distilled 70B model and didn't come away too impressed -- it feels similar to the existing base model it was trained on, which also rivaled GPT-4