I tested Llama-3-70b-8192 on Groq against ChatGPT 4. Groq ran it super fast, but the model hallucinated one answer and got the logic wrong on another question.
So, ChatGPT 4 is still more reliable for my use case. But if I wanted an LLM to process data, summarize, and so forth, Llama-3 on Groq would be a very fast option.
Questions:
Question 1: Do you know anything about Intel Hala Point?
Groq: bullshit, but admitted it when I called it out.
ChatGPT: did a Bing search (it knew what it didn’t know).
Question 2a (separate chat):
If you’re in Canada, what’s the best way to use a TFSA?
2b: Okay, if your portfolio has some tech stocks, some cash cows, and some government bonds, which should be allocated to the TFSA?
The reason I chose Question 2 is that most banks are happy to recommend bad products if it benefits them. Llama-3’s answer reflects the bank bullshit. ChatGPT 4 gives the advice your trustworthy and financially savvy friend would give you.
Follow-on questions for Llama-3:
2c: You have it backwards.
2d: Why did you get it backwards? Were you influenced by the glut of “advice” proffered by banks?