
Are the Vicuna weights available for download, and are they llama.cpp compatible? I can't grok that by skimming the page...


The GitHub page (https://github.com/lm-sys/FastChat#vicuna-weights) explains it better:

> We release Vicuna weights as delta weights to comply with the LLaMA model license. You can add our delta to the original LLaMA weights to obtain the Vicuna weights.


Officially, only as deltas against the LLaMA weights, which require a complicated and resource-intensive conversion procedure. Unofficially, yes, a pre-converted llama.cpp-compatible ggml file is available, but obviously I won't publish the link here, to avoid violating Y Combinator's terms of use.


Vicuna's fine-tune of the LLaMA weights is available for download in the form of "deltas".

So you get the LLaMA weights (somewhere), then apply the Vicuna deltas to them to end up with the Vicuna model.
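
To make "apply the deltas" concrete, here is a rough sketch of the idea, assuming the delta is simply the element-wise difference between the fine-tuned and the original weights. This is not FastChat's actual conversion code: `apply_delta`, the `Weights` type, and the parameter names are hypothetical, and plain Python lists stand in for real tensors.

```python
from typing import Dict, List

# Toy stand-in for a checkpoint: parameter name -> flat list of values.
# (Hypothetical type; real checkpoints hold large multi-dimensional tensors.)
Weights = Dict[str, List[float]]


def apply_delta(base: Weights, delta: Weights) -> Weights:
    """Return base + delta, parameter by parameter (assumed additive deltas)."""
    if base.keys() != delta.keys():
        raise ValueError("base and delta must contain the same parameter names")

    target: Weights = {}
    for name, base_values in base.items():
        delta_values = delta[name]
        if len(base_values) != len(delta_values):
            raise ValueError(f"size mismatch for parameter {name!r}")
        # Element-wise addition recovers the fine-tuned value.
        target[name] = [b + d for b, d in zip(base_values, delta_values)]
    return target


if __name__ == "__main__":
    # Made-up numbers purely for illustration.
    base = {"layer0.weight": [1.0, 2.0], "layer0.bias": [0.0]}
    delta = {"layer0.weight": [0.5, -1.0], "layer0.bias": [0.25]}
    print(apply_delta(base, delta))
```

The point is that the delta alone is useless without the original LLaMA weights, which is how the release stays within the LLaMA license.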


The weight deltas are available: https://github.com/lm-sys/FastChat#vicuna-weights



