Are the Vicuna weights available for download, and are they llama.cpp compatible...

TehCorwiz · on April 5, 2023

The github page (https://github.com/lm-sys/FastChat#vicuna-weights) is better:

> We release Vicuna weights as delta weights to comply with the LLaMA model license. You can add our delta to the original LLaMA weights to obtain the Vicuna weights.

patrakov · on April 5, 2023

Officially, only as deltas against LLaMa weights, and needing a complicated and resource-intensive conversion procedure. Unofficially, yes, a pre-converted llama.cpp compatible ggml file is available, but obviously I won't publish the link here to avoid violating the Y Combinator's terms of use.

detrites · on April 5, 2023

Vicuna's fine-tune of the LLaMA weights are available for download, called "deltas".

So you get the LLaMA weights (somewhere), then apply the Vicuna deltas to them to end up with the Vicuna model.

danielbln · on April 5, 2023

The weight deltas are available: https://github.com/lm-sys/FastChat#vicuna-weights