Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I would try the Qwen models before LLaVa

Do you need the embeddings to be private? Or just the photos?



For photo indexing I'd run CLIP directly and save on compute, no need to use a whole language model.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: