Hacker News
Show HN: Made a batching LLM API for a project. Mistral 200 tk/s on RTX 3090 (github.com/epolewski)
3 points by muttled on Dec 26, 2023
