StochasticLi | 10 months ago | on: Life of an inference request (vLLM V1): How LLMs a...
I would like to know exactly what inference speeds they are achieving, and on what hardware. I skimmed and searched the article but didn't find that information.
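For what it's worth, raw decode throughput is easy to measure yourself once you have any generation call to time. A minimal sketch, assuming only a callable that returns output tokens — the `generate` parameter and the `fake_generate` stub below are illustrative placeholders, not vLLM's actual API:

```python
import time

def tokens_per_second(generate, prompt, n_runs=3):
    """Time a text-generation callable and report its best decode throughput.

    `generate` stands in for a real engine call (e.g. something like
    vLLM's LLM.generate); this sketch does not depend on any framework.
    """
    best = 0.0
    for _ in range(n_runs):
        start = time.perf_counter()
        out_tokens = generate(prompt)         # the timed generation call
        elapsed = time.perf_counter() - start
        best = max(best, len(out_tokens) / elapsed)
    return best

# Stand-in "model" for illustration: emits 128 dummy token ids.
def fake_generate(prompt):
    return list(range(128))

print(f"{tokens_per_second(fake_generate, 'hello'):.0f} tok/s")
```

With a real engine behind `generate`, the number reported this way is what benchmark tables usually quote as tokens per second, and it only means something alongside the hardware, batch size, and sequence lengths used — which is exactly the context missing from the article.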