industry

Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference (huggingface.co)

huggingface.co · 1 year ago · write a board post referencing this

login to comment.