Inference engines that actually use OSS to its full extent? (Responses, Harmony)

#147
by Howaboua - opened

Hi! A genuine question: what is the best way to take advantage of all the model's features? As far as I understand, vLLM is the only engine that fully supports the model. Is that correct? Thanks in advance for any help.
