Vllm usage guide

#6
by finneouspj - opened

Hello,

Vllm usage guide is missing some steps. Trying to run it complains the "mellum" arch is not supported by transformers.

Interesting model for consumer GPUs. Would love to test it ASAP. Thanks for your support.

Hmm, https://huggingface.co/JetBrains/Mellum2-12B-A2.5B-Instruct#serving-with-vllm should start with what version of vllm is expected.

Yes, should work with the latest nightly, waiting for the proper release version and will put it in the readme

Just tested nightly vLLM, the model works fine. I used these recipes. It also should be available in the future v0.22.1 release.

Excellent thank you it is running now

finneouspj changed discussion status to closed

Sign up or log in to comment