Bug: Missing modeling_iquestpltcoder.py file prevents model from loading

#3
by JohnMolotov - opened

Hi there,

Thank you for releasing the LoopCoder-V2 model!

It looks like the repository is currently missing the core modeling file required to initialize the custom architecture. While configuration_iquestpltcoder.py and tokenization_iquestcoder.py are present, modeling_iquestpltcoder.py appears to have been omitted during the upload.

When attempting to load the model via standard HF transformers or vLLM with trust_remote_code=True, it fails at architecture resolution because the AutoModelForCausalLM hook has no blueprint to construct the layer weights.

Could you please upload modeling_iquestpltcoder.py to the repository? Thank you!

I ran into the same problem. There's a YouTube video where the model is being tested, but I think he's running the model in V1, not V2. https://www.youtube.com/watch?v=ruxvS5Bd3mU

Multilingual-Multimodal-NLP org

Sry for inconvenience, we will release the codebase for inference of v2 version

Multilingual-Multimodal-NLP org

Hi, we updated the readme, and users can use the vLLM branch for inference.

For vLLM inference, install vLLM from yxing-bj/vllm and use transformers==4.57.1, then start the server with the following command:

vllm serve $MODEL --port 8080 \
    --max-num-batched-tokens 8192 --max-num-seqs 512 -tp 1 -dp 1 --trust-remote-code \
    --cudagraph-capture-sizes 1 2 4 8 12 16 24 32
Multilingual-Multimodal-NLP org

We sincerely apologize, but support for the Transformers implementation will be added in a future update. For now, we recommend using the specified version of vLLM for inference. Thank you very much for trying our model.

for this issue reported here , any clue ?
https://huggingface.co/Multilingual-Multimodal-NLP/LoopCoder-V2/discussions/2#6a37afe84fd6509975343085

Thank you for the quick response, yxing-bj/vllm works great.

With a little patch for quantization I've even been able to get it running in under 8GB VRAM! I'm excited to see how it performs on real tasks.

JohnMolotov changed discussion status to closed

Sign up or log in to comment