--- base_model: - Qwen/Qwen2.5-VL-7B-Instruct language: - en - zh license: mit metrics: - accuracy pipeline_tag: image-text-to-text library_name: transformers --- The CodeV-SFT model is used to continue training CodeV-RL, as described in the paper [CodeV: Code with Images for Faithful Visual Reasoning via Tool-Aware Policy Optimization](https://huggingface.co/papers/2511.19661). For the code repository, please refer to: [https://github.com/RenlyH/CodeV](https://github.com/RenlyH/CodeV)