InterVL-HW1

Trained and exported on 2025-10-13_11-29-14.

  • Backbone: InternVLChatModel
  • AMP dtype: bfloat16
  • Uses video pixel_values with temporal mean-pooling in vision encoder.
  • Includes training checkpoint in checkpoints/.

If you trained with a monkey-patched forward, runtime weights are still standard. You can reuse them with the original InternVLChatModel codebase.

Downloads last month
4
Safetensors
Model size
1B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support