These are the GGUF's of the model LFM2-VL-3B.

Usage Notes:

  • Download the latest llama.cpp to use these quantizations.
  • Try to use the best quality you can run.
  • For the mmproj file, the F32 version is recommended for best results (F32 > BF16 > F16).
Downloads last month
40
GGUF
Model size
3B params
Architecture
lfm2
Hardware compatibility
Log In to add your hardware

8-bit

16-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for noctrex/LFM2-VL-3B-GGUF

Quantized
(5)
this model