PPL and KLD graphs for each quant

#7
by 4cast - opened

It would be great if the Unsloth team could create a graph with model size (gb) and KLD/PPL to show the loss in quality between each quant because I have heard many mixed messages on what to use. If this could be done, thank you!

It would be great if the Unsloth team could create a graph with model size (gb) and KLD/PPL to show the loss in quality between each quant because I have heard many mixed messages on what to use. If this could be done, thank you!

I second this

Thx so much.
Could you also include APEX quants in your analysis?
https://huggingface.co/mudler/MiniMax-M2.7-APEX-GGUF
I fully understand if you can't do this as they haven't even uploaded their own benchmarks yet. However, this would be another great reference point.

Edit: They removed the quants due to an issue. Will update message when they fix it.

Sign up or log in to comment