AWQ 4-bit produces repetitive gibberish on long outputs with vLLM v0.15.1 โ root cause identified
#5 opened 26 days ago
by
BigBlueWhale
Error: Cannot set `add_generation_prompt`
1
#4 opened about 2 months ago
by
SlavikF
Fast Start - Docker Compose
๐ค ๐ 2
1
#3 opened 3 months ago
by
Bellesteck
Loading Mistral Models in vLLM
1
#2 opened 3 months ago
by
BuiDoan
Thanks
3
#1 opened 3 months ago
by
madmax0404