Article: Speculative Decoding in Practice: How EAGLE3 Makes LLMs Faster Without Changing Their Outputs • 21 days ago • 8
Article: 2x Faster on a 229B MoE: EAGLE3 Speculative Decoding for MiniMax-M2.5 • 14 days ago • 2
cyankiwi/Qwen3.5-122B-A10B-AWQ-8bit • Image-Text-to-Text • 39B • Updated 28 days ago • 2.83k • 3
cyankiwi/Qwen3-30B-A3B-Instruct-2507-AWQ-4bit • Text Generation • 5B • Updated Mar 23 • 36.5k • 32