view article Article Speculative Decoding in Practice: How EAGLE3 Makes LLMs Faster Without Changing Their Outputs lujangusface • Apr 3 • 8
view article Article 2x Faster on a 229B MoE: EAGLE3 Speculative Decoding for MiniMax-M2.5 lujangusface • Apr 9 • 3