Homepage
Research
Experience
Blogs
Teaching
Services
Talks
Awards
News
Blogs
MineDraft: A Framework for Batch Parallel Speculative Decoding
Mar 2026
MineDraft accelerates large language model inference by overlapping the drafting and verification stages of speculative decoding, hiding latency and unlocking substantial throughput gains in batch settings.