huggingface/daily-papers

BlockPilot: Instance-Adaptive Policy Learning for Diffusion-based Speculative Decoding

Hao Zhang, Yiming Hu, Yong Wang, Mingqiao Mo, Xin Xiao 2026-06-29 · 20:00 UTC

BlockPilot introduces instance-adaptive policy learning to optimize diffusion-based speculative decoding. While current methods rely on fixed inference block sizes and uniform strategies, BlockPilot aims to improve upon these limitations to enhance the efficiency of generating multiple tokens per forward pass. This approach builds on block-level diffusion to maintain lossless acceleration during LLM inference.

Read original

→ View original source

← Back to homepage

BlockPilot: Instance-Adaptive Policy Learning for Diffusion-based Speculative Decoding

Related Articles

Building an Agentic AI Data Science Team with LLMs: A Multi-Agent Machine Learning Workflow

AMD is back in play: ZINC now beats llama.cpp on our RDNA4 local LLM sweep

I mapped the "Dynamic Grammar" of LLMs: How hidden states move, stabilize, and decide

New attack provides one more reason why AI browsers are a bad idea

Claude Sonnet 5