huggingface/daily-papers
Breaking Entropy Bounds: Accelerating RL Training via MTP with Rejection Sampling
Yucheng Li, Huiqiang Jiang, Yang Xu, Jianxin Yang, Yi Zhang
2026-06-09 · 20:00 UTC
Breaking Entropy Bounds: Accelerating RL Training via MTP with Re
← Back to homepage