DAS: Distribution-Aware Speculative Decoding for RL Post-Training
A speculative-decoding framework that accelerates reinforcement-learning rollout generation for large language models. A length-aware speculation policy prioritizes aggressive decoding on long-tail trajectories, reducing rollout makespan by up to 50% on agentic reasoning workloads with no change in accuracy.






