Black-Box On-Policy Distillation of Large Language Models Paper • 2511.10643 • Published 25 days ago • 46
AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders Paper • 2510.19779 • Published Oct 22 • 60