Reward-Guided Speculative Decoding for Efficient LLM Reasoning • Paper • 2501.19324 • Published 11 days ago • 34
Reasoning Datasets • Collection • Distilled synthetic Reasoning datasets • 7 items • Updated 9 days ago • 50
TinyLlama/TinyLlama-1.1B-intermediate-step-1195k-token-2.5T Text Generation • Updated Dec 29, 2023 • 1.8k • 52