VideoRoPE: What Makes for Good Video Rotary Position Embedding? Paper • 2502.05173 • Published 4 days ago • 57
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 7 days ago • 153
Reward-Guided Speculative Decoding for Efficient LLM Reasoning Paper • 2501.19324 • Published 11 days ago • 34
GuardReasoner: Towards Reasoning-based LLM Safeguards Paper • 2501.18492 • Published 12 days ago • 80
Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate Paper • 2501.17703 • Published 13 days ago • 51
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published 14 days ago • 101
Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions Paper • 1712.05884 • Published Dec 16, 2017 • 3
Continuous Autoregressive Models with Noise Augmentation Avoid Error Accumulation Paper • 2411.18447 • Published Nov 27, 2024 • 2
Emilia: A Large-Scale, Extensive, Multilingual, and Diverse Dataset for Speech Generation Paper • 2501.15907 • Published 15 days ago • 15
Towards General-Purpose Model-Free Reinforcement Learning Paper • 2501.16142 • Published 15 days ago • 24
Codec-SUPERB: An In-Depth Analysis of Sound Codec Models Paper • 2402.13071 • Published Feb 20, 2024 • 1
Seed-TTS: A Family of High-Quality Versatile Speech Generation Models Paper • 2406.02430 • Published Jun 4, 2024 • 33