PhD Knowledge Not Required: A Reasoning Challenge for Large Language Models Paper • 2502.01584 • Published 8 days ago • 9
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 7 days ago • 153
view article Article Mastering Long Contexts in LLMs with KVPress By nvidia and 1 other • 19 days ago • 62
Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate Paper • 2501.17703 • Published 13 days ago • 51
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models Paper • 2502.01061 • Published 8 days ago • 168
The Differences Between Direct Alignment Algorithms are a Blur Paper • 2502.01237 • Published 8 days ago • 109
GuardReasoner: Towards Reasoning-based LLM Safeguards Paper • 2501.18492 • Published 12 days ago • 80
view article Article How biased is Whisper ? Evaluating Whisper Models for Robustness to Diverse English Accents By Steveeeeeeen • 13 days ago • 16
view article Article 🅰️ℹ️ 1️⃣0️⃣1️⃣ The Keys to Prompt Optimization By Kseniase and 1 other • 13 days ago • 4
Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. • 10 items • Updated about 17 hours ago • 90
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 3 items • Updated 15 days ago • 336
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 20 days ago • 314