view article Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control 8 days ago • 90
view article Article Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial By open-r1 • 11 days ago • 33
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 20 days ago • 314
view article Article Yay! Organizations can now publish blog Articles By huggingface and 3 others • 22 days ago • 33
view article Article Alpine Agent: An AI Agent to Navigate Your Winter Mountain Adventures By florentgbelidji • 25 days ago • 3
view article Article MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era By MiniMax-AI • 27 days ago • 40
MiniMax-01: Scaling Foundation Models with Lightning Attention Paper • 2501.08313 • Published 28 days ago • 273
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published Jan 8 • 255
view article Article Python Is All You Need? Introducing Dria-Agent-α By andthattoo and 1 other • Jan 10 • 22