view article Article Ο0 and Ο0-FAST: Vision-Language-Action Models for General Robot Control 8 days ago β’ 90
Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch Paper β’ 2501.18512 β’ Published 12 days ago β’ 25
Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. β’ 10 items β’ Updated about 18 hours ago β’ 90
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper β’ 2501.12948 β’ Published 20 days ago β’ 314
view article Article Hugging Face and FriendliAI partner to supercharge model deployment on the Hub 21 days ago β’ 30
view article Article Yay! Organizations can now publish blog Articles By huggingface and 3 others β’ 22 days ago β’ 33
view article Article π¦Έπ»#7: From Agentic AI to Physical AI By Kseniase β’ about 1 month ago β’ 7
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models Paper β’ 2501.03262 β’ Published Jan 4 β’ 90
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Though Paper β’ 2501.04682 β’ Published Jan 8 β’ 89
TACO Models Collection This collection contains the best-performing TACO models based on LLaMA-3/Qwen2 and SigLIP/CLIP. β’ 3 items β’ Updated Dec 20, 2024 β’ 8