view article Article Computational Model for Symbolic Representations: An Interaction Framework for Human-AI Collaboration By Severian • 22 days ago • 4
view article Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control 8 days ago • 92
Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. • 10 items • Updated about 21 hours ago • 90
DeepSeek-R1-ReDistill Collection Re-distilled DeepSeek R1 models • 4 items • Updated 12 days ago • 12
Qwen2.5-1M Collection The long-context version of Qwen2.5, supporting 1M-token context lengths • 2 items • Updated 16 days ago • 99
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 3 items • Updated 16 days ago • 337
Rei-12B Collection A small preview of what might become the first(or second?) stepping stone for Magnum v5 • 3 items • Updated 17 days ago • 2
SmolVLM 256M & 500M Collection Collection for models & demos for even smoller SmolVLM release • 12 items • Updated 19 days ago • 68
view article Article Mastering Long Contexts in LLMs with KVPress By nvidia and 1 other • 19 days ago • 62
VideoLLaMA3 Collection Frontier Multimodal Foundation Models for Video Understanding • 14 items • Updated 4 days ago • 11
view article Article The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about... By srinivasbilla • 22 days ago • 60
OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking Paper • 2501.09751 • Published 26 days ago • 47
Sana Collection ⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer • 21 items • Updated 1 day ago • 87