High-Fidelity Simultaneous Speech-To-Speech Translation Paper • 2502.03382 • Published 6 days ago • 8
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 7 days ago • 153
VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models Paper • 2502.02492 • Published 7 days ago • 48
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 134
view article Article Mini-R1: Reproduce Deepseek R1 „aha moment“ a RL tutorial By open-r1 • 11 days ago • 33
Exploring the sustainable scaling of AI dilemma: A projective study of corporations' AI environmental impacts Paper • 2501.14334 • Published 18 days ago • 17
view article Article Mastering Long Contexts in LLMs with KVPress By nvidia and 1 other • 19 days ago • 62
view article Article Hugging Face and FriendliAI partner to supercharge model deployment on the Hub 21 days ago • 30
view article Article Yay! Organizations can now publish blog Articles By huggingface and 3 others • 22 days ago • 33
view article Article MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era By MiniMax-AI • 27 days ago • 40
view article Article Train 400x faster Static Embedding Models with Sentence Transformers 28 days ago • 142