view article Article From Llasa to Llasagna 🍕: Finetuning LLaSA to generates Italian speech and other languages By Steveeeeeeen and 1 other • about 10 hours ago • 15
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 7 days ago • 153
GTE models Collection General Text Embedding Models Released by Tongyi Lab of Alibaba Group • 21 items • Updated 22 days ago • 22
view article Article Agentic RAG Stack (1/5) - Index and retrieve documents for vector search using Sentence Transformers and DuckDB By davidberenstein1957 • 15 days ago • 15
view article Article KV Caching Explained: Optimizing Transformer Inference Efficiency By not-lain • 13 days ago • 24
view article Article 🚀 Build a Qwen 2.5 VL API endpoint with Hugging Face spaces and Docker! By ariG23498 • 14 days ago • 14
Language Detection Collection StaticVectors models to detect language. Exports of FastText that run in NumPy without needing FastText • 2 items • Updated 16 days ago • 3
view article Article Mastering Long Contexts in LLMs with KVPress By nvidia and 1 other • 19 days ago • 62
view article Article Hugging Face and FriendliAI partner to supercharge model deployment on the Hub 21 days ago • 30
view article Article Fine-tune ModernBERT for RAG with Synthetic Data By sdiazlor and 2 others • 22 days ago • 35