Article • KV Caching Explained: Optimizing Transformer Inference Efficiency • By not-lain • 13 days ago • 24
Article • SmolVLM Grows Smaller – Introducing the 250M & 500M Models! • 20 days ago • 124
Article • Train 400x faster Static Embedding Models with Sentence Transformers • 28 days ago • 142
Article • Fine-tune ModernBERT for RAG with Synthetic Data • By sdiazlor and 2 others • 22 days ago • 35
Towards Best Practices for Open Datasets for LLM Training • Paper • 2501.08365 • Published 28 days ago • 54
Article • Crowd-sourced Open Preference Dataset for Text-to-Image Generation • By RapidataAI and 4 others • Jan 7 • 18
Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models • Paper • 2412.09645 • Published Dec 10, 2024 • 35
RetroLLM: Empowering Large Language Models to Retrieve Fine-grained Evidence within Generation • Paper • 2412.11919 • Published Dec 16, 2024 • 33
Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation • Paper • 2412.03304 • Published Dec 4, 2024 • 17
Article • 🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs • By wolfram • Dec 4, 2024 • 76
Article • Use Models from the Hugging Face Hub in LM Studio • By yagilb • Nov 28, 2024 • 138
Article • To what extent are we responsible for our content and how to create safer Spaces? • By davidberenstein1957 • Aug 30, 2024 • 5
Article • Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK • By davidberenstein1957 and 1 other • Nov 21, 2024 • 35
Article • How to optimize your data labelling project with custom interfaces • By burtenshaw and 9 others • Oct 16, 2024 • 18
Article • How to build a custom text classifier without days of human labeling • By sdiazlor and 4 others • Oct 17, 2024 • 55
Article • Model2Vec: Distill a Small Fast Model from any Sentence Transformer • By Pringled and 1 other • Oct 14, 2024 • 69