Towards Best Practices for Open Datasets for LLM Training Paper • 2501.08365 • Published 28 days ago • 54
TinySwallow Collection Compact Japanese models trained with "TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models" • 5 items • Updated 13 days ago • 13
EmbodiedEval: Evaluate Multimodal LLMs as Embodied Agents Paper • 2501.11858 • Published 22 days ago • 5
UI-TARS: Pioneering Automated GUI Interaction with Native Agents Paper • 2501.12326 • Published 21 days ago • 49
view article Article Train 400x faster Static Embedding Models with Sentence Transformers 28 days ago • 142
view article Article How to generate text: using different decoding methods for language generation with Transformers Mar 1, 2020 • 153
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated 6 days ago • 227
SmolVLM Collection State-of-the-art compact VLMs for on-device applications: Base, Synthetic, and Instruct • 5 items • Updated Dec 22, 2024 • 33
view article Article Introducing Observers: AI Observability with Hugging Face datasets through a lightweight SDK By davidberenstein1957 and 1 other • Nov 21, 2024 • 35
view article Article Transformers.js v3: WebGPU support, new models & tasks, and more… Oct 22, 2024 • 67
view article Article Model2Vec: Distill a Small Fast Model from any Sentence Transformer By Pringled and 1 other • Oct 14, 2024 • 69
view article Article Fine-Tuning 1B LLaMA 3.2: A Comprehensive Step-by-Step Guide with Code By ImranzamanML • Oct 2, 2024 • 46
NVLM 1.0 Collection A family of frontier-class multimodal large language models (LLMs) that achieve state-of-the-art results on vision-language tasks and text-only tasks. • 2 items • Updated 26 days ago • 51
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated Jan 8 • 554