Collection for models & demos for even smoller SmolVLM release
![](https://cdn-avatars.huggingface.co/v1/production/uploads/651e96991b97c9f33d26bde6/e4VK7uW5sTeCYupD0s_ob.png)
Hugging Face TB Research
Enterprise
community
AI & ML interests
Exploring smol models and high quality web and synthetic datasets, generated by LLMs (TB is for Textbook, as inspired by the "Textbooks are all your need" paper)
Recent Activity
View all activity
Organization Card
HuggingFaceTB
This is the home for smol models (SmolLM & SmolVLM) and high quality pre-training datasets. We released:
- FineWeb-Edu: a filtered version of FineWeb dataset for educational content, paper available here.
- Cosmopedia: the largest open synthetic dataset, with 25B tokens and 30M samples. It contains synthetic textbooks, blog posts, and stories, posts generated by Mixtral. Blog post available here.
- Smollm-Corpus: the pre-training corpus of SmolLM: Cosmopedia v0.2, FineWeb-Edu dedup and Python-Edu. Blog post available here.
- SmolLM2 models: a series of strong small models in three sizes: 135M, 360M and 1.7B
- SmolVLM: a 2 billion Vision Language Model (VLM) built for on-device inference. It uses SmolLM2-1.7B as a language backbone. Blog post available here.
- FineMath: the best public math pretraining dataset with 50B tokens of mathematical and problem solving data.
News 🗞️
- FineMath: the best public math pretraining dataset with 50B tokens of mathematical and problem solving data https://huggingface.co/datasets/HuggingFaceTB/finemath
![](https://cdn-uploads.huggingface.co/production/uploads/61c141342aac764ce1654e43/RvHjdlRT5gGQt5mJuhXH9.png)
Collections
9
spaces
8
Running
35
SmolVLM 256M Instruct WebGPU
🐨
Generate descriptions for images using WebGPU technology
Running
29
SmolVLM 500M Instruct WebGPU
💻
Running
on
Zero
46
SmolVLM
📊
Generate descriptions from images and text prompts
Running
on
Zero
120
SmolVLM
📊
Generate text responses using images and text prompts
Running
18
SmolLM2 1.7B Instruct WebGPU
🚀
A blazingly fast & powerful AI chatbot that runs in-browser!
Running
132
SmolLM 360M Instruct WebGPU
🚀
A blazingly fast and powerful AI chatbot that runs locally.
models
50
![](https://cdn-avatars.huggingface.co/v1/production/uploads/651e96991b97c9f33d26bde6/e4VK7uW5sTeCYupD0s_ob.png)
HuggingFaceTB/SmolLM2-135M
Text Generation
•
Updated
•
200k
•
55
![](https://cdn-avatars.huggingface.co/v1/production/uploads/651e96991b97c9f33d26bde6/e4VK7uW5sTeCYupD0s_ob.png)
HuggingFaceTB/SmolLM2-360M
Text Generation
•
Updated
•
14.4k
•
31
![](https://cdn-avatars.huggingface.co/v1/production/uploads/651e96991b97c9f33d26bde6/e4VK7uW5sTeCYupD0s_ob.png)
HuggingFaceTB/SmolLM2-135M-Instruct
Text Generation
•
Updated
•
131k
•
112
![](https://cdn-avatars.huggingface.co/v1/production/uploads/651e96991b97c9f33d26bde6/e4VK7uW5sTeCYupD0s_ob.png)
HuggingFaceTB/SmolLM2-360M-Instruct
Text Generation
•
Updated
•
1.28M
•
86
![](https://cdn-avatars.huggingface.co/v1/production/uploads/651e96991b97c9f33d26bde6/e4VK7uW5sTeCYupD0s_ob.png)
HuggingFaceTB/SmolLM2-1.7B-Instruct
Text Generation
•
Updated
•
110k
•
513
![](https://cdn-avatars.huggingface.co/v1/production/uploads/651e96991b97c9f33d26bde6/e4VK7uW5sTeCYupD0s_ob.png)
HuggingFaceTB/SmolLM2-1.7B
Text Generation
•
Updated
•
71.6k
•
94
![](https://cdn-avatars.huggingface.co/v1/production/uploads/651e96991b97c9f33d26bde6/e4VK7uW5sTeCYupD0s_ob.png)
HuggingFaceTB/SmolVLM-256M-Instruct
Image-Text-to-Text
•
Updated
•
25.3k
•
139
![](https://cdn-avatars.huggingface.co/v1/production/uploads/651e96991b97c9f33d26bde6/e4VK7uW5sTeCYupD0s_ob.png)
HuggingFaceTB/SmolVLM-500M-Instruct
Image-Text-to-Text
•
Updated
•
19.2k
•
98
![](https://cdn-avatars.huggingface.co/v1/production/uploads/651e96991b97c9f33d26bde6/e4VK7uW5sTeCYupD0s_ob.png)
HuggingFaceTB/SmolVLM-500M-Base
Image-Text-to-Text
•
Updated
•
310
•
8
![](https://cdn-avatars.huggingface.co/v1/production/uploads/651e96991b97c9f33d26bde6/e4VK7uW5sTeCYupD0s_ob.png)
HuggingFaceTB/SmolVLM-256M-Base
Image-Text-to-Text
•
Updated
•
1.74k
•
8
datasets
35
HuggingFaceTB/smoltalk
Viewer
•
Updated
•
2.2M
•
7.92k
•
298
HuggingFaceTB/smol-smoltalk
Viewer
•
Updated
•
485k
•
712
•
28
HuggingFaceTB/finemath
Viewer
•
Updated
•
48.3M
•
19k
•
279
HuggingFaceTB/everyday-conversations-llama3.1-2k
Viewer
•
Updated
•
2.38k
•
484
•
93
HuggingFaceTB/MagPie-Pro-300k-MT
Viewer
•
Updated
•
300k
•
48
HuggingFaceTB/finemath_contamination_report
Viewer
•
Updated
•
5.33k
•
73
•
1
HuggingFaceTB/math_tasks
Viewer
•
Updated
•
21.3k
•
345
•
1
HuggingFaceTB/MATH
Updated
•
112
•
4
HuggingFaceTB/smollm-corpus
Viewer
•
Updated
•
237M
•
10.4k
•
294
HuggingFaceTB/instruct-data-basics-smollm-H4
Viewer
•
Updated
•
767
•
110