Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation.
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1653062669432-60741a2e69a66931a0273f0c.png)
HuggingFaceM4
Enterprise
company
AI & ML interests
None defined yet.
Recent Activity
View all activity
Organization Card
HuggingFaceM4 is the multimodal team at Hugging Face, working on vision-language models.
Within this organization on the Hugging Face hub, you can access the Idefics models (version 1 IDEFICS, version 2 Idefics2, version 3 Idefics3), datasets used for the training like OBELICS, WebSight, The Cauldron or Docmatix, and interactive tools to visualize the results.
Collections
4
WebSight is a dataset of 823,000 HTML/CSS codes representing synthetically generated English websites, each accompanied by a corresponding screenshot.
-
HuggingFaceM4/WebSight
Viewer • Updated • 2.75M • 13.5k • 343 -
HuggingFaceM4/VLM_WebSight_finetuned
Text Generation • Updated • 13.4k • 180 -
869
Screenshot to HTML
⚡Convert screenshots to HTML code
-
Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset
Paper • 2403.09029 • Published • 55
spaces
14
pinned
Runtime error
377
IDEFICS Playground
🐨
Running
99
Idefics3
📊
Generate text based on an image and prompt
Running
on
Zero
15
Florence 2
📉
Generate answers from images and questions
Running
on
Zero
869
Screenshot to HTML
⚡
Convert screenshots to HTML code
Runtime error
168
IDEFICS2 Playground
🐨
Running
145
Idefics 8b
🐠
Generate text from images and prompts
models
34
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1653062669432-60741a2e69a66931a0273f0c.png)
HuggingFaceM4/Idefics3-8B-Llama3
Image-Text-to-Text
•
Updated
•
48.6k
•
264
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1653062669432-60741a2e69a66931a0273f0c.png)
HuggingFaceM4/Florence-2-DocVQA
Image-Text-to-Text
•
Updated
•
2.56k
•
57
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1653062669432-60741a2e69a66931a0273f0c.png)
HuggingFaceM4/idefics2-8b
Image-Text-to-Text
•
Updated
•
23.5k
•
602
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1653062669432-60741a2e69a66931a0273f0c.png)
HuggingFaceM4/idefics2-8b-base
Image-Text-to-Text
•
Updated
•
986
•
27
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1653062669432-60741a2e69a66931a0273f0c.png)
HuggingFaceM4/idefics2-8b-chatty
Image-Text-to-Text
•
Updated
•
834
•
92
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1653062669432-60741a2e69a66931a0273f0c.png)
HuggingFaceM4/siglip-so400m-14-364-flash-attn2-navit
Zero-Shot Image Classification
•
Updated
•
6
•
1
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1653062669432-60741a2e69a66931a0273f0c.png)
HuggingFaceM4/siglip-so400m-14-700-flash-attn2-navit
Zero-Shot Image Classification
•
Updated
•
111
•
2
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1653062669432-60741a2e69a66931a0273f0c.png)
HuggingFaceM4/siglip-so400m-14-384-flash-attn2-navit
Zero-Shot Image Classification
•
Updated
•
160
•
1
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1653062669432-60741a2e69a66931a0273f0c.png)
HuggingFaceM4/idefics2-8b-chatty-AWQ
Image-Text-to-Text
•
Updated
•
67
•
5
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1653062669432-60741a2e69a66931a0273f0c.png)
HuggingFaceM4/idefics2-8b-AWQ
Image-Text-to-Text
•
Updated
•
139
•
26
datasets
78
HuggingFaceM4/Caltech-101
Updated
•
486
•
2
HuggingFaceM4/Docmatix
Viewer
•
Updated
•
2.55M
•
12k
•
246
HuggingFaceM4/the_cauldron
Viewer
•
Updated
•
1.88M
•
76.2k
•
366
HuggingFaceM4/FairFace
Viewer
•
Updated
•
195k
•
819
•
10
HuggingFaceM4/MMBench
Viewer
•
Updated
•
11k
•
77
•
1
HuggingFaceM4/WebSight
Viewer
•
Updated
•
2.75M
•
13.5k
•
343
HuggingFaceM4/debug_MMMU_mcq_to_remove
Viewer
•
Updated
•
10.9k
•
100
HuggingFaceM4/debug_MMMU_open_ended_to_remove
Viewer
•
Updated
•
689
•
62
HuggingFaceM4/debug_MathVista_mcq_to_remove
Viewer
•
Updated
•
3.39k
•
47
HuggingFaceM4/debug_MathVista_open_ended_to_remove
Viewer
•
Updated
•
2.75k
•
50