Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
Together AI
SambaNova
fal
Replicate
HF Inference API
Misc
Reset Misc
multimodal
Inference Endpoints
text-generation-inference
AutoTrain Compatible
custom_code
4-bit precision
Eval Results
Merge
8-bit precision
Mixture of Experts
Misc with no match
text-embeddings-inference
Carbon Emissions
Apply filters
Models
537
Full-text search
Edit filters
Sort: Trending
Active filters:
multimodal
Clear all
unsloth/Qwen2-VL-72B-Instruct-bnb-4bit
Image-Text-to-Text
•
Updated
Nov 22, 2024
•
767
•
3
unsloth/llava-1.5-7b-hf
Image-Text-to-Text
•
Updated
Nov 22, 2024
•
40
•
1
unsloth/llava-v1.6-mistral-7b-hf
Image-Text-to-Text
•
Updated
Nov 21, 2024
•
196
•
1
NCSOFT/VARCO-VISION-14B
Image-Text-to-Text
•
Updated
Dec 31, 2024
•
603
•
22
NCSOFT/VARCO-VISION-14B-HF
Image-Text-to-Text
•
Updated
Dec 31, 2024
•
1.41k
•
22
CogACT/CogACT-Base
Robotics
•
Updated
Dec 4, 2024
•
3.61k
•
6
Flex-Data/bm-v1
Audio-Text-to-Text
•
Updated
Dec 4, 2024
•
2
unsloth/Qwen2-VL-2B-Instruct-unsloth-bnb-4bit
Image-Text-to-Text
•
Updated
Dec 4, 2024
•
18.8k
•
5
unsloth/Qwen2-VL-7B-Instruct-unsloth-bnb-4bit
Image-Text-to-Text
•
Updated
Dec 4, 2024
•
43k
•
9
CogACT/CogACT-Large
Robotics
•
Updated
Dec 4, 2024
•
973
•
1
rhymes-ai/Aria-Base-64K
Image-Text-to-Text
•
Updated
Dec 1, 2024
•
1.22k
•
12
rhymes-ai/Aria-Chat
Image-Text-to-Text
•
Updated
Dec 15, 2024
•
121
•
10
Qwen/Qwen2-VL-72B
Image-Text-to-Text
•
Updated
Dec 6, 2024
•
2.27k
•
71
unsloth/Pixtral-12B-2409-unsloth-bnb-4bit
Image-Text-to-Text
•
Updated
Dec 4, 2024
•
3.85k
•
5
unsloth/Llama-3.2-11B-Vision-unsloth-bnb-4bit
Image-Text-to-Text
•
Updated
Dec 4, 2024
•
1.08k
•
3
AI-Safeguard/Ivy-VL-llava
Visual Question Answering
•
Updated
Dec 31, 2024
•
1.12k
•
59
bartowski/Qwen2-VL-2B-Instruct-GGUF
Image-Text-to-Text
•
Updated
Dec 17, 2024
•
6.85k
•
19
lmstudio-community/Qwen2-VL-7B-Instruct-GGUF
Image-Text-to-Text
•
Updated
Jan 6
•
11.7k
•
2
bartowski/Qwen2-VL-7B-Instruct-GGUF
Image-Text-to-Text
•
Updated
Dec 17, 2024
•
24k
•
33
bartowski/Qwen2-VL-72B-Instruct-GGUF
Image-Text-to-Text
•
Updated
Dec 18, 2024
•
4.61k
•
10
second-state/Qwen2-VL-7B-Instruct-GGUF
Image-Text-to-Text
•
Updated
Jan 11
•
466
•
2
mradermacher/Qwen2-VL-72B-Instruct-abliterated-i1-GGUF
Updated
Dec 15, 2024
•
225
•
1
GoodiesHere/Apollo-LMMs-Apollo-1_5B-t32
Video-Text-to-Text
•
Updated
Dec 18, 2024
•
197
•
9
GoodiesHere/Apollo-LMMs-Apollo-3B-t32
Text Generation
•
Updated
Dec 18, 2024
•
199
•
17
GoodiesHere/Apollo-LMMs-Apollo-7B-t32
Video-Text-to-Text
•
Updated
Dec 18, 2024
•
417
•
50
prithivMLmods/Qwen2-VL-Ocrtest-2B-Instruct
Image-Text-to-Text
•
Updated
Dec 21, 2024
•
501
•
12
nvidia/NVLM-D-72B-mcore
Image-Text-to-Text
•
Updated
28 days ago
•
7
mradermacher/UGround-V1-7B-GGUF
Updated
Jan 4
•
56
•
1
osunlp/UGround-V1-72B-Preview
Image-Text-to-Text
•
Updated
about 1 month ago
•
195
•
2
nintwentydo/Razorback-12B-v0.1
Image-Text-to-Text
•
Updated
Jan 10
•
16
•
2
Previous
1
2
3
4
5
6
...
18
Next