Update README.md
Browse files
README.md
CHANGED
@@ -9,11 +9,12 @@ datasets:
|
|
9 |
# Model Card for SpaceLLaVA
|
10 |
|
11 |
**SpaceLlama3.1** uses [llama3.1-8B](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B) as the llm backbone along with the fused DINOv2+SigLIP features of [prismatic-vlms](https://github.com/TRI-ML/prismatic-vlms).
|
12 |
-
Uses a full fine-tune on the [spacellava dataset](https://huggingface.co/datasets/remyxai/vqasynth_spacellava) designed with [VQASynth](https://github.com/remyxai/VQASynth/tree/main) to enhance spatial reasoning as in [SpatialVLM](https://spatial-vlm.github.io/).
|
13 |
|
14 |
|
15 |
## Model Details
|
16 |
|
|
|
|
|
17 |
### Model Description
|
18 |
|
19 |
This model uses data synthesis techniques and publically available models to reproduce the work described in SpatialVLM to enhance the spatial reasoning of multimodal models.
|
|
|
9 |
# Model Card for SpaceLLaVA
|
10 |
|
11 |
**SpaceLlama3.1** uses [llama3.1-8B](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B) as the llm backbone along with the fused DINOv2+SigLIP features of [prismatic-vlms](https://github.com/TRI-ML/prismatic-vlms).
|
|
|
12 |
|
13 |
|
14 |
## Model Details
|
15 |
|
16 |
+
Uses a full fine-tune on the [spacellava dataset](https://huggingface.co/datasets/remyxai/vqasynth_spacellava) designed with [VQASynth](https://github.com/remyxai/VQASynth/tree/main) to enhance spatial reasoning as in [SpatialVLM](https://spatial-vlm.github.io/).
|
17 |
+
|
18 |
### Model Description
|
19 |
|
20 |
This model uses data synthesis techniques and publically available models to reproduce the work described in SpatialVLM to enhance the spatial reasoning of multimodal models.
|