Update README.md

README.md (changed)
@@ -8,7 +8,8 @@ datasets:
 
 # Model Card for SpaceLLaVA
 
-**SpaceLlama3.1** uses [llama3.1-8B](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B) as the llm backbone along with the fused DINOv2+SigLIP features of [prismatic-vlms](https://github.com/TRI-ML/prismatic-vlms)
+**SpaceLlama3.1** uses [llama3.1-8B](https://huggingface.co/meta-llama/Meta-Llama-3.1-8B) as the llm backbone along with the fused DINOv2+SigLIP features of [prismatic-vlms](https://github.com/TRI-ML/prismatic-vlms).
+Uses a full fine-tune on the [spacellava dataset](https://huggingface.co/datasets/remyxai/vqasynth_spacellava) designed with [VQASynth](https://github.com/remyxai/VQASynth/tree/main) to enhance spatial reasoning as in [SpatialVLM](https://spatial-vlm.github.io/).
 
 
 ## Model Details
@@ -21,7 +22,6 @@ With a pipeline of expert models, we can infer spatial relationships between obj
 
 - **Developed by:** remyx.ai
 - **Model type:** MultiModal Model, Vision Language Model, Prismatic-vlms, Llama 3.1
-- **License:** Apache-2.0
 - **Finetuned from model:** Llama 3.1
 
 ### Model Sources
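A note on the architecture named in the updated description: SpaceLlama3.1 pairs a Llama 3.1 backbone with fused DINOv2+SigLIP patch features via prismatic-vlms. The sketch below is a rough illustration of that fusion idea only, not the prismatic-vlms code; the class name is invented, and the dimensions (DINOv2-L 1024, SigLIP-SO400M 1152, Llama-3.1-8B hidden size 4096) are assumptions chosen for the example.

```python
import torch
import torch.nn as nn

class FusedVisionProjector(nn.Module):
    """Toy sketch (not prismatic-vlms): concatenate patch features from two
    vision encoders (e.g. DINOv2 and SigLIP) and project to the LLM width."""

    def __init__(self, dino_dim: int = 1024, siglip_dim: int = 1152, llm_dim: int = 4096):
        super().__init__()
        self.proj = nn.Linear(dino_dim + siglip_dim, llm_dim)

    def forward(self, dino_feats: torch.Tensor, siglip_feats: torch.Tensor) -> torch.Tensor:
        # Both inputs are (batch, num_patches, dim); fuse along the channel axis.
        fused = torch.cat([dino_feats, siglip_feats], dim=-1)
        return self.proj(fused)

# Dummy patch features for a 16x16 grid of image patches.
projector = FusedVisionProjector()
tokens = projector(torch.randn(1, 256, 1024), torch.randn(1, 256, 1152))
print(tokens.shape)  # torch.Size([1, 256, 4096]) -> visual tokens fed to the LLM
```

The exact fusion and projector in prismatic-vlms may differ; the point is only that both encoders see the image and their features are joined before being mapped into the language model's token space.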
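The added sentence also links the VQASynth-built spacellava dataset used for the full fine-tune. A minimal sketch for pulling it with the Hugging Face `datasets` library, assuming only the dataset ID from the link above (the split name and schema are assumptions to check against the dataset card):

```python
from datasets import load_dataset

# Dataset ID taken from the link in the updated README; the "train" split is an assumption.
ds = load_dataset("remyxai/vqasynth_spacellava", split="train")

# Inspect the schema and one record; the exact columns are whatever VQASynth emitted.
print(ds.column_names)
print(ds[0])
```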