One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos
Zechen Bai 1 Tong He 2 Haiyang Mei 1 Pichao Wang 2 Ziteng Gao 1 Joya Chen 1 Lei Liu 2 Zheng Zhang 2 Mike Zheng Shou 1
NeurIPS 2024
1 Show Lab, National University of Singapore 2 Amazon
Please find the code at: https://github.com/showlab/VideoLISA
- Downloads last month
- 367
Inference Providers
NEW
This model is not currently available via any of the supported third-party Inference Providers, and
HF Inference API was unable to determine this model's library.
Model tree for ZechenBai/VideoLISA-3.8B
Base model
MBZUAI/LLaVA-Phi-3-mini-4k-instruct