Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence
Paper: arXiv:2505.23747
This repository contains the model described in the paper "Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence".
Project page: https://diankun-wu.github.io/Spatial-MLLM/
Base model: Qwen/Qwen2.5-VL-3B-Instruct