3D Aware Region Prompted Vision Language Model
  a8cheng/sr3d-nvila-8b-multiview-scans Updated Feb 17 • 7
  a8cheng/sr3d-nvila-8b-multiview-videos Updated Feb 17 • 3 • 1
  a8cheng/sr3d-nvila-8b-singleview-pretrain Updated Feb 17 • 19 • 1
  a8cheng/SR-3D-Bench Viewer • Updated Feb 19 • 2.48k • 70 • 1
SpatialRGPT: Grounded Spatial Reasoning in VLMs
  a8cheng/SpatialRGPT-VILA1.5-8B Updated Oct 6, 2024 • 17 • 7
  a8cheng/OpenSpatialDataset Updated Oct 3, 2024 • 67 • 13
  a8cheng/SpatialRGPT-Bench Viewer • Updated May 5, 2025 • 1.41k • 700 • 12
NaVILA: Legged Robot Vision-Language-Action Model for Naviga
  a8cheng/navila-llama3-8b-8f Updated Mar 11, 2025 • 538 • 7
  a8cheng/navila-qwen2-7b-64k-64f Updated Mar 11, 2025 • 39 • 2
  a8cheng/navila-siglip-llama3-8b-v1.5-pretrain Updated Jul 6, 2025 • 135 • 2
  a8cheng/NaVILA-Dataset Updated Jul 6, 2025 • 300 • 10