XVLA Collection X-VLA is a soft-prompted Transformer for cross-embodiment robot learning • 6 items • Updated 6 days ago • 9
X-VLA: Soft-Prompted Transformer as Scalable Cross-Embodiment Vision-Language-Action Model Paper • 2510.10274 • Published Oct 11 • 14
MM-Instruct: Generated Visual Instructions for Large Multimodal Model Alignment Paper • 2406.19736 • Published Jun 28, 2024 • 3