nvidia/Nemotron-Research-Reasoning-Qwen-1.5B Text Generation • 2B • Updated 17 days ago • 5.15k • 235
Descriptive Caption Enhancement with Visual Specialists for Multimodal Perception Paper • 2412.14233 • Published Dec 18, 2024 • 6