linshaohui

kimlin123

authored a paper 11 months ago

Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models

Paper • 2503.06749 • Published Mar 9, 2025 • 31

authored a paper over 1 year ago

Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

Paper • 2405.21075 • Published May 31, 2024 • 26