QEVA: A Reference-Free Evaluation Metric for Narrative Video Summarization with Multimodal Question Answering Paper • 2604.24052 • Published 20 days ago
Visual Funnel: Resolving Contextual Blindness in Multimodal Large Language Models Paper • 2512.10362 • Published Dec 11, 2025