Wings: Learning Multimodal LLMs without Text-only Forgetting Paper β’ 2406.03496 β’ Published Jun 5, 2024
LPO: Towards Accurate GUI Agent Interaction via Location Preference Optimization Paper β’ 2506.09373 β’ Published Jun 11
Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities Paper β’ 2505.02567 β’ Published May 5 β’ 80