Yusu Qian's picture

2 7

Yusu Qian

YusuQian

·

AI & ML interests

multimodal llm research

Recent Activity

upvoted a paper about 1 month ago

PRISM-Bench: A Benchmark of Puzzle-Based Visual Tasks with CoT Error Detection

upvoted a paper about 2 months ago

Pico-Banana-400K: A Large-Scale Dataset for Text-Guided Image Editing

upvoted a paper about 2 months ago

OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM

View all activity

Organizations

commented a paper 7 months ago

GIE-Bench: Towards Grounded Evaluation for Text-Guided Image Editing

Paper • 2505.11493 • Published May 16 • 3 •

commented a paper about 1 year ago

How Easy is It to Fool Your Multimodal LLMs? An Empirical Analysis on Deceptive Prompts

Paper • 2402.13220 • Published Feb 20, 2024 • 15 •