Benlin Liu

Tim666

https://liubl1217.github.io/

AI & ML interests

None yet

Recent Activity

authored a paper 11 days ago

Structure From Tracking: Distilling Structure-Preserving Motion for Video Generation

upvoted a paper 11 days ago

Structure From Tracking: Distilling Structure-Preserving Motion for Video Generation

authored a paper 9 months ago

LiveVQA: Live Visual Knowledge Seeking

View all activity

Organizations

authored a paper 11 days ago

Structure From Tracking: Distilling Structure-Preserving Motion for Video Generation

Paper • 2512.11792 • Published 14 days ago • 9

upvoted a paper 11 days ago

Structure From Tracking: Distilling Structure-Preserving Motion for Video Generation

Paper • 2512.11792 • Published 14 days ago • 9

authored a paper 9 months ago

LiveVQA: Live Visual Knowledge Seeking

Paper • 2504.05288 • Published Apr 7 • 15

liked a model about 1 year ago

lmms-lab/LLaVA-Video-72B-Qwen2

Text Generation • 73B • Updated Oct 25, 2024 • 12.4k • 20

upvoted a paper about 1 year ago

Interleaved Scene Graph for Interleaved Text-and-Image Generation Assessment

Paper • 2411.17188 • Published Nov 26, 2024 • 20

authored a paper about 1 year ago

Interleaved Scene Graph for Interleaved Text-and-Image Generation Assessment

Paper • 2411.17188 • Published Nov 26, 2024 • 20

liked a dataset about 1 year ago

THUdyh/Oryx-SFT-Data

Preview • Updated Oct 23, 2024 • 823 • 7

authored 4 papers over 1 year ago

Coarse Correspondence Elicit 3D Spacetime Understanding in Multimodal Language Model

Paper • 2408.00754 • Published Aug 1, 2024 • 23

Efficient Inference of Vision Instruction-Following Models with Elastic Cache

Paper • 2407.18121 • Published Jul 25, 2024 • 17

TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering

Paper • 2303.11897 • Published Mar 21, 2023

Unleashing Text-to-Image Diffusion Models for Visual Perception

Paper • 2303.02153 • Published Mar 3, 2023

upvoted 2 papers over 1 year ago

Coarse Correspondence Elicit 3D Spacetime Understanding in Multimodal Language Model

Paper • 2408.00754 • Published Aug 1, 2024 • 23

Efficient Inference of Vision Instruction-Following Models with Elastic Cache

Paper • 2407.18121 • Published Jul 25, 2024 • 17

liked a model almost 2 years ago

stabilityai/stable-video-diffusion-img2vid

Image-to-Video • Updated Jul 10, 2024 • 135k • 1k

Benlin Liu

AI & ML interests

Recent Activity

Organizations

Tim666's activity