Haoran Wei's picture

Haoran Wei

HaoranWei

·

AI & ML interests

LLM，CV，OVOD

Recent Activity

upvoted a paper 11 days ago

MemOCR: Layout-Aware Visual Memory for Efficient Long-Horizon Reasoning

upvoted a paper 11 days ago

CodeOCR: On the Effectiveness of Vision Language Models in Code Understanding

upvoted a paper 11 days ago

Length-Unbiased Sequence Policy Optimization: Revealing and Controlling Response Length Variation in RLVR

View all activity

Organizations

upvoted 3 papers 11 days ago

MemOCR: Layout-Aware Visual Memory for Efficient Long-Horizon Reasoning

Paper • 2601.21468 • Published 19 days ago • 21

CodeOCR: On the Effectiveness of Vision Language Models in Code Understanding

Paper • 2602.01785 • Published 15 days ago • 93

Length-Unbiased Sequence Policy Optimization: Revealing and Controlling Response Length Variation in RLVR

Paper • 2602.05261 • Published 12 days ago • 48

upvoted a paper 12 days ago

ReGuLaR: Variational Latent Reasoning Guided by Rendered Chain-of-Thought

Paper • 2601.23184 • Published 18 days ago • 35

upvoted a collection 15 days ago

DeepSeek-OCR

2 items • Updated 15 days ago • 13

upvoted a paper 19 days ago

DeepSeek-OCR 2: Visual Causal Flow

Paper • 2601.20552 • Published 20 days ago • 60

upvoted 3 papers about 1 month ago

STEP3-VL-10B Technical Report

Paper • 2601.09668 • Published Jan 14 • 193

BabyVision: Visual Reasoning Beyond Language

Paper • 2601.06521 • Published Jan 10 • 196

AgentOCR: Reimagining Agent History via Optical Self-Compression

Paper • 2601.04786 • Published Jan 8 • 29

upvoted a paper 3 months ago

DeepSeek-V3.2: Pushing the Frontier of Open Large Language Models

Paper • 2512.02556 • Published Dec 2, 2025 • 256

upvoted a paper 4 months ago

DeepSeek-OCR: Contexts Optical Compression

Paper • 2510.18234 • Published Oct 21, 2025 • 92

upvoted a paper 6 months ago

NextStep-1: Toward Autoregressive Image Generation with Continuous Tokens at Scale

Paper • 2508.10711 • Published Aug 14, 2025 • 145

upvoted a collection 6 months ago

NextStep-1

11 items • Updated 1 day ago • 34

upvoted a collection 7 months ago

Step3

2 items • Updated Jul 31, 2025 • 21

upvoted a paper about 1 year ago

Slow Perception: Let's Perceive Geometric Figures Step-by-step

Paper • 2412.20631 • Published Dec 30, 2024 • 15

upvoted a collection about 1 year ago

Document AI

All the papers that can fundementally help in creating a true open-source processing pipeline. • 1 item • Updated Nov 11, 2024 • 1

upvoted a paper about 1 year ago

Focus Anywhere for Fine-grained Multi-page Document Understanding

Paper • 2405.14295 • Published May 23, 2024 • 1

upvoted a collection about 1 year ago

PixMo

A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 10 items • Updated Dec 23, 2025 • 86

upvoted 2 papers over 1 year ago

General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Paper • 2409.01704 • Published Sep 3, 2024 • 83

DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation

Paper • 2406.16855 • Published Jun 24, 2024 • 57