qiangwei

qiangwei97

AI & ML interests

None yet

Organizations

None yet

upvoted 6 papers 2 months ago

Benefits and Pitfalls of Reinforcement Learning for Language Model Planning: A Theoretical Perspective

Paper • 2509.22613 • Published Sep 26 • 9

DeepScientist: Advancing Frontier-Pushing Scientific Findings Progressively

Paper • 2509.26603 • Published Sep 30 • 16

Learning Human-Perceived Fakeness in AI-Generated Videos via Multimodal LLMs

Paper • 2509.22646 • Published Sep 26 • 16

Learning to See Before Seeing: Demystifying LLM Visual Priors from Language Pre-training

Paper • 2509.26625 • Published Sep 30 • 43

Thinking-Free Policy Initialization Makes Distilled Reasoning Models More Effective and Efficient Reasoners

Paper • 2509.26226 • Published Sep 30 • 32

Winning the Pruning Gamble: A Unified Approach to Joint Sample and Token Pruning for Efficient Supervised Fine-Tuning

Paper • 2509.23873 • Published Sep 28 • 67