Ximing Lu

Ximing

Recent Activity

authored a paper 6 days ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

upvoted a paper 6 days ago

GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization

Paper • 2601.05242 • Published 7 days ago • 179

upvoted a paper about 1 month ago

ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration

Paper • 2511.21689 • Published Nov 26, 2025 • 116

upvoted an article 3 months ago

Can Your LLM Think Like a Professional? Introducing ProfBench

Article • Oct 28, 2025 • 18

upvoted 2 papers 3 months ago

BroRL: Scaling Reinforcement Learning via Broadened Exploration

Paper • 2510.01180 • Published Oct 1, 2025 • 18

DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

Paper • 2509.25454 • Published Sep 29, 2025 • 141

upvoted a paper 6 months ago

The Invisible Leash: Why RLVR May Not Escape Its Origin

Paper • 2507.14843 • Published Jul 20, 2025 • 85

upvoted a paper 8 months ago

ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models

Paper • 2505.24864 • Published May 30, 2025 • 143