
ChengpengLi

AI & ML interests

LLM for Reasoning, reinforcement learning, recommendation system, diffusion models

Recent Activity

upvoted a paper about 2 months ago

Agentic Entropy-Balanced Policy Optimization

upvoted a paper 2 months ago

Quantile Advantage Estimation for Entropy-Safe Reasoning

upvoted a paper 4 months ago

We-Math 2.0: A Versatile MathBook System for Incentivizing Visual Mathematical Reasoning


Organizations

None yet

upvoted a paper about 2 months ago

Agentic Entropy-Balanced Policy Optimization

Paper • 2510.14545 • Published Oct 16 • 104

upvoted a paper 2 months ago

Quantile Advantage Estimation for Entropy-Safe Reasoning

Paper • 2509.22611 • Published Sep 26 • 118

upvoted 2 papers 4 months ago

We-Math 2.0: A Versatile MathBook System for Incentivizing Visual Mathematical Reasoning

Paper • 2508.10433 • Published Aug 14 • 144

Agentic Reinforced Policy Optimization

Paper • 2507.19849 • Published Jul 26 • 158

commented on a paper 6 months ago

CoRT: Code-integrated Reasoning within Thinking

Paper • 2506.09820 • Published Jun 11 • 17

upvoted a paper 7 months ago

Tool-Star: Empowering LLM-Brained Multi-Tool Reasoner via Reinforcement Learning

Paper • 2505.16410 • Published May 22 • 58

authored a paper 9 months ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 113

commented on a paper 9 months ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 113

upvoted a paper 9 months ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 113

published a model 10 months ago

ChengpengLi/START

Updated Feb 21

upvoted 2 papers 11 months ago

Enabling Scalable Oversight via Self-Evolving Critic

Paper • 2501.05727 • Published Jan 10 • 74

The Lessons of Developing Process Reward Models in Mathematical Reasoning

Paper • 2501.07301 • Published Jan 13 • 99

upvoted a paper 12 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 376

liked a Space about 1 year ago

Qwen2.5 Math Demo

🧮

232

Describe and solve math problems from images or sketches

upvoted 2 collections about 1 year ago

Qwen2.5-Math

Collection

Math-specific model series based on Qwen2.5 • 11 items • Updated Jul 21 • 88

Qwen2-Math

Collection

Math-specific model series based on Qwen2 • 8 items • Updated Jul 21 • 52

liked a model over 1 year ago

Qwen/Qwen2-Math-72B

Text Generation • 73B • Updated Aug 8, 2024 • 440 • 30

authored 2 papers over 1 year ago

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15, 2024 • 167

DotaMath: Decomposition of Thought with Code Assistance and Self-correction for Mathematical Reasoning

Paper • 2407.04078 • Published Jul 4, 2024 • 21

upvoted a paper over 1 year ago

Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models

Paper • 2406.13542 • Published Jun 19, 2024 • 17
