Shaobai Jiang
shaobaij
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 1 hour ago
Data-Efficient RLVR via Off-Policy Influence Guidance
upvoted
a
paper
about 1 hour ago
Beyond Reasoning Gains: Mitigating General Capabilities Forgetting in
Large Reasoning Models
upvoted
a
paper
about 1 hour ago
Tongyi DeepResearch Technical Report
Organizations
None yet