zhengliang

mangopy

AI & ML interests

None yet

Organizations

None yet

Recent Activity

liked a dataset 9 days ago

mangopy/ToolRet-before-sample

Viewer • Updated Mar 1, 2025 • 62.8k • 370 • 3

upvoted a paper 29 days ago

Deep Research: A Systematic Survey

Paper • 2512.02038 • Published Nov 24, 2025 • 65

upvoted a paper about 2 months ago

Too Good to be Bad: On the Failure of LLMs to Role-Play Villains

Paper • 2511.04962 • Published Nov 7, 2025 • 53

upvoted a paper 2 months ago

Annotation-Efficient Universal Honesty Alignment

Paper • 2510.17509 • Published Oct 20, 2025 • 21

updated a model 3 months ago

mangopy/OpenReward-Qwen2.5-3B-instruct-half-correct-half-wrong-84-step

3B • Updated Sep 25, 2025 • 7

published a model 3 months ago

mangopy/OpenReward-Qwen2.5-3B-instruct-half-correct-half-wrong-84-step

3B • Updated Sep 25, 2025 • 7

updated a model 3 months ago

mangopy/OpenReward-Qwen2.5-7B-instruct-only-em-96-step

8B • Updated Sep 23, 2025 • 8

published a model 3 months ago

mangopy/OpenReward-Qwen2.5-7B-instruct-only-em-96-step

8B • Updated Sep 23, 2025 • 8

updated a model 3 months ago

mangopy/OpenReward-Qwen2.5-7B-instruct-half-correct-half-wrong-84-step

8B • Updated Sep 22, 2025 • 8

published a model 3 months ago

mangopy/OpenReward-Qwen2.5-7B-instruct-half-correct-half-wrong-84-step

8B • Updated Sep 22, 2025 • 8

updated a model 3 months ago

mangopy/OpenReward-Qwen2.5-7B-instruct-one-search-96-step

8B • Updated Sep 21, 2025 • 5

published 2 models 3 months ago

mangopy/OpenReward-Qwen2.5-7B-instruct-one-search-96-step

8B • Updated Sep 21, 2025 • 5

mangopy/OpenReward-Qwen2.5-7B-instruct-one-search-96

Updated Sep 21, 2025

updated a model 3 months ago

mangopy/OpenRM-Qwen2.5-7B-instruct

8B • Updated Sep 19, 2025 • 5

published a model 3 months ago

mangopy/OpenRM-Qwen2.5-7B-instruct

8B • Updated Sep 19, 2025 • 5

updated a model 3 months ago

mangopy/OpenReward-Qwen2.5-7B-instruct-96-test

8B • Updated Sep 19, 2025 • 3

published a model 3 months ago

mangopy/OpenReward-Qwen2.5-7B-instruct-96-test

8B • Updated Sep 19, 2025 • 3

updated a model 3 months ago

mangopy/OpenReward-Qwen2.5-7B-instruct-96

8B • Updated Sep 19, 2025 • 6

published a model 3 months ago

mangopy/OpenReward-Qwen2.5-7B-instruct-96

8B • Updated Sep 19, 2025 • 6

upvoted a paper 6 months ago

RLVER: Reinforcement Learning with Verifiable Emotion Rewards for Empathetic Agents

Paper • 2507.03112 • Published Jul 3, 2025 • 32
