Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
41
223
53
KABI
dongguanting
Follow
MurrayTom's profile picture
AndroidGuy's profile picture
melttree's profile picture
68 followers
·
106 following
https://dongguanting.github.io/
kakakbibibi
dongguanting
AI & ML interests
Reasoning and Alignment for Large Language Models
Recent Activity
upvoted
a
paper
2 days ago
RAGEN-2: Reasoning Collapse in Agentic RL
upvoted
a
paper
8 days ago
SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization
commented
on
a paper
8 days ago
The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook
View all activity
Organizations
dongguanting
's datasets
11
Sort: Recently updated
dongguanting/ARPO-RL-DeepSearch-1K
Viewer
•
Updated
Oct 17, 2025
•
1.07k
•
60
•
6
dongguanting/ARPO-RL-Reasoning-10K
Viewer
•
Updated
Oct 17, 2025
•
10k
•
116
•
4
dongguanting/ARPO-SFT-54K
Viewer
•
Updated
Oct 17, 2025
•
54.6k
•
140
•
15
dongguanting/RAG-Error-Critic-100K
Viewer
•
Updated
Jun 28, 2025
•
100k
•
31
•
3
dongguanting/Tool-Star-SFT-54K
Viewer
•
Updated
May 29, 2025
•
54k
•
262
•
10
dongguanting/Multi-Tool-RL-10K
Viewer
•
Updated
May 25, 2025
•
10k
•
87
•
5
dongguanting/RAG-QA-40K
Viewer
•
Updated
Dec 27, 2024
•
32.8k
•
24
•
2
dongguanting/ShareGPT-12K
Viewer
•
Updated
Dec 27, 2024
•
12.9k
•
71
•
1
dongguanting/VIF-RAG-QA-110K
Viewer
•
Updated
Dec 27, 2024
•
111k
•
44
•
7
dongguanting/DotamathQA
Viewer
•
Updated
Dec 26, 2024
•
574k
•
50
•
2
dongguanting/VIF-RAG-QA-20K
Viewer
•
Updated
Nov 1, 2024
•
20k
•
6
•
4