One Model to Critique Them All: Rewarding Agentic Tool-Use via Efficient Reasoning
Renhao Li
RioLee
·
AI & ML interests
None yet
Recent Activity
upvoted
a
collection
about 1 month ago
ToolRM
updated
a collection
about 1 month ago
ToolRM
authored
a paper
about 1 month ago
CoEvol: Constructing Better Responses for Instruction Finetuning through
Multi-Agent Cooperation