5 12 14

Boyuan Zheng

boyuanzheng010

https://boyuanzheng010.github.io/

AI & ML interests

Language Agents, Multilinguality

Recent Activity

upvoted a paper 5 months ago

Agent Learning via Early Experience

upvoted a paper 5 months ago

The Alignment Waltz: Jointly Training Agents to Collaborate for Safety

upvoted a paper 6 months ago

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

View all activity

Organizations

upvoted 2 papers 5 months ago

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9, 2025 • 273

The Alignment Waltz: Jointly Training Agents to Collaborate for Safety

Paper • 2510.08240 • Published Oct 9, 2025 • 41

upvoted a paper 6 months ago

LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model

Paper • 2509.00676 • Published Aug 31, 2025 • 85

updated a dataset 7 months ago

osunlp/WebGuard

Viewer • Updated Jul 28, 2025 • 6k • 90 • 2

published a dataset 7 months ago

osunlp/WebGuard

Viewer • Updated Jul 28, 2025 • 6k • 90 • 2

updated a dataset 7 months ago

boyuanzheng010/webguard_test

Viewer • Updated Jul 24, 2025 • 6.49k • 72

published a dataset 7 months ago

boyuanzheng010/webguard_test

Viewer • Updated Jul 24, 2025 • 6.49k • 72

upvoted a paper 8 months ago

Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge

Paper • 2506.21506 • Published Jun 26, 2025 • 52

updated a dataset 10 months ago

boyuanzheng010/webguard

Viewer • Updated May 16, 2025 • 6.49k • 7 • 1

published a dataset 10 months ago

boyuanzheng010/webguard

Viewer • Updated May 16, 2025 • 6.49k • 7 • 1

upvoted a paper 11 months ago

AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories

Paper • 2504.08942 • Published Apr 11, 2025 • 28

liked a Space 11 months ago

Agent Reward Bench Demo

💻

Explore agent trajectories and judgments in web benchmarks

upvoted a paper 11 months ago

SkillWeaver: Web Agents can Self-Improve by Discovering and Honing Skills

Paper • 2504.07079 • Published Apr 9, 2025 • 12

commented a paper 11 months ago

SkillWeaver: Web Agents can Self-Improve by Discovering and Honing Skills

Paper • 2504.07079 • Published Apr 9, 2025 • 12 •

published a model 11 months ago

boyuanzheng010/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

Updated Apr 6, 2025

updated a model 11 months ago

boyuanzheng010/Qwen2.5-1.5B-Open-R1-Distill

Text Generation • 2B • Updated Apr 2, 2025 • 1

published a model 11 months ago

boyuanzheng010/Qwen2.5-1.5B-Open-R1-Distill

Text Generation • 2B • Updated Apr 2, 2025 • 1

liked a Space 12 months ago

Online-Mind2Web Leaderboard

🌐

Visualize AI agent performance with tables and interactive plots

upvoted an article 12 months ago

Article

Open R1: Update #3

Mar 11, 2025

•

297