arxiv:2606.09426
Yif Yang
Yif29
AI & ML interests
None yet
Recent Activity
updated a Space 2 days ago
microsoft/AVGen-Bench-Leaderboard
authored a paper 13 days ago
WeaveBench: A Long-Horizon, Real-World Benchmark for Computer-Use Agents with Hybrid Interfaces