Yilong Zhao

ylzhao

https://happierpig.github.io/

happierpig

AI & ML interests

None yet

Recent Activity

upvoted a paper 6 days ago

Quant VideoGen: Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization

upvoted a paper 6 days ago

HySparse: A Hybrid Sparse Attention Architecture with Oracle Token Selection and KV Cache Sharing

upvoted an article about 2 months ago

Who Routes LLM Routers? RouterArena: Building the Evaluation Foundation for LLM Routing


upvoted 2 papers 6 days ago

Quant VideoGen: Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization

Paper • 2602.02958 • Published 8 days ago • 32

HySparse: A Hybrid Sparse Attention Architecture with Oracle Token Selection and KV Cache Sharing

Paper • 2602.03560 • Published 8 days ago • 41

upvoted an article about 2 months ago

Article

Who Routes LLM Routers? RouterArena: Building the Evaluation Foundation for LLM Routing

Nov 11, 2025


upvoted a paper 2 months ago

VLASH: Real-Time VLAs via Future-State-Aware Asynchronous Inference

Paper • 2512.01031 • Published Nov 30, 2025 • 25

upvoted a paper 4 months ago

Self-Forcing++: Towards Minute-Scale High-Quality Video Generation

Paper • 2510.02283 • Published Oct 2, 2025 • 96

upvoted 6 papers 8 months ago

Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights

Paper • 2506.16406 • Published Jun 19, 2025 • 130

Multiverse: Your Language Models Secretly Decide How to Parallelize and Merge Generation

Paper • 2506.09991 • Published Jun 11, 2025 • 55

Seedance 1.0: Exploring the Boundaries of Video Generation Models

Paper • 2506.09113 • Published Jun 10, 2025 • 105

RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics

Paper • 2506.04308 • Published Jun 4, 2025 • 43

REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Paper • 2505.24760 • Published May 30, 2025 • 74

AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time

Paper • 2505.24863 • Published May 30, 2025 • 97

upvoted a paper 9 months ago

Sparse VideoGen2: Accelerate Video Generation with Sparse Attention via Semantic-Aware Permutation

Paper • 2505.18875 • Published May 24, 2025 • 42

upvoted an article 10 months ago

Article

PipelineRL

Apr 25, 2025


upvoted 3 papers 12 months ago

S*: Test Time Scaling for Code Generation

Paper • 2502.14382 • Published Feb 20, 2025 • 63

The Danger of Overthinking: Examining the Reasoning-Action Dilemma in Agentic Tasks

Paper • 2502.08235 • Published Feb 12, 2025 • 59

Efficient-vDiT: Efficient Video Diffusion Transformers With Attention Tile

Paper • 2502.06155 • Published Feb 10, 2025 • 10

upvoted an article about 1 year ago

Article

Process Reinforcement through Implicit Rewards

Jan 3, 2025


upvoted a paper about 1 year ago

Efficiently Serving LLM Reasoning Programs with Certaindex

Paper • 2412.20993 • Published Dec 30, 2024 • 36

upvoted 2 papers over 1 year ago

MIO: A Foundation Model on Multimodal Tokens

Paper • 2409.17692 • Published Sep 26, 2024 • 53

Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference

Paper • 2406.10774 • Published Jun 16, 2024 • 4
