YSH

BestWishYsh

https://shyuanbest.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a collection 3 days ago

Cosmos3

Collection

Omnimodal World Models for Physical AI • 15 items • Updated about 2 hours ago • 70

upvoted an article 3 days ago

Article

Welcome NVIDIA Cosmos 3: The First Open Omni-model for Physical AI Reasoning and Action

nvidia • 3 days ago • 64

upvoted a paper 6 days ago

Qwen-VLA: Unifying Vision-Language-Action Modeling across Tasks, Environments, and Robot Embodiments

Paper • 2605.30280 • Published 7 days ago • 136

upvoted a paper 7 days ago

OSP-Next: Efficient High-Quality Video Generation with Sparse Sequence Parallelism, HiF8 Quantization, and Reinforcement Learning

Paper • 2605.28691 • Published 8 days ago • 24

upvoted 2 papers 20 days ago

Causal Forcing++: Scalable Few-Step Autoregressive Diffusion Distillation for Real-Time Interactive Video Generation

Paper • 2605.15141 • Published 21 days ago • 93

FASTER: Rethinking Real-Time Flow VLAs

Paper • 2603.19199 • Published Mar 19 • 59

upvoted a paper 21 days ago

Continuous Latent Diffusion Language Model

Paper • 2605.06548 • Published 28 days ago • 80

upvoted a paper 24 days ago

HumanNet: Scaling Human-centric Video Learning to One Million Hours

Paper • 2605.06747 • Published 28 days ago • 52

upvoted 2 papers 26 days ago

Video Generation with Predictive Latents

Paper • 2605.02134 • Published May 4 • 24

RLDX-1 Technical Report

Paper • 2605.03269 • Published about 1 month ago • 125

upvoted 3 papers about 1 month ago

upvoted an article about 2 months ago

Article

Nucleus-Image: Scaling Text-to-Image with Sparse Mixture of Experts

NucleusAI • Apr 14 • 11

upvoted a paper about 2 months ago

OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation

Paper • 2604.11804 • Published Apr 13 • 72

upvoted an article about 2 months ago

Article

`LeRobotDataset:v3.0`: Bringing large-scale datasets to `lerobot`

fracapuano, aractingi, lhoestq, CarolinePascal, pepijn223, jadechoghari, cadene, aliberts, AdilZtn, nepyope, imstevenpmwork • Sep 16, 2025 • 56

upvoted 3 papers 2 months ago

SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning

Paper • 2603.23483 • Published Mar 24 • 63

Manifold-Aware Exploration for Reinforcement Learning in Video Generation

Paper • 2603.21872 • Published Mar 23 • 34

Astrolabe: Steering Forward-Process Reinforcement Learning for Distilled Autoregressive Video Models

Paper • 2603.17051 • Published Mar 17 • 109

upvoted an article 3 months ago

Article

Introducing Modular Diffusers - Composable Building Blocks for Diffusion Pipelines

YiYiXu, OzzyGT, dn6, sayakpaul • Mar 5 • 51
