UnityVideo / README.md
JackAILab's picture
Update README.md
bf9ad2d verified

๐Ÿ“– Introduction

UnityVideo is a unified generalist framework for multi-task multi-modal video understanding that enables:

  • ๐ŸŽจ Text-to-Video Generation: Create high-quality videos from text descriptions
  • ๐ŸŽฎ Controllable Generation: Fine-grained control over video generation with various modalities
  • ๐Ÿ” Modality Estimation: Estimate depth, normal, and other modalities from video
  • ๐ŸŒŸ Zero-Shot Generalization: Strong generalization to novel objects and styles without additional training

Our unified architecture achieves state-of-the-art performance across multiple video generation benchmarks while maintaining efficiency and scalability.