facebook/metaclip-2-worldwide-b16-384
Zero-Shot Image Classification
•
0.6B
•
Updated
•
33
•
2
None defined yet.
TV2TV: A Unified Framework for Interleaved Language and Video Generation
TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models