inclusionAI/Ming-UniAudio-16B-A3B
Any-to-Any • 18B • Updated • 310 • 72
https://huggingface.co/papers/2501.03006
Convert images to structured documents and answer questions
Visualize rich, dense image features locally in your browser
Generate videos from start and end images with prompts
Nano Banana for Hugging Face PRO users
Generate podcast audio from a script and voice samples
Real-time video captioning powered by FastVLM